[AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.
-
Updated
Jul 9, 2024 - Python
[AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.
My experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves result…
This project builds a video classification model using CNNs for spatial feature extraction and RNNs for temporal sequence modeling. Utilizing the UCF101 dataset, it covers data preprocessing, feature extraction, model training, and evaluation, providing a comprehensive approach to action recognition in videos.
Implemented a CNN-LSTM Action Recognizer for dynamic motion analysis, integrating convolutional and recurrent neural networks to efficiently recognize and classify actions in video data of UCF101 dataset.
[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
Implementation of LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN
Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
End-to-End Semi-Supervised Learning for Video Action Detection [CVPR 2022]
Video Platform for Action Recognition and Object Detection in Pytorch
Salient Video Frames Sampling Method Using the Mean of Deep Features for Efficient Model Training (KIBME 2021)
Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Lightning framework.
Action recognition tutorial using UCF-101 dataset.
A simple and fun video classification/action recognition using VGG16 as a feature extractor and RNN.
Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.
This repository host the code for real-time action detection paper
Testing code for few-shot action recognition
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Video Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone
Simple Action Recognition experimentation with the UCF101 Dataset and EfficientNets.
Add a description, image, and links to the ucf101 topic page so that developers can more easily learn about it.
To associate your repository with the ucf101 topic, visit your repo's landing page and select "manage topics."