Introduction to Video Action Recognition - Code World

Enterprise 2023-04-08 14:07:11

1. Classification of Video Temporal Action Recognition Algorithms

Based on how the network operates, video temporal action recognition algorithms can be roughly divided into four categories:

Methods using 2D convolution
Methods using 3D convolution
Two-stream methods
Methods introducing VLAD

1.1 Methods using 2D convolution

"TSM: Temporal Shift Module for Efficient Video Understanding": detailed explanation of the algorithm
"TEA: Temporal Excitation and Aggregation for Action Recognition": detailed explanation of the algorithm
"TDN: Temporal Difference Networks for Efficient Action Recognition": detailed explanation of the paper
"No frame left behind: Full Video Action Recognition": detailed explanation of the algorithm

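As its title suggests, TSM works by shifting part of the feature channels along the time axis, so that an ordinary 2D convolution applied per frame also sees information from neighboring frames. The following is a minimal pure-Python sketch of that shift, not the paper's reference implementation. The function name `temporal_shift`, the `clip[t][c]` layout, and the value `fold_div=8` are assumptions made for illustration, and each list entry stands in for a whole 2D feature map.

```python
def temporal_shift(clip, fold_div=8):
    """Shift a fraction of the channels along the time axis.

    clip[t][c] holds the (stand-in) feature map of channel c at frame t.
    The first 1/fold_div of the channels is filled from the next frame,
    the second 1/fold_div from the previous frame, and the rest is copied
    unchanged. Positions with no source frame are zero-filled.
    """
    num_frames = len(clip)
    num_channels = len(clip[0])
    fold = num_channels // fold_div
    out = [[0] * num_channels for _ in range(num_frames)]
    for t in range(num_frames):
        for c in range(num_channels):
            if c < fold:
                src = t + 1      # take this channel from the next frame
            elif c < 2 * fold:
                src = t - 1      # take this channel from the previous frame
            else:
                src = t          # leave this channel in place
            if 0 <= src < num_frames:
                out[t][c] = clip[src][c]
    return out


# Toy clip: 3 frames, 8 channels, value 10 * (t + 1) + c marks frame and channel.
clip = [[10 * (t + 1) + c for c in range(8)] for t in range(3)]
shifted = temporal_shift(clip)
print(shifted[0])
```

The shift itself adds no parameters or multiplications, which is why this family of methods can stay close to the cost of a 2D network.
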
1.2 Methods using 3D convolution

1.3 Two-stream methods

1.4 Methods introducing VLAD

"ActionVLAD": detailed explanation of the algorithm

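VLAD (Vector of Locally Aggregated Descriptors) summarizes a set of local descriptors by accumulating, for each cluster center, the residuals of the descriptors assigned to it. The sketch below shows the classic hard-assignment form in pure Python. It is a simplification for illustration: to my understanding ActionVLAD uses a learnable, soft-assignment variant trained end to end, which this sketch does not reproduce. The function name `vlad` and the toy data are hypothetical.

```python
import math


def vlad(descriptors, centers):
    """Aggregate local descriptors into one L2-normalized VLAD vector."""
    dim = len(centers[0])
    residuals = [[0.0] * dim for _ in centers]
    for x in descriptors:
        # Hard assignment: pick the nearest center by squared Euclidean distance.
        k = min(
            range(len(centers)),
            key=lambda j: sum((xi - ci) ** 2 for xi, ci in zip(x, centers[j])),
        )
        # Accumulate the residual between the descriptor and its center.
        for d in range(dim):
            residuals[k][d] += x[d] - centers[k][d]
    # Concatenate the per-center residual sums and L2-normalize the result.
    flat = [v for row in residuals for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


# Toy example: two centers in 2D and three local descriptors.
centers = [[0.0, 0.0], [10.0, 10.0]]
descriptors = [[1.0, 0.0], [0.0, 1.0], [10.0, 12.0]]
video_vector = vlad(descriptors, centers)
print(len(video_vector))
```

The output length is the number of centers times the descriptor dimension, regardless of how many descriptors (for example, frames or spatial locations) were aggregated.
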
2. Introduction to Common Datasets

Introduction to the Sports-1M dataset:
	* About 1.1 million sports videos
	* 487 video classes

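Introduction to the UCF101 dataset:
	* 13,320 video clips
	* About 9.5K training videos and 3.7K test videos
	* Frame size of 320×240
	* 101 classes in total, covering five broad groups such as applying makeup and brushing teeth, crawling, haircuts, playing musical instruments, and sports
	* Each action class is performed by 25 people, with 4 to 7 groups of clips per person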

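Introduction to the ActivityNet dataset

* A human action recognition dataset
* Version v1.3 contains 19,994 videos across 200 classes
* 10,024 videos form the training set, 4,926 the validation set, and 5,044 the test set
* The test set labels are not public, so the validation set is generally used as the test set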
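
As a quick sanity check, the three ActivityNet v1.3 split sizes listed above should add up to the stated total of 19,994 videos. The short snippet below verifies this and reports each split's share; the variable names are illustrative only.

```python
# Split sizes for ActivityNet v1.3 as listed in this article.
splits = {"train": 10024, "val": 4926, "test": 5044}

total = sum(splits.values())
fractions = {name: round(count / total, 3) for name, count in splits.items()}

print(total)
print(fractions)
```

The split is roughly 2:1:1, so evaluating on the validation set, as is common practice here, still uses about a quarter of the data.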

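Introduction to the HMDB51 dataset
	* 6,766 videos
	* 51 action classes
	* Content falls into several broad groups: facial actions, body movements, and interactions with objects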

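Introduction to the Kinetics-400 dataset

* 240k training videos, 20k validation videos, and 35k test videos
* 400 human action classes
* Content includes drawing, laughing, hugging, weeding, and so on
* Each video is about 10 seconds long
* The data comes from YouTube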

Introduction to the Kinetics-600 dataset

* An extension of the Kinetics-400 dataset
* 600 human action classes
* About 500k videos in total

Introduction to the Charades dataset

* 9,848 videos
* 157 classes of everyday indoor activities
* Multi-label
* Each video is about 30 seconds long
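
Because Charades is multi-label, a single video can carry several of the 157 classes at once, so its target is usually a multi-hot vector instead of a single class index. A minimal sketch follows, assuming integer class ids from 0 to 156; the helper `multi_hot` and the example ids are hypothetical and not taken from the dataset's annotation files.

```python
NUM_CLASSES = 157  # number of Charades classes stated above


def multi_hot(label_ids, num_classes=NUM_CLASSES):
    """Turn a list of class ids into a 0/1 target vector."""
    target = [0] * num_classes
    for class_id in label_ids:
        if not 0 <= class_id < num_classes:
            raise ValueError(f"class id {class_id} is out of range")
        target[class_id] = 1
    return target


# Hypothetical video annotated with three activities.
target = multi_hot([3, 59, 156])
print(sum(target))
```

With targets of this form, a model is typically trained with one independent binary decision per class, in contrast to the single-label datasets above.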

For introductions to other video tasks, see the article "Introduction to Mainstream Video Action Algorithm Tasks".

Origin blog.csdn.net/liuxiaoheng1992/article/details/126678605
