Action recognition model based on attention mechanism and residual network
DOI:
CSTR:
Author:
Affiliation:

School of computer science, Southwest Petroleum University, Chengdu 610599, China

Clc Number:

TP391.4

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    The breakthrough of deep learning in the field of image makes the rapid development of feature learning. Aiming at the temporal correlation of consecutive frames in video sequences, a residual 3D convolutional network model based on attention mechanism is proposed for human action recognition. Firstly, residual 3D convolution network is used to learn the temporal correlation between consecutive video frames in video sequence. Then, each feature channel learned by residual 3D convolution structure is given different weights by using channel attention network which is extended to three-dimensional. Finally, the reweighted features are input into the classifier to get the final classification. Experiments are carried out on UCF-101 and HMDB-51 datasets, and the accuracy is 95.8% and 69.7%, respectively. The experimental results show that the proposed model has high recognition accuracy in video human action recognition.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:
  • Revised:
  • Adopted:
  • Online: September 05,2024
  • Published:
Article QR Code