Paper
6 May 2022 Attention based fully convolutional network for video summarization
Cong Li, Shuangxiong Wei, Yuxuan Liu, Siyi Luo, Di Yang, Zengkai Wang
Author Affiliations +
Proceedings Volume 12176, International Conference on Algorithms, Microchips and Network Applications; 121761Q (2022) https://doi.org/10.1117/12.2636379
Event: International Conference on Algorithms, Microchips, and Network Applications 2022, 2022, Zhuhai, China
Abstract
Video summarization technology extracts key frames or video clips that can most effectively express video content from the original long video, so that the summarized video clips contain the content information that users are most interested in, which is convenient for users to quickly browse and retrieve the video. In this paper, we regard video summarization as a sequence annotation problem. Different from the existing methods using recurrent models, this paper proposes a fully convolutional neural network model combining spatial attention mechanism to solve the video summarization problem. Firstly, the pre-training model is used to extract the cube features of the input video frame, then the cube features are aggregated into the attention vector by the attention mechanism, which is input into the fully convolutional neural network model for binary classification, and then the video summary is generated. Extensive experiments and analysis on two benchmark datasets demonstrate the effectiveness of the proposed method.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Cong Li, Shuangxiong Wei, Yuxuan Liu, Siyi Luo, Di Yang, and Zengkai Wang "Attention based fully convolutional network for video summarization", Proc. SPIE 12176, International Conference on Algorithms, Microchips and Network Applications, 121761Q (6 May 2022); https://doi.org/10.1117/12.2636379
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Video surveillance

Convolution

Semantic video

Image segmentation

Binary data

Convolutional neural networks

RELATED CONTENT

ShiftMask: video behavior recognition data augmentation
Proceedings of SPIE (January 12 2023)
State of the art video recognition of human action under...
Proceedings of SPIE (December 28 2022)
A practical time series action detection algorithm
Proceedings of SPIE (November 02 2022)
Optimization of the OSVOS model
Proceedings of SPIE (December 31 2019)

Back to Top