Paper
26 June 2023 Video summarization generation with self-attention and random forest regression
Ziyan Wang
Author Affiliations +
Abstract
Video summarization is used widely in the field of fast browsing and retrieval of videos by generating keyframes or segments to achieve video compression. Existing methods mostly explore based on image content, ignoring the temporal characteristics of videos, resulting in summaries lacking temporal coherence and representativeness. We propose a video summarization network based on an encoder-decoder framework. Specifically, the encoder part extracts features using a convolutional neural network and enhances the weight of key features through self-attention mechanism. The decoder part consists of a bidirectional long short-term memory network fused with a random forest, and adjusts the proportion of the random forest and bidirectional long short-term memory network in the loss function to make the model more stable and accurate in prediction. Experiments were conducted on two datasets and compared with seven other methods, and the comprehensive experimental results prove the effectiveness and feasibility of the proposed method.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Ziyan Wang "Video summarization generation with self-attention and random forest regression", Proc. SPIE 12721, Second International Symposium on Computer Applications and Information Systems (ISCAIS 2023), 127211F (26 June 2023); https://doi.org/10.1117/12.2683282
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Random forests

Video coding

Data modeling

Image segmentation

Video processing

Video compression

RELATED CONTENT


Back to Top