Paper
29 October 2018 Silhouettes based human action recognition by Procrustes analysis and Fisher vector encoding
Jiaxin Cai, Xin Tang, Ranxu Zhong
Author Affiliations +
Proceedings Volume 10836, 2018 International Conference on Image and Video Processing, and Artificial Intelligence; 1083612 (2018) https://doi.org/10.1117/12.2506632
Event: 2018 International Conference on Image, Video Processing and Artificial Intelligence, 2018, Shanghai, China
Abstract
Recently, human action recognition in videos has attracted much attention. This paper proposed a framework for human action recognition based on procrustes analysis and Fisher vector encoding. First, we apply a pose based feature extracted from silhouette image by employing Procrustes analysis and local preserving projection. It can preserve the discriminative shape information and local manifold structure of human pose and is invariant to translation, rotation and scaling. After the pose feature is extracted, a recognition framework based on Fisher vector encoding and multi-class supporting vector machine is employed for classifying the human action. Experimental results on benchmarks demonstrate the effectiveness of the proposed method.
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jiaxin Cai, Xin Tang, and Ranxu Zhong "Silhouettes based human action recognition by Procrustes analysis and Fisher vector encoding", Proc. SPIE 10836, 2018 International Conference on Image and Video Processing, and Artificial Intelligence, 1083612 (29 October 2018); https://doi.org/10.1117/12.2506632
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication and 2 patents.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Computer programming

Feature extraction

Video coding

Image segmentation

Analytical research

Shape analysis

RELATED CONTENT

Curvature analysis approach to shape coding using B-splines
Proceedings of SPIE (December 29 2000)
WCAM: smart encoding for wireless surveillance
Proceedings of SPIE (March 14 2005)
MPEG-7-based video annotation and browsing
Proceedings of SPIE (November 26 2003)
Content-based classification and retrieval of audio
Proceedings of SPIE (October 02 1998)

Back to Top