Paper
Video-text cross-modal retrieval algorithm based on multiple coding
22 February 2023
Yufan Xu
Proceedings Volume 12587, Third International Seminar on Artificial Intelligence, Networking, and Information Technology (AINIT 2022); 125871L (2023) https://doi.org/10.1117/12.2667669
Event: Third International Seminar on Artificial Intelligence, Networking, and Information Technology (AINIT 2022), 2022, Shanghai, China
Abstract
Users today have access to ever more video data and to ever more terminal devices for consuming it. Video platforms such as TikTok and YouTube continue to grow, with user bases and video libraries expanding daily, which creates an urgent practical demand for cross-modal video-text retrieval. This paper proposes a video-text cross-modal retrieval algorithm based on multiple encoding. The global, sequential, and local features of both video and text are encoded, and the encoded features are mapped into a common embedding space for training, loss computation, and optimization. In experiments on the MSR-VTT dataset, the method improves overall performance (R@sum) by 9.22% and 2.86% over existing methods, demonstrating its superiority.
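The abstract's pipeline can be illustrated with a minimal sketch, not the paper's actual architecture: global (mean-pooled), sequential (bidirectional GRU), and local (1-D convolution) features are fused and projected into a common embedding space, with a triplet ranking loss over in-batch negatives. All layer sizes, module names, and the specific loss formulation here are illustrative assumptions.

```python
# Illustrative sketch of multi-level encoding for video-text retrieval.
# Layer choices (biGRU, 1-D conv, linear projection) are assumptions,
# not the paper's exact design.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiLevelEncoder(nn.Module):
    """Encodes a feature sequence (video frames or text tokens) at three levels."""
    def __init__(self, feat_dim, hidden_dim, embed_dim):
        super().__init__()
        self.gru = nn.GRU(feat_dim, hidden_dim, batch_first=True,
                          bidirectional=True)
        self.conv = nn.Conv1d(2 * hidden_dim, hidden_dim,
                              kernel_size=3, padding=1)
        # Concatenate global + sequential + local features, then project
        # into the common embedding space shared by video and text.
        fused_dim = feat_dim + 2 * hidden_dim + hidden_dim
        self.proj = nn.Linear(fused_dim, embed_dim)

    def forward(self, x):                       # x: (batch, seq_len, feat_dim)
        g = x.mean(dim=1)                       # global: mean pooling
        h, _ = self.gru(x)                      # sequential: biGRU states
        s = h.mean(dim=1)
        l = F.relu(self.conv(h.transpose(1, 2))).max(dim=2).values  # local
        # L2-normalize so dot products are cosine similarities.
        return F.normalize(self.proj(torch.cat([g, s, l], dim=1)), dim=1)

def triplet_ranking_loss(vid, txt, margin=0.2):
    """Hinge-based ranking loss using the hardest in-batch negatives."""
    sim = vid @ txt.t()                         # (batch, batch) similarities
    pos = sim.diag().unsqueeze(1)               # matched video-text pairs
    cost_txt = (margin + sim - pos).clamp(min=0)      # video -> text negatives
    cost_vid = (margin + sim - pos.t()).clamp(min=0)  # text -> video negatives
    mask = torch.eye(sim.size(0), dtype=torch.bool)
    cost_txt[mask] = 0
    cost_vid[mask] = 0
    return (cost_txt.max(dim=1).values.mean()
            + cost_vid.max(dim=0).values.mean())
```

In use, one encoder processes video frame features and another processes text token features; both land in the same embedding space, so retrieval reduces to ranking by cosine similarity.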
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yufan Xu "Video-text cross-modal retrieval algorithm based on multiple coding", Proc. SPIE 12587, Third International Seminar on Artificial Intelligence, Networking, and Information Technology (AINIT 2022), 125871L (22 February 2023); https://doi.org/10.1117/12.2667669
KEYWORDS
Video, Video coding, Semantic video, Video processing, Education and training, Semantics, Feature extraction