Multilanguage ID document images synthesis for testing recognition pipelines

Yulia S. Chernyshova; Konstantin K. Suloev; Vladimir V. Arlazarov

doi:10.1117/12.3023179

3 April 2024 Multilanguage ID document images synthesis for testing recognition pipelines

Yulia S. Chernyshova, Konstantin K. Suloev, Vladimir V. Arlazarov

Proceedings Volume 13072, Sixteenth International Conference on Machine Vision (ICMV 2023); 130720E (2024) https://doi.org/10.1117/12.3023179
Event: Sixteenth International Conference on Machine Vision (ICMV 2023), 2023, Yerevan, Armenia

Abstract

Datasets are de facto the only way to test the recognition pipelines and to compare them with each other. To avoid the manual gathering of documents and, moreover, to avoid problems with the law in the case of ID documents researchers create synthetic datasets or datasets of fake documents, but this process is also time-consuming. In this paper, we present a simple method to use when you need to test a recognition pipeline or some part of it. The method employs only the information that the developers of such pipelines use in their work and allows them to create natural-looking images. The quantitative experiments show that the recognition accuracy of the synthesized images corresponds with the recognition accuracy of the MIDV-2020 dataset. The qualitative comparison also demonstrates that such images can be helpful in recognition systems’ development.

(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.

Citation Download Citation

Yulia S. Chernyshova, Konstantin K. Suloev, and Vladimir V. Arlazarov "Multilanguage ID document images synthesis for testing recognition pipelines", Proc. SPIE 13072, Sixteenth International Conference on Machine Vision (ICMV 2023), 130720E (3 April 2024); https://doi.org/10.1117/12.3023179

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
7 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Neural networks

Optical character recognition

Video

Scientific research

Statistical modeling

RELATED CONTENT

Research on AI assisted grading of math questions based on...
Proceedings of SPIE (April 22 2022)

Comparison of the scanned pages of the contractual documents
Proceedings of SPIE (April 13 2018)

Methods of weighted combination for text field recognition in a...
Proceedings of SPIE (January 31 2020)

Experimental modeling the flow of character recognition results in video...
Proceedings of SPIE (March 15 2019)

Semi-automatic ground truth generation for license plate recognition system
Proceedings of SPIE (September 24 2011)

Skewness correction in automatic license plate recognition
Proceedings of SPIE (March 01 2005)

Content-based retrieval in multimedia imaging
Proceedings of SPIE (April 14 1993)

Subscribe to Digital Library

Receive Erratum Email Alert

Keywords/Phrases

Search In:

Publication Years