Paper
1 October 1998 Methods for objective evaluation and improvement of text document images
Valery S. Kot, Alexander V. Bondarenko
Author Affiliations +
Abstract
Any optical character recognition (OCR) system contains preprocessing unit responsible for image binarization, and the entire recognition rate depends dramatically from the accuracy of this unit. In case of poor image quality user must spend much time to find out the best parameters of this unit while recognition rate may still remain unsatisfactory. Thus methods intended for objective evaluation and context- sensitive improvement of text document images are required. In this parameters set is proposed as a tool for integral image description. This compact set allows to select automatically or semiautomatically the optimal image processing sequence from the basic IP functions. For all tested commercial OCR systems, the proposed methods result in recognition errors decreasing about 50 - 60% for text document images of average and poor quality while requiring less than 1 minute per page of additional processing time.
© (1998) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Valery S. Kot and Alexander V. Bondarenko "Methods for objective evaluation and improvement of text document images", Proc. SPIE 3460, Applications of Digital Image Processing XXI, (1 October 1998); https://doi.org/10.1117/12.323158
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image processing

Optical character recognition

Image quality

Control systems

Databases

Visualization

Binary data

Back to Top