This topic describes how to OCR ID document images using LEADTOOLS OCR Module - LEAD Engine to produce the best recognition results. ID documents, such as passports, may have textured background. This texture is highly likely to interfere with segmented text for OCR, leading to undesirable results.
One of the best approaches to handle color ID document images for OCR is to convert them to black and white using the thresholding method with a low intensity threshold value. The text color is very dark in comparison to the textured background. The low intensity threshold value is empirically estimated by examining the distribution of text colors and background colors on these images, then by finding the intersection point between their normal distributions. To perform this thresholding method using LEADTOOLS OCR Module - LEAD Engine demo follow these steps:
Run LEADTOOLS OCR Module - LEAD Engine demo and in the menu bar go to Engine > Settings > Recognition > Preprocess settings > black and white conversion method and set its value to user.
Go to Engine > Settings > Recognition > Preprocess settings > black and white conversion threshold and set a low threshold value, for example, 110.
Then open an ID document image (File > Open).
And finally, perform the OCR task.
The steps outlined in this tutorial optimize the handling of color ID document images for OCR by converting them to black and white in order to generate the best OCR results. It is recommended when processing images with LEADTOOLS OCR Module - LEAD Engine to use OCR zone with type “FieldData” to recognize the person’s information regions in the ID document images. This zone type is limited to all English capital letters, digits, and - , . /
symbols.
Help Collections
Raster .NET | C API | C++ Class Library | HTML5 JavaScript
Document .NET | C API | C++ Class Library | HTML5 JavaScript
Medical .NET | C API | C++ Class Library | HTML5 JavaScript
Medical Web Viewer .NET
Multimedia
Direct Show .NET | C API | Filters
Media Foundation .NET | C API | Transforms
Supported Platforms
.NET, Java, Android, and iOS/macOS Assemblies
Imaging, Medical, and Document
C API/C++ Class Libraries
Imaging, Medical, and Document
HTML5 JavaScript Libraries
Imaging, Medical, and Document