Recognizing Document Pages

You can call the L_OcrSpellCheckManager_SetSpellCheckEngine function before starting the recognition process.

The spell check engine setting enables or disables the checking subsystem that is used during verification. When all necessary recognition options have been set, the page(s) can be recognized by calling L_OcrPage_Recognize. After recognition is complete, the recognized characters can be obtained and the recognition results can be saved to a file.

The collection of characters recognized for a specific page can be obtained using L_OcrPage_GetRecognizedCharacters. You can inspect this collection, modify it, and then update the recognition data with the L_OcrPage_SetRecognizedCharacters function.
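The inspect-and-modify step can be sketched in isolation. The following is a minimal, self-contained illustration and does not call LEADTOOLS: the DemoChar structure, its two fields, the 0 to 100 confidence scale, and the demo_mark_uncertain helper are all hypothetical stand-ins for whatever L_OcrPage_GetRecognizedCharacters actually returns. It shows the kind of pass an application might run over the collection before handing it back with L_OcrPage_SetRecognizedCharacters.

```c
#include <stddef.h>
#include <wchar.h>

/* Hypothetical stand-in for one recognized character. The real LEADTOOLS
   character structure defines its own fields; these two are assumptions. */
typedef struct {
    wchar_t code;       /* recognized character code */
    int     confidence; /* assumed confidence score, 0..100 */
} DemoChar;

/* Replace every character whose confidence is below minConfidence with
   the given placeholder. Returns the number of characters changed. */
size_t demo_mark_uncertain(DemoChar *chars, size_t count,
                           int minConfidence, wchar_t placeholder)
{
    size_t changed = 0;

    if (chars == NULL)
        return 0;

    for (size_t i = 0; i < count; i++) {
        if (chars[i].confidence < minConfidence) {
            chars[i].code = placeholder;
            changed++;
        }
    }
    return changed;
}
```

In a real application the array would come from the page's recognized characters and would be written back after the pass, so that later save or text retrieval calls see the modified data.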

The recognition results can be saved to a file by first creating an OCR document using L_OcrDocumentManager_CreateDocument, adding the page to it with L_OcrDocument_AddPage, and then calling L_OcrDocument_Save. L_OcrDocument_Save takes a DOCWRTFORMAT parameter that specifies the output document format (PDF, DOC, TXT, etc.). LEADTOOLS Advantage OCR uses the Document Writer library to save the OCR results to an output file.

In addition to the various formats supported by the document writers, the recognition results can also be saved as XML using L_OcrDocument_SaveXml.

The recognition results can also be retrieved directly as a plain C character string by calling the L_OcrPage_GetText function.
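The order of operations described on this page (recognize, then retrieve text or add the page to a document, then save) can be modeled with a small mock. This sketch does not use the LEADTOOLS headers: every demo_* name, both structures, and the return codes are hypothetical, and the rule that an unrecognized page is rejected is an assumption made for illustration, not documented LEADTOOLS behavior. The comments note which real function each stub loosely stands in for; the real signatures are different.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical mock types; not LEADTOOLS handles. */
typedef struct {
    int  recognized;   /* nonzero once recognition has run */
    char text[64];     /* mock recognition result */
} DemoPage;

typedef struct {
    const DemoPage *pages[4];
    int pageCount;
} DemoDocument;

/* Stands in for L_OcrPage_Recognize: stores a fake result on the page. */
int demo_recognize(DemoPage *page, const char *fakeResult)
{
    if (page == NULL || fakeResult == NULL)
        return -1;
    snprintf(page->text, sizeof page->text, "%s", fakeResult);
    page->recognized = 1;
    return 0;
}

/* Stands in for L_OcrPage_GetText: NULL until the page is recognized. */
const char *demo_get_text(const DemoPage *page)
{
    return (page != NULL && page->recognized) ? page->text : NULL;
}

/* Stands in for L_OcrDocument_AddPage: this mock rejects pages that
   have not been recognized (an assumption for illustration). */
int demo_add_page(DemoDocument *doc, const DemoPage *page)
{
    if (doc == NULL || page == NULL || !page->recognized || doc->pageCount >= 4)
        return -1;
    doc->pages[doc->pageCount++] = page;
    return 0;
}

/* Stands in for L_OcrDocument_Save: writes one line per page into a
   caller-supplied buffer instead of a file. */
int demo_save(const DemoDocument *doc, char *out, size_t outSize)
{
    size_t used = 0;

    if (doc == NULL || out == NULL || outSize == 0)
        return -1;
    out[0] = '\0';
    for (int i = 0; i < doc->pageCount; i++) {
        int n = snprintf(out + used, outSize - used, "%s\n", doc->pages[i]->text);
        if (n < 0 || (size_t)n >= outSize - used)
            return -1;   /* buffer too small */
        used += (size_t)n;
    }
    return 0;
}
```

A caller would run demo_recognize first, then either read the result with demo_get_text or pass the page to demo_add_page followed by demo_save, mirroring the sequence described above.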

Help Version 19.0.2017.10.27
LEADTOOLS Advantage OCR C API Help