Supported Languages and Language Dictionaries

Languages can be recognized if the OCR engine supports that language's character set. LEADTOOLS recognizes dozens of languages, enumerated in DOC2_LANGIDS.

In addition, many recognized languages also have spelling dictionaries, enabling spell-checking after character recognition is complete. Languages for which spell-checking is supported are listed below, along with their associated dictionary file.

LEADTOOLS OCR Module - OmniPage Engine

All languages to be recognized can be selected using L_Doc2SelectLanguages / L_Doc2SelectLanguagesExt.

The language dictionary to use for spell-checking should be specified in the SpellLangId member of the pRecogOpts parameter passed to the L_Doc2Recognize / L_Doc2RecognizeExt function.

For more information on options for the recognition process, refer to RECOGNIZEOPTS2.

SPELL-CHECK LANGUAGE LANGUAGE DICTIONARY FILE
Catalan R_CAT.DAT
Czech R_CZH.DAT
Danish R_DAN.DAT
Dutch R_DUT.DAT
English R_ENG.DAT
Finnish R_FIN.DAT
French R_FRE.DAT
German R_GER.DAT
Greek R_GRE.DAT
Hungarian R_HUN.DAT
Italian R_ITA.DAT
Norwegian R_NOR.DAT
Polish R_POL.DAT
Portuguese R_POR.DAT
Russian R_RUS.DAT
Slovenian R_SLN.DMD
Spanish R_SPA.DAT
Swedish R_SWE.DAT

✎ NOTE

Also requires the file ICHUNW32.DLL.

For more information, refer to:

Working with Languages

Working with Dictionaries

DOC2_LANGIDS

Punctuation and Miscellaneous Characters

Help Version 21.0.2021.4.7
Products | Support | Contact Us | Intellectual Property Notices
© 1991-2021 LEAD Technologies, Inc. All Rights Reserved.

LEADTOOLS OCR Module - OmniPage Engine C API Help