DOC2_FORMATTYPE

typedef enum 
{ 
   DOC2_TEXT, 
   DOC2_UTEXT, 
   DOC2_FORMATTED_TEXT, 
   DOC2_UFORMATTED_TEXT, 
   DOC2_TEXT_LINEBREAKS, 
   DOC2_UTEXT_LINEBREAKS, 
   DOC2_TEXT_CSV, 
   DOC2_TEXT_UCSV, 
   DOC2_PDF, 
   DOC2_PDF_IMAGE_SUBSTITUTES, 
   DOC2_PDF_IMAGE_ON_TEXT, 
   DOC2_PDF_EDITED, 
   DOC2_XML, 
   DOC2_HTML_3_2, 
   DOC2_HTML_4_0, 
   DOC2_RTF_6, 
   DOC2_RTF_97, 
   DOC2_RTF_2000, 
   DOC2_RTF_WORD_2000, 
   DOC2_WORD_2000, 
   DOC2_WORD_97, 
   DOC2_EXCEL_97, 
   DOC2_EXCEL_2000, 
   DOC2_PPT_97, 
   DOC2_PUB_98, 
   DOC2_MICROSOFT_READER, 
   DOC2_WORDML, 
   DOC2_WORDPERFECT_8, 
   DOC2_WORDPERFECT_10, 
   DOC2_WORDPAD, 
   DOC2_INFOPATH, 
   DOC2_EBOOK, 
   DOC2_PDFA_IMAGE_ON_TEXT, 
   DOC2_PDFA_TEXT_ONLY, 
   DOC2_WORD_2007, 
   DOC2_EXCEL_2007, 
} DOC2_FORMATTYPE; 

The DOC2_FORMATTYPE enumerated type lists the document format types that are possible.

Value Meaning
DOC2_TEXT Simple text output format
DOC2_UTEXT Unicode text output format
DOC2_FORMATTED_TEXT Retain the layout of the page by inserting extra spaces
DOC2_UFORMATTED_TEXT Same as Formatted Text, but using Unicode characters
DOC2_TEXT_LINEBREAKS Insert line breaks at the end of lines instead of only inserting them at the end of the paragraphs
DOC2_UTEXT_LINEBREAKS Same as text with line breaks, but using Unicode characters.
DOC2_TEXT_CSV Write the recognized text as a table (Comma delimited text file) that can be read by Excel
DOC2_TEXT_UCSV Same as Text CSV, but using Unicode characters
DOC2_PDF Adobe PDF file. Text only.
DOC2_PDF_IMAGE_SUBSTITUTES Adobe PDF file with image substitutes
DOC2_PDF_IMAGE_ON_TEXT Adobe PDF with image on text
DOC2_PDF_EDITED Adobe PDF edited
DOC2_XML XML output format
DOC2_HTML_3_2 HTML 3.2 output format
DOC2_HTML_4_0 HTML 4.0 output format
DOC2_RTF_6 RTF 6
DOC2_RTF_97 RTF that can only be interpreted by Microsoft Word 97 and up
DOC2_RTF_2000 RTF that can only be interpreted by Microsoft Word 2000 and up
DOC2_RTF_WORD_2000 RTF/ Word file that can only be interpreted by Microsoft Word 2000 and up
DOC2_WORD_2000 Word file that can only be interpreted by Microsoft Word 2000 and up
DOC2_WORD_97 Word file that can only be interpreted by Microsoft Word 97 and up
DOC2_EXCEL_97 Microsoft Excel 97 binary file
DOC2_EXCEL_2000 Microsoft Excel 2000 binary file
DOC2_PPT_97 Microsoft Power Point 97
DOC2_PUB_98 Microsoft Publisher 98
DOC2_MICROSOFT_READER Microsoft Reader convertor
DOC2_WORDML Word ML convertor
DOC2_WORDPERFECT_8 WordPerfect 8 convertor
DOC2_WORDPERFECT_10 WordPerfect 10 convertor
DOC2_WORDPAD Word Pad convertor
DOC2_INFOPATH Info Path convertor
DOC2_EBOOK eBook convertor
DOC2_PDFA_IMAGE_ON_TEXT PDF/A Image on text.
DOC2_PDFA_TEXT_ONLY PDF/A Text only.
DOC2_WORD_2007 Microsoft Word Document Format (DOCX) (this format requires .NET Framework 3.0 and Microsoft Open XML Format SDK 1.0.).
DOC2_EXCEL_2007 Microsoft Excel Spreadsheet Format (XLSX).

Comments

The enumeration is used by:

Help Version 20.0.2020.4.2
Products | Support | Contact Us | Intellectual Property Notices
© 1991-2020 LEAD Technologies, Inc. All Rights Reserved.

LEADTOOLS OCR Module - OmniPage Engine C API Help