typedef enum{DOC2_TEXT,DOC2_UTEXT,DOC2_FORMATTED_TEXT,DOC2_UFORMATTED_TEXT,DOC2_TEXT_LINEBREAKS,DOC2_UTEXT_LINEBREAKS,DOC2_TEXT_CSV,DOC2_TEXT_UCSV,DOC2_PDF,DOC2_PDF_IMAGE_SUBSTITUTES,DOC2_PDF_IMAGE_ON_TEXT,DOC2_PDF_EDITED,DOC2_XML,DOC2_HTML_3_2,DOC2_HTML_4_0,DOC2_RTF_6,DOC2_RTF_97,DOC2_RTF_2000,DOC2_RTF_WORD_2000,DOC2_WORD_2000,DOC2_WORD_97,DOC2_EXCEL_97,DOC2_EXCEL_2000,DOC2_PPT_97,DOC2_PUB_98,DOC2_MICROSOFT_READER,DOC2_WORDML,DOC2_WORDPERFECT_8,DOC2_WORDPERFECT_10,DOC2_WORDPAD,DOC2_INFOPATH,DOC2_EBOOK,DOC2_PDFA_IMAGE_ON_TEXT,DOC2_PDFA_TEXT_ONLY,DOC2_WORD_2007,DOC2_EXCEL_2007,} DOC2_FORMATTYPE;
The DOC2_FORMATTYPE enumerated type lists the document format types that are possible.
| Value | Meaning |
|---|---|
| DOC2_TEXT | Simple text output format |
| DOC2_UTEXT | Unicode text output format |
| DOC2_FORMATTED_TEXT | Retain the layout of the page by inserting extra spaces |
| DOC2_UFORMATTED_TEXT | Same as Formatted Text, but using Unicode characters |
| DOC2_TEXT_LINEBREAKS | Insert line breaks at the end of lines instead of only inserting them at the end of the paragraphs |
| DOC2_UTEXT_LINEBREAKS | Same as text with line breaks, but using Unicode characters. |
| DOC2_TEXT_CSV | Write the recognized text as a table (Comma delimited text file) that can be read by Excel |
| DOC2_TEXT_UCSV | Same as Text CSV, but using Unicode characters |
| DOC2_PDF | Adobe PDF file. Text only. |
| DOC2_PDF_IMAGE_SUBSTITUTES | Adobe PDF file with image substitutes |
| DOC2_PDF_IMAGE_ON_TEXT | Adobe PDF with image on text |
| DOC2_PDF_EDITED | Adobe PDF edited |
| DOC2_XML | XML output format |
| DOC2_HTML_3_2 | HTML 3.2 output format |
| DOC2_HTML_4_0 | HTML 4.0 output format |
| DOC2_RTF_6 | RTF 6 |
| DOC2_RTF_97 | RTF that can only be interpreted by Microsoft Word 97 and up |
| DOC2_RTF_2000 | RTF that can only be interpreted by Microsoft Word 2000 and up |
| DOC2_RTF_WORD_2000 | RTF/ Word file that can only be interpreted by Microsoft Word 2000 and up |
| DOC2_WORD_2000 | Word file that can only be interpreted by Microsoft Word 2000 and up |
| DOC2_WORD_97 | Word file that can only be interpreted by Microsoft Word 97 and up |
| DOC2_EXCEL_97 | Microsoft Excel 97 binary file |
| DOC2_EXCEL_2000 | Microsoft Excel 2000 binary file |
| DOC2_PPT_97 | Microsoft Power Point 97 |
| DOC2_PUB_98 | Microsoft Publisher 98 |
| DOC2_MICROSOFT_READER | Microsoft Reader convertor |
| DOC2_WORDML | Word ML convertor |
| DOC2_WORDPERFECT_8 | WordPerfect 8 convertor |
| DOC2_WORDPERFECT_10 | WordPerfect 10 convertor |
| DOC2_WORDPAD | Word Pad convertor |
| DOC2_INFOPATH | Info Path convertor |
| DOC2_EBOOK | eBook convertor |
| DOC2_PDFA_IMAGE_ON_TEXT | PDF/A Image on text. |
| DOC2_PDFA_TEXT_ONLY | PDF/A Text only. |
| DOC2_WORD_2007 | Microsoft Word Document Format (DOCX) (this format requires .NET Framework 3.0 and Microsoft Open XML Format SDK 1.0.). |
| DOC2_EXCEL_2007 | Microsoft Excel Spreadsheet Format (XLSX). |
The enumeration is used by: