public int MaximumPagesBeforeLtd { get; set; }
An integer value that indicate the maximum number of pages to process before using LTD as a temporary format. Default value is 8.
The LEADTOOLS OCR Module - LEAD Engine handles this operation internally by using a file-based document and does not load more than one page in memory at a time and will not use the value of MaximumPagesBeforeLtd.
The IOcrAutoRecognizeManager interface also has the following options to use with the Run, RunJob and RunJobAsync methods:
| Option | Description |
|---|---|
| IOcrAutoRecognizeManager.MaximumPagesBeforeLtd |
Used to add support for converting a document with unlimited number of pages. An OCR recognition operation on a document that contains a large amount of pages (10 and more) might result in an out of memory error. All of the LEADTOOLS OCR engines supports saving the intermediate recognition results to a temporary LTD file (DocumentFormat.LTD). The result of subsequent pages will be appended to this temporary file. When all the pages of the document have been recognized, the engine will convert the temporary LTD file to the desired output format. The IOcrAutoRecognizeManager.MaximumPagesBeforeLtd property defines the maximum number of pages processed as a whole. For example, if the original document has 20 pages and the value of this property is 8, the engine will recognize the first 8 pages and saves the result to a temporary file, recognizes the second 8 pages and append the results, and finally, recognize the last 4 pages and convert the temporary document to the final format. |
| IOcrAutoRecognizeManager.PreprocessPageCommands |
Holds an array of OcrAutoPreprocessPageCommand items to control what auto-preprocess operation to perform on each page document prior to recognition. |
Note: This property is not used and will be ignored when using engine native format (DocumentFormat.User and IOcrDocumentManager.EngineFormat).
using Leadtools;using Leadtools.Codecs;using Leadtools.Ocr;using Leadtools.Document.Writer;using Leadtools.Forms.Common;using Leadtools.WinForms;public void OcrAutoRecognizeManagerExample(){Console.WriteLine("Preparing the source and destination directories...");string sourceDirectory = LEAD_VARS.ImagesDir;string destinationDirectory = Path.Combine(LEAD_VARS.ImagesDir, "AutoRecognizeManagerExample");// Prepare the output directoryif (!Directory.Exists(destinationDirectory)){Directory.CreateDirectory(destinationDirectory);}// OCR some images from the source directory into the destination directory:IList<string> imageFiles = new List<string>();for (int i = 1; i <= 4; i++){imageFiles.Add(Path.Combine(sourceDirectory, string.Format("Ocr{0}.tif", i)));}Console.WriteLine("Creating an instance of the engine...");// Create an instance of the engineusing (IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD)){// Start the engine using default parametersConsole.WriteLine("Starting up the engine...");ocrEngine.Startup(null, null, null, LEAD_VARS.OcrLEADRuntimeDir);IOcrAutoRecognizeManager ocrAutoRecognizeManager = ocrEngine.AutoRecognizeManager;// Use LTD as a temporary format if a document has more than 4 pages to save memoryocrAutoRecognizeManager.MaximumPagesBeforeLtd = 4;// Use maximum CPUs/cores of current machine to speed up recognition// Either passing 0 or System.Environment.ProcessorCountocrAutoRecognizeManager.MaximumThreadsPerJob = 0;// Deskew and auto-orient all pages before recognitionocrAutoRecognizeManager.PreprocessPageCommands.Clear();ocrAutoRecognizeManager.PreprocessPageCommands.Add(OcrAutoPreprocessPageCommand.Deskew);ocrAutoRecognizeManager.PreprocessPageCommands.Add(OcrAutoPreprocessPageCommand.Rotate);// Create PDFs with Image/Text optionPdfDocumentOptions pdfOptions = ocrEngine.DocumentWriterInstance.GetOptions(DocumentFormat.Pdf) as PdfDocumentOptions;pdfOptions.ImageOverText = true;ocrEngine.DocumentWriterInstance.SetOptions(DocumentFormat.Pdf, pdfOptions);// Loop through all the TIF files in the source directory, convert to PDF in the destination directoryforeach (string imageFile in imageFiles){// Construct the name of the document filestring documentFileName = Path.Combine(destinationDirectory, Path.GetFileNameWithoutExtension(imageFile));documentFileName = Path.ChangeExtension(documentFileName, "pdf");// OCR the fileConsole.WriteLine("Processing {0}", imageFile);ocrAutoRecognizeManager.Run(imageFile, documentFileName, DocumentFormat.Pdf, null, null);Console.WriteLine("Saved: {0}", documentFileName);}}}static class LEAD_VARS{public const string ImagesDir = @"C:\LEADTOOLS23\Resources\Images";public const string OcrLEADRuntimeDir = @"C:\LEADTOOLS23\Bin\Common\OcrLEADRuntime";}