OCR SDK Technology

LEADTOOLS provides fast and highly accurate Optical Character Recognition SDK technology for .NET (C# & VB), C/C++, iOS, macOS, Linux, Java, and web developers. Leverage the high-level LEADTOOLS OCR toolkit to rapidly develop robust, scalable, and high-performance recognition and document processing applications that extract text from scanned documents and convert images to text-searchable formats such as PDF, PDF/A, DOC, DOCX, XML, and XPS.

The advanced OCR SDK technology in LEADTOOLS is multi-faceted: it can be used as a standalone feature or as the driving force behind more advanced technologies such as forms recognition, check recognition, and document conversion. Used on its own, it lets programmers convert an image to a text-searchable document in as few as three lines of code.

With extensive support for more than forty character sets, programmers can expand their customer base by providing the same solution for many languages, including English, Spanish, French, German, Japanese, Chinese, and Arabic.

Overview of LEADTOOLS OCR SDK Technology

Automatic Zone Recognition
Automatically segments an image into zones, which can improve recognition accuracy and efficiency

Manual Zone Recognition
Allows the user to draw specific regions of interest and recognize the text within them

Zone Types

  • Paragraph
  • Text
  • Numeric
  • Table
  • MICR
  • Graphic

Speed and Reliability

  • Fast, accurate, and reliable optical character recognition for use in any application and environment
    • Large volume document batch processing
    • Single and multi-page documents
    • Tablet and mobile devices
    • Web and Cloud applications
  • Utilize multiple cores for unparalleled performance

Accuracy

  • Spell checking dictionary support
  • Automatically detect, segment, and recognize multiple languages on the same document
  • Full-page analysis and Zonal recognition
    • Automatic table area segmentation
    • Automatic OMR area segmentation
    • Automatic vertical text segmentation with orientation angle detection (0, 90, or 270 degrees)
    • Automatic segmentation of vertical text within horizontal text pages
  • Automatic document cleanup
    • Omni-directional noise removal
    • Undither text
    • Dot matrix correction
    • Option to remove lines from tables
  • Automatic document preprocessing
    • Deskew of scanned document
    • Detect and correct document orientation (flipped or reversed) in single-page or multipage mode
    • Remove borders
    • Split pages
  • Unique color and bitonal image recognition detects text in scanned documents and pictures regardless of foreground and background colors
  • Output searchable text document formats such as PDF, PDF/A, DOC, DOCX, XML, XPS, and more, maintaining the original look and feel
    • Text with detected font characteristics (font-family name, style, size, bold, italic, underline, strikeout, slope angle, etc.)
    • Location
    • Table reconstruction
    • Layout
    • Graphics

Versatility

  • Fully featured SDK
    • High-level classes for one-shot recognition from image to final document
    • Low-level functionality for full customization
  • Supports multiple text recognition engines
  • Fully configurable recognition engine
    • Narrow down possible results with character and numeric filters
    • Multiple voting techniques for enhanced accuracy
    • Trace and progress callbacks
    • Enable/disable font characteristics for fast, text-only recognition
  • Comprehensive results reporting
    • Character location, size and baseline
    • Character attributes (end of word, end of line, end of paragraph, etc.)
    • Font properties (monospace, proportional, serif, sans-serif, bold, italic, underline, strikethrough)
    • Confidence values
    • Obtain the recognized words directly for a zone or page without saving to an external document
  • Integrates seamlessly with LEADTOOLS Forms Recognition and Processing
  • Implement large-scale distributed OCR applications using LEADTOOLS Distributed Computing SDK
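
As an illustration of how an application might act on the confidence values mentioned above, the following Java sketch flags low-confidence words for manual review. The RecognizedWord and ConfidenceReview classes are hypothetical and are not LEADTOOLS types, and the 0 to 100 confidence scale and the threshold of 80 are assumptions chosen for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical holder for one recognized word; not a LEADTOOLS type.
final class RecognizedWord {
    final String text;
    final int confidence; // assumed scale: 0 (no certainty) to 100 (full certainty)

    RecognizedWord(String text, int confidence) {
        this.text = text;
        this.confidence = confidence;
    }
}

final class ConfidenceReview {
    // Returns the text of every word whose confidence is below the threshold,
    // preserving the order in which the words were supplied.
    static List<String> wordsNeedingReview(List<RecognizedWord> words, int threshold) {
        List<String> flagged = new ArrayList<>();
        for (RecognizedWord word : words) {
            if (word.confidence < threshold) {
                flagged.add(word.text);
            }
        }
        return flagged;
    }

    public static void main(String[] args) {
        List<RecognizedWord> words = new ArrayList<>();
        words.add(new RecognizedWord("Invoice", 98));
        words.add(new RecognizedWord("N0.", 61));   // likely misread, low confidence
        words.add(new RecognizedWord("10452", 93));
        // Print the words that fall below the assumed review threshold of 80.
        System.out.println(wordsNeedingReview(words, 80));
    }
}
```

In a real application the word list would come from the engine's results for a zone or page, and the threshold would be tuned against sample documents.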

Cross-Platform

  • Native, full-featured Optical Character Recognition libraries for many platforms
    • Windows desktops, servers, and phones
    • iOS and macOS
    • Android
    • Web Services
  • Native mobile libraries run on the device without needing to call external services
  • Utilize camera preview on phones and tablets for real-time text recognition and processing
  • Specialized image processing and recognition for deformations unique to mobile devices
    • 3D Perspective Deskew
    • Keystone Correction
    • Fixed-focus camera optimizations
  • Programming interfaces closely resemble each other, making it easy to port an application to multiple platforms

LEADTOOLS OCR Engines

The LEADTOOLS OCR SDK contains the following recognition engines:

LEAD OCR Engine

Developed in-house with native libraries for Windows x86/x64, .NET (C# & VB), Linux, iOS, macOS, and Android.

Benefits of In-House Development

  • New features and enhancements to speed and accuracy are continuously added
  • Customer feedback and feature requests are used to enhance and expand the engine with solutions to real-world applications

OmniPage OCR Engine

Built on the licensed Nuance OmniPage 20 recognition engine, with native libraries for Windows x86/x64 and .NET.

Interchangeable High-level Interface

LEADTOOLS features a high-level interface that abstracts the recognition engines, so switching from one engine to another takes only a single line of code. For example, each of the following lines selects a recognition engine; use one or the other, and the rest of your code remains unchanged.


// Use LEAD engine
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD, false);
// Or use OmniPage engine instead (keep only one of these two declarations)
// IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.OmniPage, false);
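
The general pattern behind such a switch is a factory that returns a shared interface. The Java sketch below is only an illustration of that pattern, under the assumption that the engine choice comes from a configuration string. EngineKind, TextRecognizer, and RecognizerFactory are hypothetical names and are not part of the LEADTOOLS API.

```java
import java.util.Locale;

// Hypothetical stand-ins for illustration only; none of these are LEADTOOLS types.
enum EngineKind { LEAD, OMNIPAGE }

interface TextRecognizer {
    String describe();
}

final class RecognizerFactory {
    // Maps a configuration value such as "lead" or "OmniPage" to an engine kind,
    // falling back to LEAD when the value is missing or unrecognized.
    static EngineKind parseKind(String setting) {
        if (setting == null) {
            return EngineKind.LEAD;
        }
        switch (setting.trim().toLowerCase(Locale.ROOT)) {
            case "omnipage":
                return EngineKind.OMNIPAGE;
            default:
                return EngineKind.LEAD;
        }
    }

    // The only place that knows about concrete engines; callers see the interface.
    static TextRecognizer create(EngineKind kind) {
        switch (kind) {
            case OMNIPAGE:
                return () -> "OmniPage-style recognizer";
            default:
                return () -> "LEAD-style recognizer";
        }
    }

    public static void main(String[] args) {
        // Changing this one value switches engines; the rest of the code is untouched.
        String setting = args.length > 0 ? args[0] : "lead";
        TextRecognizer recognizer = create(parseKind(setting));
        System.out.println(recognizer.describe());
    }
}
```

Because callers depend only on the interface, the engine choice can move into configuration without touching the recognition code.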
      

TIFF to Searchable PDF with a Few Lines of Code

The LEADTOOLS OCR SDK provides a high-level programming interface that allows developers to create complex recognition applications in record time. For example, with the AutoRecognizeManager, developers can convert any of 150+ image formats into a text-searchable format such as PDF or DOC in as few as three lines of code!



// C#: create the LEAD engine, start it with default settings, and convert the TIFF to a searchable PDF
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD, false);
ocrEngine.Startup(null, null, null, null);
ocrEngine.AutoRecognizeManager.Run(@"C:\in.tif", @"C:\out.pdf",
   DocumentFormat.Pdf, null, null);


' VB: same steps; VB string literals do not use the C# @ prefix
Dim ocrEngine As IOcrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD, False)
ocrEngine.Startup(Nothing, Nothing, Nothing, Nothing)
ocrEngine.AutoRecognizeManager.Run("C:\in.tif", "C:\out.pdf",
   DocumentFormat.Pdf, Nothing, Nothing)


// Java
OcrEngine ocrEngine = OcrEngineManager.createEngine(OcrEngineType.LEAD);
ocrEngine.startup(null, null, null, null);
ocrEngine.getAutoRecognizeManager().run("C:\\in.tif", "C:\\out.pdf",
   DocumentFormat.PDF, null);


/* C API: file names are resolved relative to the working directory */
L_OcrEngine ocrEngine = NULL;
L_OcrAutoRecognizeManager autoRecognizeManager = NULL;
L_OcrEngineManager_CreateEngine( L_OcrEngineType_LEAD, &ocrEngine );
L_OcrEngine_Startup( ocrEngine, NULL, NULL );
L_OcrEngine_GetAutoRecognizeManager( ocrEngine, &autoRecognizeManager );
L_OcrAutoRecognizeManager_Run( autoRecognizeManager,
    L_TEXT( "in.tif" ), L_TEXT( "out.pdf" ), DOCUMENTFORMAT_PDF, NULL );


// Objective-C: engineDirectory and error are assumed to be declared by the caller
LTOcrEngine * const engine =
   [LTOcrEngineManager createEngine:LTOcrEngineTypeLEAD];
[engine startup:nil documentWriter:nil workDirectory:nil
   engineDirectory:engineDirectory error:&error];
[engine.autoRecognizeManager run:@"in.tif" documentFileName:@"out.pdf"
   zonesFileName:nil format:LTDocumentFormatPdf error:&error];


// Swift: engineDirectory and errorBlock are assumed to be defined by the caller
let engine = LTOcrEngineManager.createEngine(.lead)
do {
   try engine.startup(rasterCodecs: nil, workDirectory: nil,
      startupParameters: engineDirectory)
   try engine.autoRecognizeManager.run(imageFileName: "in.tif",
      documentFileName: "out.pdf", zonesFileName: nil,
      format: LTDocumentFormat.pdf)
}
catch let error as NSError {
   errorBlock(error)
}

OCR Languages

Recognize text from more than 40 languages and character sets, including:

  • English (en)
  • Afrikaans (af)
  • Albanian (sq)
  • Arabic (ar)
  • Azerbaijani (az)
  • Basque (eu)
  • Belarusian (be)
  • Bulgarian (bg)
  • Catalan (ca)
  • Chinese Simplified (zh-Hans)
  • Chinese Traditional (zh-Hant)
  • Croatian (hr)
  • Czech (cs)
  • Danish (da)
  • Dutch (nl)
  • Estonian (et)
  • Faroese (fo)
  • Finnish (fi)
  • French (fr)
  • Galician (gl)
  • German (de)
  • Greek (el)
  • Hungarian (hu)
  • Icelandic (is)
  • Indonesian (id)
  • Italian (it)
  • Japanese (ja)
  • Korean (ko)
  • Latvian (lv)
  • Lithuanian (lt)
  • Macedonian (mk)
  • Malay (ms)
  • Maltese (mt)
  • Norwegian (no)
  • Polish (pl)
  • Portuguese (pt)
  • Portuguese Brazil (pt-BR)
  • Romanian (ro)
  • Russian (ru)
  • Serbian (sr)
  • Serbian Cyrillic (sr-Cyrl-CS)
  • Slovak (sk)
  • Slovenian (sl)
  • Spanish (es)
  • Swahili (sw)
  • Swedish (sv)
  • Telugu (te)
  • Thai (th)
  • Turkish (tr)
  • Ukrainian (uk)
  • Vietnamese (vi)
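
Applications often need to map a user's locale tag onto one of the codes listed above. The following Java sketch shows one possible fallback strategy: try the full tag first, then progressively shorter prefixes. OcrLanguagePicker is a hypothetical helper, not a LEADTOOLS class, the matching is case-sensitive, and only a handful of the listed codes are included to keep the example short.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Optional;
import java.util.Set;

// Hypothetical helper for illustration; not part of the LEADTOOLS API.
final class OcrLanguagePicker {
    // A few codes copied from the list above; a real application would hold the full set.
    static final Set<String> SUPPORTED = new HashSet<>(Arrays.asList(
        "en", "de", "fr", "pt", "pt-BR", "sr", "sr-Cyrl-CS", "zh-Hans", "zh-Hant"));

    // Returns the requested tag if it is listed; otherwise drops trailing
    // subtags one at a time ("pt-PT" -> "pt"). Empty when nothing matches.
    static Optional<String> resolve(String requestedTag) {
        String candidate = requestedTag;
        while (candidate != null && !candidate.isEmpty()) {
            if (SUPPORTED.contains(candidate)) {
                return Optional.of(candidate);
            }
            int cut = candidate.lastIndexOf('-');
            if (cut < 0) {
                break; // no shorter prefix left to try
            }
            candidate = candidate.substring(0, cut);
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        for (String tag : new String[] {"pt-BR", "pt-PT", "zh-Hans-CN", "xx"}) {
            System.out.println(tag + " -> " + resolve(tag).orElse("(unsupported)"));
        }
    }
}
```

Note that a bare "zh" does not resolve in this sketch, because only the script-specific Chinese codes appear in the list.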

Download the Full Evaluation

The Full Evaluation Download includes all LEADTOOLS Document, Medical, Imaging, and Vector technologies for all development and target platforms.

White Papers Written About OCR

  • OCR with LEADTOOLS: the Better Choice

    Optical Character Recognition (OCR) is a technology and concept that is familiar to the majority of programmers: take a picture with words and convert it to text. It sure sounds simple, but implementing it well is often much harder than it looks. Much like someone who watches a professional surfer and then tries it themselves, developers get bruised, tired, and nearly drowned by endless waves of images whose varying fonts, bad scans, dust speckles, and paper crinkles keep exposing new problems in their algorithms. Save yourself some headache and use LEADTOOLS, the most accurate, fast, and easy-to-use OCR SDK on the market!

  • Using LEADTOOLS OCR to Enhance Google Drive Search

    Google Drive is a wonderful service for storing, organizing, and sharing files such as documents, photos, and videos. However, TIFF and other raster image files can easily get lost because of the limited search capabilities. Developers can use the LEADTOOLS OCR SDK to extract the text and then add it to the IndexableTextData for each item.

  • End-to-End eDiscovery with LEADTOOLS Document Imaging

    When it comes to change, the desire for efficiency is surely at or near the top of the list of reasons. Some processes and industries are harder to change, especially those that have been around for a long time. Court systems in many countries are among the oldest and most well-established institutions, built to ensure all-around fairness even at the expense of expediency. Thankfully, the legal industry has taken major strides towards adapting to the digital age with the evolution of eDiscovery and document imaging.

  • Unstructured Invoice Recognition and Processing with LEADTOOLS

    There is no doubt that imaging technologies such as forms recognition and OCR have revolutionized the way we do business and have freed up countless man-hours from the tedious and error-prone task of data entry. However, these technologies still rely on the assumption that the data will be located in the same place on every document. Unstructured forms and scanned documents such as invoices, bills, and tabs are notoriously difficult, if not impossible, to process on a large scale within an automated system. This is no longer the case after the release of LEADTOOLS Version 19 and its advanced Invoice Recognition and Processing SDK technology.

  • Reading and Processing Checks with LEADTOOLS

    As much as credit cards and digital wallets have tried to take over, checks are still a very prevalent and practical way to make payments and financial transactions. However, just because consumers and businesses still rely on traditional, paper-based financial transactions doesn't mean that technology can't streamline deposits and improve customer satisfaction. How are financial institutions implementing these features? And what benefits are there for other businesses not in the financial sector? LEADTOOLS includes a robust Check Scanning and Processing SDK that combines MICR and OCR to process checks and is already being used by a number of applications across the banking, insurance, and retail industries.

Demo Applications that Include OCR

WinForms OCR

Demonstrates the LEADTOOLS OCR LEAD engine in a WinForms application.

  • Convert images to document files
  • Recognize text in more than 40 character sets
  • Multiple spell-checking dictionary types supported
  • Automatically detect, segment, and recognize multiple languages on the same document
  • Full-page analysis and zonal recognition
  • Automatic document cleanup and preprocessing

HTML5/JavaScript OCR

Demonstrates the use of our OCR technology in HTML5/JavaScript by loading an image from a URL, or uploading an image from your device, and returning the OCR results of that image.

WinForms OCR Screen Capture

Demonstrates using LEADTOOLS OCR technology to extract text from a user-defined section of the screen by combining LEADTOOLS screen capture, OCR, Document Writers, and the Image Viewer control in one WinForms application.

  • Capture selected area, window, and full screen
  • Convert to RTF using LEADTOOLS OCR LEAD engine and Document Writers
  • Display the image using the LEADTOOLS Image Viewer control
  • Copy RTF and image data to the clipboard
  • Draw on the image using pen and brush

WinForms OCR Modules

Demonstrates the LEADTOOLS OCR technology available in the LEADTOOLS Recognition, Document Imaging Suite, and OCR Module add-ons. It lets you quickly evaluate all of the LEADTOOLS recognition engines, including those for machine-printed text, handwritten text, MICR, and OMR. It also demonstrates our powerful auto-zoning features while allowing you to create zones manually. Supported output formats include PDF, Text, DOCX, HTML, and XPS.

Android OCR App

The LEADTOOLS OCR application can perform Optical Character Recognition (OCR) on images, extract text from images, and convert images to various document formats with superior accuracy and speed.

iOS OCR App

The LEADTOOLS OCR application can perform Optical Character Recognition (OCR) on images, extract text from images, and convert images to various document formats with superior accuracy and speed.

macOS OCR App

The LEADTOOLS OCR application can perform Optical Character Recognition (OCR) on images, extract text from images, and convert images to various document formats with superior accuracy and speed.

Screenshots of OCR

OCR Before and After

OCR Zone Auto-Detection

Videos of OCR

OCR Features

High-level view of some features of the LEADTOOLS OCR SDK, including those that set it apart from other OCR SDKs. Thousands of developers rely on the LEADTOOLS OCR SDK to create enterprise-level OCR, forms processing, MICR, and OMR solutions.

LEADTOOLS SDK Products that Include OCR

LEADTOOLS Recognition v20

The LEADTOOLS Recognition Imaging SDK is a handpicked collection of LEADTOOLS SDK features designed to build end-to-end document imaging applications within enterprise-level document automation solutions that require OCR, MICR, OMR, barcode, forms recognition and processing, PDF, print capture, archival, annotation, and image viewing functionality. This powerful set of tools utilizes LEAD's award-winning image processing technology to intelligently identify document features that can be used to recognize and extract data from any type of scanned or faxed form image.

LEADTOOLS Document Imaging Suite v20

The LEADTOOLS Document Imaging Suite SDK is a comprehensive collection of LEADTOOLS SDK features designed to build end-to-end document imaging solutions that require OCR, MICR, OMR, ICR, barcode, forms recognition and processing, PDF, HTML5 Zero-footprint viewing, conversion, print, capture, archival, annotation, and image viewing functionality. This powerful set of tools utilizes LEAD's award-winning image processing technology to intelligently identify document features that can be used to recognize and extract data from any type of scanned or faxed form image.

LEADTOOLS OCR Module - LEAD Engine v20

LEADTOOLS OCR Module - LEAD Engine is an add-on product for integrating optical character recognition (OCR), magnetic ink character recognition (MICR), and optical mark recognition (OMR) technology into applications and provides everything programmers need to develop robust, high-performance, and scalable image recognition solutions. LEADTOOLS OCR Module - LEAD Engine seamlessly integrates with LEADTOOLS SDKs in the Document and Medical product lines.

LEADTOOLS OCR Module - OmniPage Engine v20

LEADTOOLS OCR Module - OmniPage Engine is an add-on product for integrating optical character recognition (OCR) technology into applications and provides everything programmers need to develop robust, high-performance, and scalable image recognition solutions. LEADTOOLS OCR Module - OmniPage Engine seamlessly combines with LEADTOOLS SDKs in the Document and Medical product lines.

LEADTOOLS OCR Module - OmniPage Engine with Additional Languages v20
(Only product to include Arabic language support)

LEADTOOLS OCR Module - OmniPage Engine with Additional Languages is an add-on product for integrating Arabic, Chinese, Japanese, and Korean optical character recognition (OCR) technology into applications and provides everything programmers need to develop robust, high-performance, and scalable image recognition solutions. LEADTOOLS OCR Module - OmniPage Engine with Additional Languages seamlessly integrates with LEADTOOLS SDKs in the Document and Medical product lines.

LEADTOOLS ICR Module - OmniPage Engine v20

LEADTOOLS ICR Module - OmniPage Engine is an add-on product for integrating intelligent character recognition (ICR) and optical character recognition (OCR) technology into applications and provides everything programmers need to develop robust, high-performance, scalable image-recognition solutions. LEADTOOLS ICR Module - OmniPage Engine includes the OmniPage Engine and seamlessly combines with LEADTOOLS SDKs in the Document and Medical product lines.