LEADTOOLS OCR SDK
Programming tools for adding OCR
technology into software applications quickly and easily.
LEADTOOLS OCR Modules
Optical Character Recognition for Documents
The award winning LEADTOOLS OCR
Modules provide methods for incorporating optical character recognition
(OCR) technology into an application and include everything needed to
develop robust, high performance and scalable document imaging solutions
that include optical character recognition technology.
An important feature of the OCR design is the support of multiple
OCR engines. The ability to choose the right engine for a specific solution
gives unprecedented flexibility to developers. To reduce complexity
and overall development time, the design hides the underlying engine
details through the use of a common .NET class library. Changing underlying
OCR engines based on the requirements of the project requires virtually
no change to the application code.
Key Features of LEADTOOLS OCR
- Convert an image to a document with as little as three lines of
code.
- OCR images from SharePoint.
- Process an entire page or only specific areas in a page.
- Recognize and export text, choosing from
any of 40 formats, including Adobe PDF and PDF/A,
MS Word, MS Excel as well as various flavors of ANSI and UNICODE text.
- Perform OCR processes in a single or multi-threaded environment
with optimization for server-based operations.
- Create, process and recognize multiple documents simultaneously
in the same application.
- Supports 32 and 64 bit development.
- Multiple OCR engines are supported and abstracted from the user
through the use of a common .NET class library. Switching between
the various engines requires virtually no changes in the application
code.
- Select the character
set and language dictionary of documents to be recognized. Choose
from English, Danish, Dutch, Finnish, French, German, Italian, Norwegian,
Portuguese, Russian, Spanish, or Swedish.
- Segment complex pages manually or automatically into text zones,
image zones, table zones, lines, headers and footers.
- Set accuracy thresholds prior to recognition to control the accuracy
of recognition.
- Learn, save, and load character recognition data for similar documents.
The software learns as a result of normal recognition and acquires
additional information by using the OCR’s text verification
system.
- Recognize text from 5 to 72 points in virtually any typeface.
- Increase recognition accuracy with built-in and user dictionaries.
- Automatically detect fax, dot matrix, and other degraded documents
and compensate accordingly.
- Process both text and graphics. The recognition software's ability
to distinguish halftone graphics from text can provide the basis of
a compound document processing system.
- High-level design provides ease of use, while available low-level
functions are still available to provide complete control.
OCR Modules
LEAD's OCR
SDK Modules include C DLL and .NET support.
Each of the following LEADTOOLS OCR Modules include .NET interfaces
that greatly simplify coding and speeds development of OCR applications.
Additionally, the same code can be used with any of LEADTOOLS OCR Modules.
LEADTOOLS OCR Module - Plus
Includes automatic and manual zone detection, formatted output, auto-orientation,
custom spelling dictionaries and MICR support. PDF
and PDF/A output, ICR and OMR support is available.
LEADTOOLS OCR Module - Professional
Includes the fastest and most accurate automatic and manual zone detection,
unicode support, formatted output, auto-orientation, custom spelling dictionaries
and MICR support. PDF and PDF/A output, ICR and
OMR support is available.
Why LEADTOOLS OCR
SDK is the best choice.
- Industry's only multi-thread safe OCR SDK.
- Supports 32 and 64 bit operating systems.
- Ease of use means converting an image to a PDF can be done in three
lines of code.
- Supports full page or zonal OCR. Zones can either be automatically
or manually generated.
- Use LEADTOOLS image processing functions like document
cleanup functions to improve recognition results.
- Recognize text in low quality text images such as those created
with dot matrix printers.
- Recognize a variety of documents, including facsimiles, photocopies
and documents with complex layouts.
- Integrates with the LEADTOOLS
Forms Recognition Module to improve form recognition and adds
forms processing functionality.
Specialized Recognition Modules may be added on to LEADTOOLS OCR
Modules.
- Recognize marks such as checkmarks or bubble sheets.
- Recognize hand-printed numerals (0-9) and four additional signs
(+ - . ,).
- Recognize hand printed text.
LEADTOOLS PDF
Output OCR Module
- Extend the LEADTOOLS OCR
Modules to add text searchable PDF and PDF/A output support.
The OCR SDK
and related products are available below.
* Deployment requires runtime license. Marked toolkits require runtime licensing based on the deployment of the application you develop. Several purchase options are available. For more information, please contact oemsales@leadtools.com or call a LEAD sales representative. Click here for more information on LEADTOOLS Runtime Licensing requirements.
* LEADTOOLS PDF OCR
Plug-in is required to output PDF