LEADTOOLS OCR (Leadtools.Forms.Ocr assembly)
LEAD Technologies, Inc

OcrCharacter Structure






Represents recognized character data.
Syntax
[SerializableAttribute()]
public struct OcrCharacter : System.ValueType 
'Declaration
 
<SerializableAttribute()>
Public Structure OcrCharacter 
   Inherits System.ValueType
'Usage
 
Dim instance As OcrCharacter
[SerializableAttribute()]
public class OcrCharacter
[SerializableAttribute()]
public value class OcrCharacter : public System.ValueType 
Remarks

To get the recognized characters of a page, call IOcrPage.GetRecognizedCharacters after IOcrPage.Recognize or IOcrPage.RecognizeText.

To update the recognized characters of a page, call IOcrPage.SetRecognizedCharacters before calling IOcrDocument.Save, IOcrDocument.SaveAsync, IOcrDocument.SaveXml or IOcrDocument.SaveXmlAsync.
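
The update flow can be sketched as follows. This is a minimal illustration, not the toolkit's reference example: it assumes that IOcrPage.SetRecognizedCharacters (listed under See Also) accepts the modified IOcrPageCharacters, and that OcrCharacter exposes Code and Confidence members. The helper name, the threshold and the placeholder character are hypothetical.

```csharp
using Leadtools.Forms.Ocr;

static class OcrCharacterUpdateSketch
{
   // Hypothetical helper: replaces every character whose confidence is below
   // minConfidence with a placeholder, then writes the result back to the page.
   public static void MaskLowConfidence(IOcrPage ocrPage, int minConfidence, char placeholder)
   {
      // The page is assumed to have been recognized already
      IOcrPageCharacters pageCharacters = ocrPage.GetRecognizedCharacters();

      foreach (IOcrZoneCharacters zoneCharacters in pageCharacters)
      {
         for (int i = 0; i < zoneCharacters.Count; i++)
         {
            // OcrCharacter is a structure, so the indexer returns a copy
            OcrCharacter character = zoneCharacters[i];
            if (character.Confidence < minConfidence)
            {
               character.Code = placeholder;
               // Store the modified copy back into the collection
               zoneCharacters[i] = character;
            }
         }
      }

      // Assumed write-back call; it must run before the document is saved
      ocrPage.SetRecognizedCharacters(pageCharacters);
   }
}
```

Because OcrCharacter is a value type, changing a member on the copy returned by the indexer has no effect until the copy is assigned back to the collection.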

IOcrPageCharacters implements the standard IList, ICollection and IEnumerable interfaces with items of type IOcrZoneCharacters. Each item in the IOcrPageCharacters collection holds the recognized characters of one zone of the page.

The IOcrZoneCharacters interface contains a collection of the characters for a particular zone.

IOcrZoneCharacters also implements the IList, ICollection and IEnumerable interfaces, but with items of type OcrCharacter. Each item in the IOcrZoneCharacters collection describes one recognized character of the zone.

The IOcrPageCharacters interface also contains the IOcrPageCharacters.UpdateWord method, which lets you modify the OCR recognition results by updating or deleting words before optionally saving the results to the final output document.

For example, if you are interested in iterating through the characters of the 2nd zone in the page, you can do the following:

            // Get the page characters
            IOcrPageCharacters pageCharacters = ocrPage.GetRecognizedCharacters();
            // Get the 2nd zone characters. Note, index is zero-based so 2nd zone is index 1
            // You can also iterate through the pageCharacters collection and find the IOcrZoneCharacters item with ZoneIndex = 1
            IOcrZoneCharacters zoneCharacters = pageCharacters.FindZoneCharacters(1);
            // Loop through the characters
            foreach(OcrCharacter ocrCharacter in zoneCharacters)
            {
               // Do something with ocrCharacter
            }
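
The nested collections described above can also be walked for every zone at once. The sketch below rebuilds the text of each zone from its characters. It assumes that OcrCharacter exposes a Code member and a Position flags member of type OcrCharacterPosition with EndOfWord and EndOfLine values; neither is described in this topic, and the helper name is hypothetical.

```csharp
using System;
using System.Text;
using Leadtools.Forms.Ocr;

static class OcrCharacterDumpSketch
{
   // Hypothetical helper: prints the recognized text of every zone on a page
   public static void DumpZones(IOcrPage ocrPage)
   {
      IOcrPageCharacters pageCharacters = ocrPage.GetRecognizedCharacters();

      // Outer collection: one IOcrZoneCharacters item per zone
      foreach (IOcrZoneCharacters zoneCharacters in pageCharacters)
      {
         StringBuilder text = new StringBuilder();

         // Inner collection: one OcrCharacter item per recognized character
         foreach (OcrCharacter character in zoneCharacters)
         {
            text.Append(character.Code);

            // Position is treated as a flags value; add the separators
            // that the character codes themselves do not carry
            if ((character.Position & OcrCharacterPosition.EndOfLine) == OcrCharacterPosition.EndOfLine)
               text.AppendLine();
            else if ((character.Position & OcrCharacterPosition.EndOfWord) == OcrCharacterPosition.EndOfWord)
               text.Append(' ');
         }

         Console.WriteLine("Zone {0}: {1} characters", zoneCharacters.ZoneIndex, zoneCharacters.Count);
         Console.WriteLine(text.ToString());
      }
   }
}
```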
            

OcrCharacter holds the most detailed information available about a recognized character.

Touching characters, those whose shapes are physically joined in the page passed to the OCR engine, will result in a separate OcrCharacter structure for each recognized character within the block. However, the coordinate property of these characters (Bounds) will have identical coordinates defining the boundary rectangle of the whole character block. The coordinates therefore give no information about the order of the characters inside the block; that order is given by the order of the OcrCharacter structures, which matches the order of the touching characters on the original document.
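
One way to detect such blocks is to group consecutive characters that report the same Bounds. The sketch below is an illustration under assumptions: it relies on the Bounds value comparing by value through Equals, and the helper name is hypothetical. Any group with more than one item is a block of touching characters, listed in document order.

```csharp
using System.Collections.Generic;
using Leadtools.Forms.Ocr;

static class TouchingCharacterSketch
{
   // Hypothetical helper: splits a zone into runs of consecutive characters
   // that share the same Bounds rectangle
   public static List<List<OcrCharacter>> GroupByBounds(IOcrZoneCharacters zoneCharacters)
   {
      var groups = new List<List<OcrCharacter>>();
      List<OcrCharacter> current = null;

      foreach (OcrCharacter character in zoneCharacters)
      {
         // Compare against the last character of the current run
         bool sameBlock = current != null &&
            current[current.Count - 1].Bounds.Equals(character.Bounds);

         if (!sameBlock)
         {
            // Different rectangle: start a new run
            current = new List<OcrCharacter>();
            groups.Add(current);
         }

         current.Add(character);
      }

      return groups;
   }
}
```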

To create an OcrCharacter object with all values initialized to default values, use OcrTypeManager.CreateDefaultOcrCharacter. This is a shortcut for languages that do not automatically initialize all the members of a structure, such as JavaScript.

Example
For an example, refer to IOcrPage.GetRecognizedCharacters.
Requirements

Target Platforms: Windows 7, Windows Vista SP1 or later, Windows XP SP3, Windows Server 2008 (Server Core not supported), Windows Server 2008 R2 (Server Core supported with SP1 or later), Windows Server 2003 SP2

See Also

Reference

OcrCharacter Members
Leadtools.Forms.Ocr Namespace
IOcrPage.SetRecognizedCharacters
IOcrPage.GetRecognizedCharacters
IOcrPage.Recognize
IOcrPage.IsRecognized
OcrCharacter Structure
IOcrPageCharacters Interface
IOcrZoneCharacters Interface
IOcrPageCollection Interface
IOcrZoneCollection Interface
OcrZone Structure
Programming with the LEADTOOLS .NET OCR
OCR Confidence Reporting

 

 



© 2006-2012 All Rights Reserved. LEAD Technologies, Inc.

OcrCharacter requires an OCR module license and unlock key. For more information, refer to: Imaging Pro/Document/Medical Features