Welcome Guest! To enable all features, please Login or Register.

Notification

Icon
Error

Options
View
Last Go to last post Unread Go to first unread post
#1 Posted : Monday, October 17, 2005 12:38:32 AM(UTC)

inno  
inno

Groups: Registered
Posts: 4


Is there any implemeted way to recognize white space between words as
in original scanned document ? Helpfull woud be  some  some
point, how to do this ,too . My Idea was that when I know the size of
the Zone an the ant I thought thet i can find out the position of
character whithin the zone then I can calculate spaces between words . But it needs my own algorithm.

Thanks for help.

Rene.
 

Try the latest version of LEADTOOLS for free for 60 days by downloading the evaluation: https://www.leadtools.com/downloads

Wanna join the discussion? Login to your LEADTOOLS Support accountor Register a new forum account.

#2 Posted : Tuesday, October 18, 2005 11:16:19 PM(UTC)
Maen Hasan

Groups: Registered, Tech Support
Posts: 1,326

Was thanked: 1 time(s) in 1 post(s)

Hello,

You can recognize the white spaces
characters in the recognized document by searching for the white spaces in the recognized characters in a specific page.

To get the recognized characters in a specific page, you can use the
GetRecognizedCharacters method (ILEADRasterDocument).This method will
update the RecognizedCharacter property and its associated
ILTRecognizedCharacters properties. Also, you can get the recognized
characters coordinates by retrieving the left (Top, Width or Height) property ILTRecognizedCharacters) of the recognized character.
For more information, please refer to the "LEADTOOLS OCR COM Object Help File" and read the following topics:
- GetRecognizedCharacters method (ILEADRasterDocument)
- RecognizedCharacter property (ILEADRasterDocument)
- Left property (ILTRecognizedCharacters)
- Quick Reference OCR: Using Recognized Characters
Also, for a code snippet, please refer to the following sample in the "LEADTOOLS OCR COM Object Help File":
-GetRecognizedCharacters Example for Visual Basic
    

If you are using LEADTOOLS .Net v14 programming interface, you can find
a VB.NET OCR example that allows you to load an image and perform OCR
on it. You can then specify a word you wish to search for within the
image. If the word is found, a highlight annotation will be displayed over the word
The VB.Net sample is posted on the following forum:
http://support.leadtools...orums/2788/ShowPost.aspx

Thanks,
Maen Badwan
LEADTOOLS Technical Support
 
#3 Posted : Friday, October 21, 2005 1:36:50 AM(UTC)

inno  
inno

Groups: Registered
Posts: 4


You mean to calculate white space from range between characters , when I know their possitions and size?

Its only  possybility I think because GetRecognizedCharacters do
not return Whitespace Characters for me , just letters , so I have to calculate it myself. :(


 
#4 Posted : Friday, October 21, 2005 2:40:13 AM(UTC)

inno  
inno

Groups: Registered
Posts: 4


I have found olso space property in recognized characters but but it
gives me no acurate results , and it gives different result when I set
different fill method but no result was ok. Sometimes to high sometimes
ok, Sometimes I think that sapace property inidcates no
whitespaceCharacters but something like space before chcaracter but ,
another problem is tah I dont know size of that space I know just just number of spaces , which is not usefull.


Everything what I need is Output form OCR simmilar to input , so i need
to have recognized whitespace characters , i dont understand how it is
possyble that it is not yet implemented because technically it is
possyble to calculate . For me is logikal , when i set full format and
recognize to pdf for example I get output whit whitespace characters
like in input , but in leeds i get just one spaceCharacter between words.
:(
 
#5 Posted : Sunday, October 30, 2005 3:33:33 AM(UTC)
Maen Hasan

Groups: Registered, Tech Support
Posts: 1,326

Was thanked: 1 time(s) in 1 post(s)

Hello,

I think my colleague Travis
already answered you in Ticket#2005101410002891. And the following his reply in case you didn't receive it:

==================================
I made a project in MS Visual C++ 6.0.  Each of the space came out just as they should.
However I see that you are
using our COM object. Attached is a project that shows prints out the
white space value the OCR finds.  I'm not finding any issues with
it.  Try this with your image.  If you still see the issue, please send me the image.

==================================

Thanks,
Maen Badwan
LEADTOOLS Technical Support
File Attachment(s):
NET - V14 - GetRecognizedCharacters.zip (9kb) downloaded 56 time(s).
 
You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.

Powered by YAF.NET | YAF.NET © 2003-2024, Yet Another Forum.NET
This page was generated in 0.129 seconds.