Welcome Guest! To enable all features, please Login or Register.



Last Go to last post Unread Go to first unread post
#1 Posted : Sunday, January 1, 2012 6:18:46 AM(UTC)

Groups: Registered
Posts: 7

Apologies if this seems to be an easy one. I'd like to know what is the best way to determine if two PDF files are the same in terms of content (images and text).
I've tried PDFDocument and DocumentReader, but I'm not sure if there are others out there that are just as effective.

Any help will be greatly appreciated. Thanks!


Try the latest version of LEADTOOLS for free for 60 days by downloading the evaluation: https://www.leadtools.com/downloads

Wanna join the discussion? Login to your LEADTOOLS Support accountor Register a new forum account.

#2 Posted : Monday, January 2, 2012 12:11:16 AM(UTC)
Maen Hasan

Groups: Registered, Tech Support
Posts: 1,326

Was thanked: 1 time(s) in 1 post(s)

Regarding comparing images, I would recommend looking at the CorrelationCommand class. For more information, please see the following page:

Note that to use the CorrelationCommand class, you need to load the PDF files as raster images using the RasterCodecs.Load() method, and then pass the images to the CorrelationCommand class.

Also, the other option is using the RasterImage.GetRow or GetRowColumn methods, and comparing sections of pixels within the image to see if they're the same.

Maen Badwan
LEADTOOLS Technical Support
#3 Posted : Wednesday, January 4, 2012 12:35:52 AM(UTC)

Groups: Registered
Posts: 7

Thanks for the reply Maen! That sounds like a great idea! I'll try it out and then I'll let you know how it does.
You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.

Powered by YAF.NET | YAF.NET © 2003-2024, Yet Another Forum.NET
This page was generated in 0.050 seconds.