LEADTOOLS Document SDK products include comprehensive document and imaging libraries to load, save, convert, and view PDF files.

LEADTOOLS libraries provide functions such as the extraction of text, images, hyperlinks, and metadata, editing of bookmarks and annotations, page replacement, split and merge existing files, convert to PDF/A, linearization, and compression. Combined with rasterization and image display components, developers can take advantage of these tools to enhance their applications with dynamic document viewing, editing, and assembly features. Furthermore, programmers using .NET (C# & VB, Core, Xamarin, UWP), C/C++, iOS, macOS, Linux, Java, and web can leverage state-of-the-art OCR, OMR, ICR, Forms Recognition, Virtual Printing, and scanning technologies within LEADTOOLS to create any type of document and medical imaging application that utilizes the PDF format.

Tested against thousands of files, LEADTOOLS provides impeccable rendering accuracy that tops many market-leading third-party applications. LEADTOOLS accounts for common errors and differences between PDF versions to give programmers peace of mind, minimize testing phase, and create the best applications on the market.

Overview of LEADTOOLS PDF SDK Technology

 PDF Document Features

  • Load and view any document
  • Extract text (characters, words, and lines), fonts, annotations, rectangles, and hyperlinks with location and size
  • Extract images from documents and save to any of the 150+ file formats supported by LEADTOOLS
  • Full support to read, edit, and write PDF annotations
  • Parse the document structure by reading and updating bookmarks (table of contents) and internal links (jumps)
  • Unicode support, including Chinese, Japanese, Arabic, and Hebrew character-sets
  • Generate a raster image and thumbnail of any page

 PDF File Features

  • The PDF Optimizer component uses AI to analyze features of the document to create the smallest file possible
  • Comprehensive multi-page support includes
    • Merge existing files into a single file
    • Split a single file into multiple files
    • Extract, delete, insert, and replace any page in existing files
  • Convert existing PDF to PDF/A
  • Convert between PDF versions
  • Convert (distill) postscript to PDF with optimization for eBook, screen, and pre-press
  • Convert documents to vector SVG
  • Linearize (optimize for web viewing)
  • Create auto-print files
  • Read, write, and update the Table of Contents
  • Read, write, and update all metadata such as author, title, subject, keywords, and initial view
  • Read and write PDF Digital Signatures
  • Encrypt and decrypt documents

 PDF Annotations and Markups

LEADTOOLS supports reading, displaying, editing, and writing annotations and markups that work seamlessly with Adobe Acrobat and other compliant readers. With annotations and markups, users can collaborate by writing comments and drawing shapes on the document without making permanent changes. Additionally, sensitive information can be redacted to help your application comply with privacy and protection standards such as GDPR and HIPPA.

  • Supports the following annotation and markup objects and properties
    • Arrow
    • Comments and replies
    • Highlight
    • Intent
    • Leader
    • Line
    • Line Endings
    • Redaction
    • Review
    • Shapes
    • Text
    • Text Callout
    • Note reply
    • Transformation
  • Display annotations with border effects and cloud stroke
  • Options to control annotation rendering when loading as raster with support for No Appearance Stream annotations
  • Fully functional sample application with source code that implements reading, writing, editing, and annotation features

 OCR Output

With LEADTOOLS, developers can easily convert any image into a searchable PDF. These files are generally smaller in size than the comparable raster image and the embedded text can be searched, indexed, and edited.

  • Convert images to searchable PDF files with as little as three lines of code using LEADTOOLS OCR SDK technology
  • Export as text-only for minimal file size or export as image-over-text to retain original formatting
  • Multiple PDF versions and flavors including 1.2 - 1.7 and PDF/A
  • Multiple compression options for images within the file, including:
    • JPEG
    • JPEG 2000
    • CCITT G3/G4
    • JBIG2
    • LZW
    • MRC
  • Convert entire file or only specified pages
  • Create and update metadata such as author, title, and keywords
  • Protect sensitive data with encryption using RC4 40-bit and RC4 128-bit encryption
  • Control access to the contents with a password
  • Enable file permissions with a password
  • Embed fonts
  • Create linearized files for faster web viewing
  • Convert images from disk, memory, Internet, SharePoint, and cloud

 Image-based PDF Features

In addition to handling text-based files, LEADTOOLS fully supports loading, saving, and editing image-based files. This includes rasterizing both text and image-based PDF files, as well as converting single and multi-page image formats such as JPEG and TIFF into image-based PDF files.

  • Convert any file between more than 150 supported image formats
  • Multiple versions and flavors, including 1.2 - 1.7 and PDF/A
  • Multiple compression options, including:
    • JPEG
    • JPEG 2000
    • CCITT G3/G4
    • JBIG2
    • LZW
    • MRC
  • Specify RGB or CMYK color space
  • Convert entire file or only specified pages
  • Encrypt and decrypt documents using RC4 40-bit and RC4 128-bit encryption
  • Control access with User and Owner passwords
  • Load from disk, memory, Internet, and SharePoint

Rasterization Options

At the heart of PDF-to-image conversion is the rasterization process. PDF documents are comprised of vector objects such as text, shapes, and images. These objects have a relative location based on the physical, printed dimensions. This means that these files can be rasterized to any pixel dimension which provides varying quality depending on the DPI selected. LEADTOOLS libraries provides maximum flexibility when rasterizing files so developers can control the quality, size, color, and more.

  • Automatically detects the best rasterization options by examining the contents of the PDF
  • Rasterize to any DPI to control overall quality and file size
  • Load at 1, 8, or 24 bits per pixel
  • Render fonts with 2 or 4-bit anti-aliasing, resulting in a more readable image
  • Display CIDFonts not embedded in the file
  • Detect the original DPI of embedded images
  • Rescale embedded vector graphics with 2 and 4-bit anti-aliasing to reduce jaggies

 Vector-based PDF Features

LEADTOOLS provides a specialized framework to handle vector-based documents. Only available in the Document and Medical Imaging product families, this framework provides the ultimate experience for developing applications that need vector-based document support.

 PDF Forms

LEADTOOLS libraries provides developers everything they need to use existing PDF Forms. Any field can be read, filled, and saved.

  • Read form field information such as location and type
  • Enter data into form fields
  • Extract data from filled forms and save as XML
  • Supports Acrobat Forms Data (FDF) and Adobe XML Forms Architecture (XFA)

 PDF Compression

Maintain quality while maximizing compression with LEADTOOLS advanced image segmentation and compression components. Files optimized with this component can be viewed in any PDF viewer. By storing complex mixed raster content (MRC), this component creates files with better compression and quality than a standard raster PDF file.

  • AI automatically segments the image with optimization options
  • Manually segment the image to take full control over file size and image quality optimization
  • Multiple compression options, including:
    • JPEG
    • JPEG 2000
    • CCITT G3/G4
    • JBIG2
    • LZW
    • MRC
  • Automatic background detection
  • Compress single and multi-page files

Explanation of PDF File Types

In general, PDF and PDF/A files can be categorized into two basic file types: raster image and searchable. Raster image files are comprised of a complete raster image in a PDF wrapper and support multiple compression types, including JPEG, JPEG 2000, CCITT G3/G4, JBIG2, LZW, and MRC. The greatest advantage of raster image-based files is that they appear identical to the original document. On the other hand, searchable files are often smaller in size and the text can be searched and edited.

When converting from raster images to searchable PDFs, the formatting of the original image is often modified. To alleviate this, LEAD has implemented a hybrid type known as "image over text". In image-over-text files, the text is formatted as usual, while the original raster image is overlaid on top of the text. This maintains the look and formatting of the original raster image while still allowing the text content to be searched, selected, and copied.

Technology Related to PDF

Start Coding with LEADTOOLS

Download the Full Evaluation

The Full Evaluation Download includes all LEADTOOLS Recognition, Document, Vector, Medical, and Imaging technologies for all development and target platforms. Get everything LEADTOOLS all in one convenient download.

Documentation Links for PDF

Code Tips That use PDF

Supported Development Platforms for PDF

White Papers Written About PDF

  • Implementing a Standardized PDF/A Document Storage System with LEADTOOLS

    Electronic document archival has evolved far beyond the simple days of scanning a paper document and saving it as an image or PDF. Nowadays, many documents don't even start in physical form and could be one of many open or proprietary formats. Adding to the disparity caused by varying file formats is how and where files are stored. Many enterprises have their documents spread around numerous "data islands" including local computers, networked file shares, and cloud services. This white paper will explore how to take full advantage of PDF/A as your universal document storage format by using the state-of-the-art technology within LEADTOOLS Document Imaging SDKs.

  • End-to-End eDiscovery with LEADTOOLS Document Imaging

    When it comes to change, the desire for efficiency is surely at or near the top of the list of reasons. Some processes and industries are harder to change, especially those that have been around for a long time. Court systems in many countries are one of the oldest and most well established processes to ensure all-around fairness, even if it must sacrifice expediency. Thankfully, the legal industry has taken major strides towards adapting to the digital age with the evolution of eDiscovery and document imaging.

Demo Applications that Include PDF

HTML5/JavaScript Document Viewer

Demonstrates the LEADTOOLS Document Viewer in an HTML5/JavaScript application. The Document Viewer can be used to view raster, text, and document formats, making it ideal for Enterprise Content Management (ECM), document retrieval, and document normalization solutions.

  • Load a document from local disk and url
  • Draw annotations on the document
  • Use thumbnail viewer for page selection
  • View any bookmarks that have been included in the document
  • Interactive zooming/panning
  • Print annotated documents

HTML5/JavaScript LEADVIEW

LEAD has packaged the LEADTOOLS Document Viewer web application and service into the LEADVIEW API component. It requires as little as three lines of code to plug the LEADVIEW API into any JavaScript application. The component is highly customizable and supports all the features of the existing low-level Document Viewer, including viewing and converting hundreds of file formats and more than 30 annotation and markup objects. Users can easily create themes for the UI or use the predefined dark or light themes. With a settings dialog or JSON file, the entire ReactJS UI can be customized by each end-user or administratively locked down at a server level.

Screenshots of PDF

PDF File Properties

PDF File Properties

PDF Displayed in Document Viewer

PDF Displayed in Document Viewer

Videos of PDF

PDF Features

This tutorial shows LEADTOOLS PDF features including TIFF to PDF conversion, PDF text extraction, PDF file properties and more.

LEADTOOLS SDK Products that Include PDF

LEADTOOLS PDF Pro v20
(Excludes SVG-based PDF Viewing and Conversion)

LEADTOOLS PDF Pro provides everything developers need to read, write, raster-view, and update PDF files. It also includes advanced capabilities such as the extraction of text, images, hyperlinks, and metadata, manipulation of pages in existing PDF documents, conversion to PDF/A as a real document, linearization, and the LEADTOOLS PDF Optimizer to reduce the size of PDF files. By building upon the award-winning LEADTOOLS Imaging Pro features which includes 150+ image formats, image compression, image processing, image viewers, imaging common dialogs, 200+ display effects, TWAIN and WIA scanning, screen capture, and printing, LEADTOOLS PDF Pro is one of the best values available.

LEADTOOLS Pro Suite v20
(Excludes SVG-based PDF Viewing and Conversion)

LEADTOOLS Pro Suite is a an extensive bundle in the LEADTOOLS Pro line of SDKs and includes HTML5 Zero-footprint image viewers, barcode detect/read/write, advanced PDF read/write/view/edit, 150+ image formats, image compression, image processing, image viewers, imaging common dialogs, 200+ display effects, TWAIN and WIA scanning, screen capture, and printing. Developers using LEADTOOLS Pro Suite can develop robust imaging applications and solutions at a fraction of the cost of similar feature sets found elsewhere on the market.

LEADTOOLS Document Imaging v20

Develop powerful document imaging applications with LEADTOOLS Document Imaging. Features include PDF viewing and editing, comprehensive image annotating, specialized bitonal image displaying, and image processing. Other features include performance and memory optimizations for bitonal images, document image cleanup, including inverted text, border, hole-punch, and line removal, and scanning with LEADTOOLS Fast TWAIN and WIA.

LEADTOOLS Recognition v20

The LEADTOOLS Recognition Imaging SDK is a handpicked collection of LEADTOOLS SDK features designed to build end-to-end document imaging applications within enterprise-level document automation solutions that require OCR, MICR, OMR, barcode, forms recognition and processing, PDF, print capture, archival, annotation, and image viewing functionality. This powerful set of tools utilizes LEAD's award-winning image processing technology to intelligently identify document features that can be used to recognize and extract data from any type of scanned or faxed form image.

LEADTOOLS Document Imaging Suite v20

The LEADTOOLS Document Imaging Suite SDK is a comprehensive collection of LEADTOOLS SDK features designed to build end-to-end document imaging solutions that require OCR, MICR, OMR, ICR, barcode, forms recognition and processing, PDF, HTML5 Zero-footprint viewing, conversion, print, capture, archival, annotation, and image viewing functionality. This powerful set of tools utilizes LEAD's award-winning image processing technology to intelligently identify document features that can be used to recognize and extract data from any type of scanned or faxed form image.

LEADTOOLS Medical Imaging v20

Develop powerful Medical Imaging applications with the LEADTOOLS Medical Imaging SDK. Features include comprehensive DICOM data set support, 8-16 bit extended grayscale image support, image annotation, specialized extended grayscale image display such as window level and LUT processing, and medical-specific image processing. Other features include lossless JPEG compression, and signed and unsigned image data processing.

LEADTOOLS PACS Imaging v20

Develop robust DICOM PACS applications with LEADTOOLS PACS Imaging. Features include Medical Web Viewer Framework, high and low-level PACS SCP and SCU functions and controls, secure PACS communication, comprehensive DICOM data set support, image annotation, extended grayscale image display such as window level and LUT processing, and specialized medical image processing. Other features include lossless JPEG compression, JPIP, MRTI, and signed and unsigned image data processing.

LEADTOOLS Medical Imaging Suite v20

Develop powerful PACS and Medical imaging applications with LEADTOOLS Medical Imaging Suite. Features include LEAD's Zero-footprint HTML5 DICOM Viewer, Medical Web Viewer Framework, Medical 3D, DICOM Multimedia codecs, high and low-level PACS SCP and SCU functions and controls, secure PACS communication, Print to PACS, comprehensive DICOM data set support, image annotation, extended grayscale image display such as window level and LUT processing, DICOM Hanging Protocol, and specialized medical image processing. Other features include lossless JPEG compression, JPIP, and signed and unsigned image data processing.