
LEADTOOLS Distributed Computing SDK

Cloud SDK

The LEADTOOLS Distributed Computing SDK provides developers with a complete framework for creating powerful cloud-based applications. Using their own existing network of servers and worker machines, developers can easily create any distributed, grid or parallel computing application, such as:

  • Audio/video conversion and recompression
  • Optical Character Recognition (OCR)
  • Forms Recognition and Processing
  • Barcode Recognition
  • Web crawlers
  • Distributed graphics rendering

Developing an application with the LEADTOOLS Distributed Computing SDK can save significant time and money by using existing infrastructure to eliminate bottlenecks in processor-intensive, business-critical activities.

Overview of LEADTOOLS Distributed Computing SDK Technology

Framework Components of the LEADTOOLS Distributed Computing SDK

The LEADTOOLS Distributed Computing SDK framework can be broken down into three individual components described below.

Cloud SDK Diagram

Client

The clients exist outside of the cloud and can be any type of computer or mobile device. Communication between the clients and the central server is based on standard web services allowing you to write applications that run on any platform.

Central Server

Within the cloud, the central server acts as the primary interface between the clients and the worker machines. The primary responsibilities of the central server include:

  • Host the web service used to communicate with clients.
  • Manage worker machine settings such as job type, number of jobs, etc. Storing these settings in a central location makes it simple for administrators to make global changes within the cloud regardless of where worker machines are physically located.
  • Manage the cloud database, which stores all of the jobs and any information related to each job.

Worker Machines

The worker machines perform the actual work of the distributed application. There is no limit to the number of worker machines within the cloud, and they can be hot-plugged or hot-swapped as needed without interrupting service for the clients. Each worker can take on as much or as little work as needed using customizable configuration settings such as:

  • Job Type
  • Maximum percent of CPU usage
  • Number of CPU Cores
  • Number of threads
  • Number of jobs
  • Time of day
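As a rough illustration of how settings like these could gate the work a machine accepts, here is a minimal Python sketch. The class, field and function names are hypothetical and chosen only for this example; they are not part of the LEADTOOLS API, and only a subset of the settings listed above is modeled.

```python
from dataclasses import dataclass
from datetime import time


@dataclass
class WorkerSettings:
    """Hypothetical per-worker limits; names are illustrative, not SDK names."""
    job_types: frozenset   # job types this worker accepts
    max_cpu_percent: int   # refuse new work at or above this CPU load
    max_jobs: int          # maximum number of concurrent jobs
    active_from: time      # start of the allowed time-of-day window
    active_until: time     # end of the allowed time-of-day window


def can_accept(settings, job_type, cpu_percent, running_jobs, now):
    """Return True if the worker may take one more job of job_type right now."""
    if job_type not in settings.job_types:
        return False
    if cpu_percent >= settings.max_cpu_percent:
        return False
    if running_jobs >= settings.max_jobs:
        return False
    start, end = settings.active_from, settings.active_until
    if start <= end:
        return start <= now < end
    # The window wraps past midnight, for example 22:00 to 06:00.
    return now >= start or now < end


# A desktop PC that only helps out overnight, when its user is away.
night_worker = WorkerSettings(
    job_types=frozenset({"ocr", "barcode"}),
    max_cpu_percent=80,
    max_jobs=4,
    active_from=time(22, 0),
    active_until=time(6, 0),
)
```

With these assumed values, `can_accept(night_worker, "ocr", 10, 0, time(23, 30))` is true, while the same request at noon, or for an unsupported job type, is rejected.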

Benefits of Distributed Applications in the Cloud

Speed

The benefits of parallel processing are well established, but even the most advanced and powerful computers will encounter bottlenecks. For example, a computer with eight cores performing OCR on a 100-page document can only process up to eight pages at a time. Using the cloud, and given enough worker machines on the network, it is possible to OCR and convert that same document in virtually the same time it takes to OCR and convert a single page.
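The arithmetic behind this example can be sketched in a few lines of Python. The model is deliberately simplified and rests on assumptions not stated in the text: each core handles one page at a time, every page takes the same time, and the overhead of splitting, transferring and reassembling the document is ignored. The function name is hypothetical.

```python
import math


def processing_rounds(pages, machines, cores_per_machine):
    """Number of page-sized rounds needed when every core handles one page at a time."""
    slots = machines * cores_per_machine
    return math.ceil(pages / slots)


# One eight-core computer: 100 pages, 8 at a time.
single_machine = processing_rounds(100, machines=1, cores_per_machine=8)

# Thirteen eight-core workers provide 104 slots, enough for all 100 pages at once.
worker_farm = processing_rounds(100, machines=13, cores_per_machine=8)

print(single_machine, worker_farm)  # prints "13 1"
```

Under these assumptions, the single machine needs 13 rounds of page processing while the farm needs one, which is the "single page" time described above.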

Additionally, older hardware and less powerful devices such as mobile phones can utilize cloud applications to accomplish tasks with the same speed and efficiency as a high-end server.

Reduced Costs

The cloud provides huge potential for significant cost savings. Server hardware is significantly more expensive than desktop PCs.

  • Avoid expensive server hardware - Several inexpensive desktops can be purchased and linked together to provide the same or better speed than a pricey server.
  • More efficient use of existing hardware - Computers utilizing minimal CPU capacity can be used as worker machines without any noticeable performance degradation to the user's regular tasks.
  • Lower hardware specifications - Clients connecting to the cloud require less processing power which means each user's machine is cheaper and has a longer lifetime.
  • Inexpensive upgrades - Worker machines can be added and removed

Additionally, the LEADTOOLS Distributed Computing SDK makes it easy for developers to create the cloud for themselves, eliminating the need for third party cloud computing services.

Dependability

Distributed applications running across multiple machines are more dependable and easier to maintain than their counterparts running on a single machine. A well-designed cloud application will suffer no downtime during planned maintenance, hardware failures, virus infections and even power outages or natural disasters. As long as there are still machines within the cloud, there is no disruption of service to the client. The central server and worker machines implement fail-safe measures that retry, restart and redistribute jobs.
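The retry-and-redistribute idea can be illustrated with a short Python sketch. This is not how the SDK implements its fail-safe measures, which the text does not describe; it is a generic pattern under stated assumptions. Workers are modeled as plain functions, a failure is any raised exception, and `run_jobs`, `broken` and `healthy` are hypothetical names.

```python
from collections import deque


def run_jobs(jobs, workers, max_attempts=3):
    """Run each job, handing it to the next worker whenever the current one fails."""
    # Each pending entry is (job, attempts so far, index of the worker to try).
    pending = deque((job, 0, i % len(workers)) for i, job in enumerate(jobs))
    results, failed = {}, []
    while pending:
        job, attempts, index = pending.popleft()
        try:
            results[job] = workers[index](job)
        except Exception:
            if attempts + 1 < max_attempts:
                # Redistribute: retry the job on a different worker.
                pending.append((job, attempts + 1, (index + 1) % len(workers)))
            else:
                failed.append(job)
    return results, failed


def broken(job):
    """Stand-in for a worker machine that has gone offline."""
    raise RuntimeError("worker unavailable")


def healthy(job):
    """Stand-in for a working machine; the 'work' is just upper-casing."""
    return job.upper()


results, failed = run_jobs(["a", "b", "c"], [broken, healthy])
print(results, failed)
```

In this run, jobs first assigned to the broken worker are moved to the healthy one, so all three jobs complete and the failed list stays empty.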

Platform Independence

Since the primary work is done within the cloud, virtually any client is able to access the application. While the actual cloud must be implemented on machines running Windows, the clients can access the cloud using Macs, iPads, mobile devices or anything with a network or Internet connection.

Why Use LEADTOOLS Distributed Computing SDK?

LEADTOOLS Distributed Computing SDK can be used in any scenario where you want to perform some type of automated job processing on a farm of computers. Although this leaves a wide range of opportunities, consider the following scenarios and how the LEADTOOLS Distributed Computing SDK can be used to successfully implement a powerful and dynamic solution:

  1. My video is too large and takes too long to convert.
    Decoding and encoding multimedia files can be a lengthy and processor-intensive task. By using a cloud-based service for large multimedia files, the client can use the farm of worker machines to split large files, convert each piece separately, and re-multiplex the pieces in a fraction of the time required to convert the same file on a single machine.
  2. There are too many documents to process and OCR.
    A cloud-based application can divide the workload between a farm of worker machines. Once the document conversion or text extraction is complete, the data can be sent back to the client, archived in a database or handled however the application architecture requires. This process can be extremely flexible, dedicating some workers to document cleanup while others perform the OCR. If the files themselves are large, each document can be broken up into individual pages, processed, and pieced back together.
  3. I don't want to bog down my computer with mindless tasks.
    The LEADTOOLS Distributed Computing SDK can be used for any distributed computing application and is ideal for tasks that require little or no user interaction and spawn other tasks. For example, one can use a farm of worker machines to compute complex mathematical and statistical data or to crawl and index websites.
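The first two scenarios share a split, process and reassemble pattern. The Python sketch below shows that pattern in miniature, with a local thread pool standing in for the farm of worker machines. It does not use the LEADTOOLS API: `recognize_page` and `process_document` are hypothetical names, and the per-page "OCR" is a placeholder that simply upper-cases text.

```python
from concurrent.futures import ThreadPoolExecutor


def recognize_page(page):
    """Placeholder for per-page work such as OCR; here it only upper-cases the text."""
    return page.upper()


def process_document(pages, worker_count=4):
    """Process pages in parallel and return the results in the original page order."""
    with ThreadPoolExecutor(max_workers=worker_count) as pool:
        # Executor.map yields results in input order, so the pages can be
        # pieced back together without any extra bookkeeping.
        return list(pool.map(recognize_page, pages))


document = ["page one", "page two", "page three"]
result = process_document(document)
print(result)  # prints "['PAGE ONE', 'PAGE TWO', 'PAGE THREE']"
```

The same shape applies to video conversion: the pieces are file segments rather than pages, and the final step re-multiplexes them instead of collecting a list.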

LEADTOOLS SDK Products that Include Distributed Computing Technology


LEADTOOLS Recognition Imaging SDK

The LEADTOOLS Recognition Imaging SDK is a handpicked collection of LEADTOOLS SDK features designed to build end-to-end document imaging applications as part of an enterprise level document automation solution that requires scanning, OCR, OMR, forms recognition and processing, archival, annotation and display functionality. This powerful set of tools utilizes LEAD's industry LEADing image processing technology to intelligently identify document features that can be used to recognize any type of scanned or faxed form image.

LEADTOOLS Document Imaging Suite SDK

The LEADTOOLS Document Imaging Suite SDK is a comprehensive collection of LEADTOOLS SDK features designed to build end-to-end document imaging applications within enterprise level document automation solutions that require capture, OCR, OMR, forms recognition and processing, PDF, print capture, archival, annotation and display functionality. This powerful set of tools utilizes LEAD's industry LEADing image processing technology to intelligently identify document features that can be used to recognize any type of scanned or faxed form image.

LEADTOOLS Multimedia Suite SDK

The Multimedia Suite SDK is a comprehensive collection of multimedia development technology from LEAD Technologies. By bundling features from the Multimedia SDK, DVD Module, Video Streaming Module, MPEG-2 Transport Module, Imaging Pro SDK and Distributed Computing SDK, the Multimedia Suite provides everything a developer needs to create enterprise level multimedia applications for a wide variety of industries including defense, broadcast, security and more.

LEADTOOLS OCR Module - Advantage

The LEADTOOLS Advantage OCR Module adds methods for incorporating optical character recognition (OCR), intelligent character recognition (ICR) and magnetic ink character recognition (MICR) technology into applications and includes everything needed to develop robust, high performance and scalable image recognition solutions. The LEADTOOLS Advantage OCR Module seamlessly integrates with LEADTOOLS SDKs in the Document and Medical product lines.

LEADTOOLS OCR Module - Professional

The LEADTOOLS Professional OCR Module adds methods for incorporating optical character recognition (OCR) technology into applications and includes everything needed to develop robust, high performance and scalable image recognition solutions. The LEADTOOLS Professional OCR Module seamlessly integrates with LEADTOOLS SDKs in the Document and Medical product lines.
