


Overview

The Syncfusion .NET Optical Character Recognition (OCR) Library extracts text from scanned PDFs and images. With a few lines of C# code, a scanned PDF document containing raster images can be converted into a searchable and selectable PDF document. You can save the OCR result as text, structured data, or a searchable PDF document. The .NET OCR Library is built on the Tesseract OCR engine.


Why Syncfusion’s OCR

Cross-platform support

Designed for C# and VB.NET applications running on .NET 6, .NET 5, .NET Core, .NET Standard, or .NET Framework.

Cloud platform

Works on cloud platforms such as Azure (Web Apps, Websites, Web Services, and Functions) and AWS (EC2, Lambda).

Customize OCR engine

By default, the OCR library uses the Tesseract OCR engine. Other external OCR services from Microsoft Azure, AWS, Google, and more can also be used.

International languages

The OCR engine supports 120+ languages. More than one language can be enabled at a time to read documents that mix languages.


Create searchable PDF

Perform OCR on the entire scanned PDF document and convert it into a searchable PDF document.

Image to searchable PDF/A

Make images searchable and selectable by converting them to PDF or PDF/A documents using OCR.

Extract text from an image

Extract the text from a single scanned image or from a multi-page TIFF image.

Zonal text extraction

Extract data from PDFs and images by restricting OCR to a particular region in the PDF or image.

OCR on a rotated page

Extract the text from a rotated scanned page of a PDF document and convert the document to a searchable PDF.

Improved accessibility

Automatically convert images into an accessible PDF (PDF/UA) document by applying necessary tags to the hidden text, so that text in the PDF document is machine readable.

Post-processing

After OCR, you can programmatically highlight, underline, and strike through the text of a PDF document. You can also redact, edit, and digitally sign the PDF document.


Convert scanned PDF to a searchable PDF in C#

Convert the scanned PDF document to a searchable PDF document using the Syncfusion OCR Library with just a few lines of C# code as demonstrated below.

using System.IO;
using Syncfusion.OCRProcessor;
using Syncfusion.Pdf.Parsing;

//inputStream is assumed to be an open, readable Stream over the scanned PDF
//Initialize the OCR processor by providing the path of Tesseract binaries
using (OCRProcessor processor = new OCRProcessor(@"TesseractBinaries/"))
{
    //Load the existing PDF document
    PdfLoadedDocument loadedDocument = new PdfLoadedDocument(inputStream);
    //Set the OCR language
    processor.Settings.Language = Languages.English;
    //Process OCR by providing the PDF document and Tesseract language data
    processor.PerformOCR(loadedDocument, @"TessData/");
    //Save the OCRed document to a memory stream
    MemoryStream stream = new MemoryStream();
    loadedDocument.Save(stream);
    //Close the PDF document and release its resources
    loadedDocument.Close(true);
}
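
The zonal text extraction feature described earlier can be sketched as a small variation of the sample above. This is a minimal sketch rather than a definitive implementation: the PageRegion type, its PageIndex and PageRegions members, and the Settings.Regions property are written as recalled from the library's documentation and should be verified against the installed version. The file names, folder paths, and rectangle coordinates are placeholders chosen only for illustration.

```csharp
using System.Collections.Generic;
using System.Drawing; // On .NET Core builds, RectangleF may come from Syncfusion.Drawing instead (assumption)
using System.IO;
using Syncfusion.OCRProcessor;
using Syncfusion.Pdf.Parsing;

class ZonalOcrSketch
{
    static void Main()
    {
        // "TesseractBinaries/", "TessData/", "Input.pdf", and "Output.pdf" are placeholder paths
        using (OCRProcessor processor = new OCRProcessor(@"TesseractBinaries/"))
        using (FileStream inputStream = File.OpenRead("Input.pdf"))
        {
            // Load the scanned PDF document
            PdfLoadedDocument loadedDocument = new PdfLoadedDocument(inputStream);
            processor.Settings.Language = Languages.English;

            // Restrict OCR to one rectangle on the first page
            // (PageRegion and Settings.Regions are assumed API names; coordinates are illustrative)
            PageRegion region = new PageRegion();
            region.PageIndex = 0;
            region.PageRegions = new RectangleF[] { new RectangleF(0, 100, 950, 150) };
            processor.Settings.Regions = new List<PageRegion> { region };

            // Run OCR with the Tesseract language data folder
            processor.PerformOCR(loadedDocument, @"TessData/");

            // Write the processed document to disk, then close it
            using (FileStream outputStream = File.Create("Output.pdf"))
            {
                loadedDocument.Save(outputStream);
            }
            loadedDocument.Close(true);
        }
    }
}
```

Limiting OCR to a region is typically useful for fixed-layout documents such as invoices or forms, where only a known area (for example, a header or a total field) needs to be read.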


PRICING

Developer Platform

Essential Studio

$ 2,495

Per developer 1st year

Includes 1,700+ Controls
Best Value

Unlimited Global License

Cover everyone in your organization for one low annual fee

CONTACT US

File Formats

$ 995

Per developer 1st year

DEVELOPER PLATFORM
  • 80+ Components each for JavaScript, Angular, jQuery, React, and Vue.
  • 75+ ASP.NET Web Forms Controls
  • 80+ Blazor Components
  • 80+ ASP.NET MVC Controls
  • 80+ ASP.NET Core Controls
  • File Formats (Word, Excel, PowerPoint, and PDF)
  • Report Viewer SDK
  • 30+ WinUI Controls
  • 25+ .NET MAUI Controls
  • 155+ Xamarin Controls (includes iOS, Android, and UWP)
  • 100+ WinForms Controls
  • 100+ WPF Controls
  • 70+ UWP Controls
  • 60+ PHP Controls
  • 60+ JSP Controls
  • Flutter Widgets
  • Predictive Analytics
SUPPORT
  • Dedicated Account Representative
  • Quarterly product releases
  • Support within 24 hours on all business days
  • Live chat support on all business days
LICENSING
  • Licensing Management Portal
  • Commercial Use
  • Max developer count: 1 per license
  • Coverage period: 1 year
RESOURCES

CASE STUDY

Syncfusion’s file format components helped me create the reports I needed, fast. – J. Pereira, Software Developer.

The libraries have been built from scratch and refined for more than a decade to provide blazing-fast performance, a comprehensive API, and compatibility across the latest and older versions of these file formats.


VIDEOS

Syncfusion File Format Libraries - Manipulate Excel, Word, PowerPoint, and PDF files

Read and write Excel, Word, PDF, and PowerPoint files. Also includes integrated visualization capabilities. Advanced features include support for pivot tables, pivot charts, mail-merge, and extensive formatting.

E-BOOK

Succinctly Series: Statistics Using Excel Succinctly, by Charles Zaiontz

Awards

Greatness—it’s one thing to say you have it, but it means more when others recognize it. Syncfusion is proud to hold the following industry awards.
