We use cookies to give you the best experience on our website. If you continue to browse, then you agree to our privacy policy and cookie policy. (Last updated on: November 16, 2018).
Unfortunately, activation email could not send to your email. Please try again.
Syncfusion Feedback

Using pdf with text for ocr

Thread ID:





127231 Nov 4,2016 07:43 AM UTC Nov 7,2016 12:22 PM UTC ASP.NET MVC 1
Tags: PDF
Asked On November 4, 2016 07:43 AM UTC

I use a pdf file for ocr and if the pdf is some kind of text(I can copy it) it does not work, but if the text is an image in pdf it works.
What to do ?

Sasi Kumar Sekar [Syncfusion]
Replied On November 7, 2016 12:22 PM UTC

Thank you for your update, 
We have Ocr-ed the different types of input PDF files in our side, the mentioned text is not coping issue is not reproduce.  
Please find the code snippet and sample for Ocr the PDF file. 
Code snippet: 
string Tesserctbinaries = Server.MapPath("~/Tesseract binaries"); 
string Testdata = Server.MapPath("~/Tessdata"); 
//Initialize the OCR processor 
using (OCRProcessor processor = new OCRProcessor(Tesserctbinaries)) 
   //Load the PDF document  
   PdfLoadedDocument lDoc = new     PdfLoadedDocument(Server.MapPath("~/App_Data/Region.pdf")); 
   //Language to process the OCR 
   processor.Settings.Language = Languages.English; 
   processor.PerformOCR(lDoc, Testdata); 
Sample link: 
So kindly provide the input document it will helpful us to analyze and provide the solution earlier. 
Sasi kumar S. 


This post will be permanently deleted. Are you sure you want to continue?

Sorry, An error occured while processing your request. Please try again later.

Please sign in to access our forum

This page will automatically be redirected to the sign-in page in 10 seconds.

Warning Icon You are using an outdated version of Internet Explorer that may not display all features of this and other websites. Upgrade to Internet Explorer 8 or newer for a better experience.Close Icon