
Using a PDF with text for OCR

Thread ID: 127231
Created: Nov 4, 2016 03:43 AM
Updated: Nov 7, 2016 07:22 AM
Platform: ASP.NET MVC
Replies: 1
Tags: PDF
Testname
Asked On November 4, 2016 03:43 AM

I am running OCR on a PDF file. If the PDF contains actual text (text that I can select and copy), OCR does not work, but if the text in the PDF is an image, it works.
What should I do?

Sasi Kumar Sekar [Syncfusion]
Replied On November 7, 2016 07:22 AM

Hi, 
 
Thank you for your update.

We ran OCR on several different types of input PDF files on our side, but could not reproduce the reported issue.
Please find below the code snippet and sample for running OCR on a PDF file.
Code snippet:
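// Requires the Syncfusion.OCRProcessor and Syncfusion.Pdf.Parsing namespaces
// Paths to the Tesseract binaries and the language data (tessdata) folders
string tesseractBinaries = Server.MapPath("~/Tesseract binaries");
string tessData = Server.MapPath("~/Tessdata");
// Initialize the OCR processor
using (OCRProcessor processor = new OCRProcessor(tesseractBinaries))
{
    // Load the PDF document
    PdfLoadedDocument lDoc = new PdfLoadedDocument(Server.MapPath("~/App_Data/Region.pdf"));
    // Language to process the OCR
    processor.Settings.Language = Languages.English;
    // Run OCR on the loaded document using the language data
    processor.PerformOCR(lDoc, tessData);
    // Save the processed document
    lDoc.Save(Server.MapPath("~/Output/sample.pdf"));
    // Release the document's resources
    lDoc.Close(true);
}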
 
 
Sample link: 
 
Please share the input document so that we can analyze the issue and provide a solution sooner.
 
Regards, 
Sasi kumar S. 

