OcrProcessor only reads PDF page title image when there's plenty of text?

Question

Using the code below I read the text from a one page document. The text result is: "QQD COMFORT C700 LETTINGS " - No other text is returned? Note: Please see attached PDF (zipped) for source document.Nuget package versions are at the bottom of this message. private static string OCRprocessPDF(string pdfIn) { //Initialize the OCR processor. using (OCRProcessor processor = new OCRProcessor()) { //Load an existing PDF document. SKBitmap sKBitmap = new SKBitmap(); FileStream stream = new FileStream(pdfIn, FileMode.Open, FileAccess.Read); PdfLoadedDocument pdfLoadedDocument = new PdfLoadedDocument(stream); //Set OCR language to process. processor.Settings.Language = Languages.English; //Process OCR by providing the PDF document. var textFound = processor.PerformOCR(pdfLoadedDocument); //Close the document. pdfLoadedDocument.Close(true); return textFound; } }// Visual Studio packages net7.0 enable enable Attachment: Dallas_2a210f6.zip

Karmegam Seerangan · Accepted Answer

Hi Russell,We are validated the reported issue on our end. OCR
processor recognize the text from images only. The input documents contain only
one image, so that image result only return to you. However, we have attached
the sample to get your excepted output by converting the pdf page to image and
then send the image to Perform OCR.Sample Link : https://www.syncfusion.com/downloads/support/directtrac/general/ze/PerformOCR_Image786831718.zipIncase
If your input image quality is very low, we recommend you to try the OCR
processor with tessdata_best to get better results. You can get the
tessdata_best from below link,tessdata_best: https://github.com/tesseract-ocr/tessdata_bestRegards,Karmegam S