We use cookies to give you the best experience on our website. If you continue to browse, then you agree to our privacy policy and cookie policy. Image for the cookie policy date
Unfortunately, activation email could not send to your email. Please try again.
Syncfusion Feedback

How to extract text from the predefined rectangle from a PDF document?

Platform: WPF |
Control: PdfViewer
Tags: pdf, extracttext

In WinForms PDF Viewer you can extract text from the given rectangular region on a page by exporting the page to an image, cloning the exported image with the given rectangle, and performing OCR on the cloned image. Refer to the following code snippet.

 

C#

PdfLoadedDocument loadedDocument = new PdfLoadedDocument("../../Data/Barcode.pdf");
            using (OCRProcessor processor = new OCRProcessor("../../Tesseract binaries"))
            {
                //Language to process the OCR
                processor.Settings.Language = Languages.English;
                Bitmap image = loadedDocument.ExportAsImage(0);
                //Define the bounds from which the text to be extracted
                Rectangle textBounds = new Rectangle(274, 53, 68, 15);
                //Point to Pixel conversion
                RectangleF rect = new RectangleF(textBounds.X * 1.3333f, textBounds.Y * 1.3333f, textBounds.Width * 1.3333f, textBounds.Height * 1.3333f);
                using (Bitmap clonedImage = image.Clone(rect, System.Drawing.Imaging.PixelFormat.Format32bppArgb))
                {
                    string ocrText = processor.PerformOCR(clonedImage, @"../../Tessdata/");                 
                }
                image.Dispose();
            }
 

 

Sample link:

https://www.syncfusion.com/downloads/support/directtrac/general/ze/ExtractTextFromBounds_(2)-259553055

Conclusion

I hope you enjoyed learning about how to extract text from the predefined rectangle from a PDF document. 

You can refer to our  WinForms PDF Viewer’s feature tour page to know about its other groundbreaking feature representations. You can also explore our WinForms PDF Viewer documentation to understand how to present and manipulate data. 

 

For current customers, you can check out our WinForms from the License and Downloads page. If you are new to Syncfusion, you can try our 30-day free trial to check out our WinForms PDF Viewer and other WinForms components.

 

If you have any queries or require clarifications, please let us know in comments below. You can also contact us through our support forumsDirect-Trac, or feedback portal. We are always happy to assist you!

 

2X faster development

The ultimate WPF UI toolkit to boost your development speed.
ADD COMMENT
You must log in to leave a comment

Please sign in to access our KB

This page will automatically be redirected to the sign-in page in 10 seconds.

Up arrow icon

Warning Icon You are using an outdated version of Internet Explorer that may not display all features of this and other websites. Upgrade to Internet Explorer 8 or newer for a better experience.Close Icon

Live Chat Icon For mobile