
How to extract text from a predefined rectangle in a PDF document?

Platform: WinForms
Control: PdfViewer
Tags: pdf, extracttext

You can extract text from a given rectangular region of a page by exporting the page as an image, cropping (cloning) the exported image to that rectangle, and performing OCR on the cropped image. The rectangle is specified in points, so it must be converted to pixels before cropping. Refer to the following code snippet.



using System.Drawing;
using Syncfusion.OCRProcessor;
using Syncfusion.Pdf.Parsing;

PdfLoadedDocument loadedDocument = new PdfLoadedDocument("../../Data/Barcode.pdf");
using (OCRProcessor processor = new OCRProcessor("../../Tesseract binaries"))
{
    //Language to process the OCR
    processor.Settings.Language = Languages.English;
    //Export the first page (index 0) as an image
    Bitmap image = loadedDocument.ExportAsImage(0);
    //Define the bounds (in points) from which the text is to be extracted
    Rectangle textBounds = new Rectangle(274, 53, 68, 15);
    //Point to pixel conversion (96 / 72 = 1.3333, which assumes a 96 DPI image)
    RectangleF rect = new RectangleF(textBounds.X * 1.3333f, textBounds.Y * 1.3333f, textBounds.Width * 1.3333f, textBounds.Height * 1.3333f);
    //Crop the page image to the converted bounds and run OCR on the cropped image
    using (Bitmap clonedImage = image.Clone(rect, System.Drawing.Imaging.PixelFormat.Format32bppArgb))
    {
        string ocrText = processor.PerformOCR(clonedImage, @"../../Tessdata/");
    }
}
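
The 1.3333 factor in the point-to-pixel step appears to be 96 / 72, that is, the ratio of an assumed 96 DPI image resolution to the 72 points per inch used by PDF coordinates. If the exported image has a different resolution, the same conversion can be written against an explicit DPI value. The following is a minimal sketch of that idea; the `PointToPixel` class and `ToPixels` method are hypothetical names introduced for illustration and are not part of the Syncfusion API.

```csharp
using System;
using System.Drawing;

public static class PointToPixel
{
    // PDF coordinates are expressed in points: 72 points per inch.
    private const float PointsPerInch = 72f;

    // Hypothetical helper: scales a rectangle given in points to pixels
    // for an image rendered at the given resolution (dots per inch).
    public static RectangleF ToPixels(RectangleF boundsInPoints, float dpi)
    {
        float scale = dpi / PointsPerInch; // 96 DPI gives 1.3333
        return new RectangleF(
            boundsInPoints.X * scale,
            boundsInPoints.Y * scale,
            boundsInPoints.Width * scale,
            boundsInPoints.Height * scale);
    }

    public static void Main()
    {
        // Same bounds as in the snippet above, converted assuming 96 DPI.
        RectangleF rect = ToPixels(new RectangleF(274, 53, 68, 15), 96f);
        Console.WriteLine(rect);
    }
}
```

With a `Bitmap` in hand, the DPI argument could be taken from `image.HorizontalResolution` instead of a hard-coded 96, assuming the horizontal and vertical resolutions of the exported image are equal.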




