We use cookies to give you the best experience on our website. If you continue to browse, then you agree to our privacy policy and cookie policy.
Unfortunately, activation email could not send to your email. Please try again.

How to extract the text from specific coordinates of the PDF document

Currently, PDF viewer does not extract the text from the PDF document in Web platform. However, as a workaround, the text from the PDF document is extracted using the PdfLoadedDocument in the server-side of ASP.NET Web forms. Refer to the following code snippet.

HTML

<asp:Content ID="BodyContent" runat="server" ContentPlaceHolderID="MainContent">

     <table width="70%" style="height: 31px;left:15%;position:absolute">

           <tr style="width: 100%;text-align:center">

                <td >

                    <asp:FileUpload ID="FileUpload1" runat="server" />

                    <asp:Button ID="Button1" runat="server" onclick="Button1_Click"

                        Text="ExtractText" />

                </td>               

            </tr>

             </table>

    <div style="left:15%;position:absolute; top:88px">

     <asp:TextBox ID="TextBox1" runat="server" TextMode="multiline" Height="259px" Width="888px"></asp:TextBox>

        </div>

</asp:Content>

 

C#

protected void Button1_Click(object sender, EventArgs e)

        {

            if (System.IO.Path.GetExtension(FileUpload1.PostedFile.FileName).Equals(".pdf"))

            {

                //Get the PDF file stream

                Stream pdfStream = FileUpload1.PostedFile.InputStream;

                //Create an instance for PdfLoadedDocument

                PdfLoadedDocument ldoc = new PdfLoadedDocument(pdfStream);

                //Load the document in the viewer

                foreach (PdfLoadedPage page in ldoc.Pages)

                {

                    //Extract the text from the pages of PDF document

                    TextBox1.Text += page.ExtractText();

                }

            }          

        }

 

Sample:

http://www.syncfusion.com/downloads/support/directtrac/general/ze/ExtractText1108197466

Article ID: Published Date: Last Revised Date: Platform: Control:
9084 08/06/2018 09/06/2018 ASP.NET Web Forms PdfViewer
Did you find this information helpful?
Add Comment
You must log in to leave a comment

You are using an outdated version of Internet Explorer that may not display all features of this and other websites. Upgrade to Internet Explorer 8 or newer for a better experience.