
How to extract the text from specific coordinates of the PDF document

Platform: ASP.NET Web Forms |
Control: PdfViewer |
Published Date: August 6, 2018 |
Last Revised Date: September 6, 2018

Currently, the PDF viewer cannot extract text from a PDF document on the Web platform. As a workaround, extract the text on the server side of an ASP.NET Web Forms application by using the PdfLoadedDocument class. The following markup and code-behind upload a PDF file and display the extracted text of every page in a text box.

HTML

<asp:Content ID="BodyContent" runat="server" ContentPlaceHolderID="MainContent">
    <table width="70%" style="height: 31px;left:15%;position:absolute">
        <tr style="width: 100%;text-align:center">
            <td>
                <asp:FileUpload ID="FileUpload1" runat="server" />
                <asp:Button ID="Button1" runat="server" onclick="Button1_Click" Text="ExtractText" />
            </td>
        </tr>
    </table>
    <div style="left:15%;position:absolute; top:88px">
        <asp:TextBox ID="TextBox1" runat="server" TextMode="multiline" Height="259px" Width="888px"></asp:TextBox>
    </div>
</asp:Content>

 

C#

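// Required namespaces: System, System.IO, System.Text, Syncfusion.Pdf, Syncfusion.Pdf.Parsing
protected void Button1_Click(object sender, EventArgs e)
{
    // Continue only when a file with a .pdf extension (in any letter case) was uploaded
    if (FileUpload1.HasFile &&
        Path.GetExtension(FileUpload1.FileName).Equals(".pdf", StringComparison.OrdinalIgnoreCase))
    {
        // Get the PDF file stream
        Stream pdfStream = FileUpload1.PostedFile.InputStream;
        // Create an instance of PdfLoadedDocument
        PdfLoadedDocument ldoc = new PdfLoadedDocument(pdfStream);
        StringBuilder text = new StringBuilder();
        // Iterate over the pages of the loaded document
        foreach (PdfLoadedPage page in ldoc.Pages)
        {
            // Extract the text from the current page
            text.Append(page.ExtractText());
        }
        // Release the resources held by the document
        ldoc.Close(true);
        // Show the extracted text in the text box
        TextBox1.Text = text.ToString();
    }
}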
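
The snippet above returns the text of whole pages. To narrow the result to specific coordinates, one possible approach is to filter text lines by their bounding rectangles. The following is a minimal sketch, not the documented method: RegionTextFilter and ExtractFromRegion are hypothetical names, and the helper works on plain text and rectangle pairs so that it does not depend on any particular library version. It assumes the installed Syncfusion version can report line bounds (for example, through an ExtractText overload that returns a TextLineCollection); if it cannot, this approach does not apply.

```csharp
using System.Collections.Generic;
using System.Drawing;
using System.Text;

public static class RegionTextFilter
{
    // Hypothetical helper (not part of the Syncfusion API): joins the text of
    // every line whose bounding box overlaps the requested region.
    // Each entry pairs a line's text (Key) with its bounds on the page (Value).
    public static string ExtractFromRegion(
        IEnumerable<KeyValuePair<string, RectangleF>> lines, RectangleF region)
    {
        StringBuilder result = new StringBuilder();
        foreach (KeyValuePair<string, RectangleF> line in lines)
        {
            // IntersectsWith is true only when the rectangles share some area
            if (region.IntersectsWith(line.Value))
            {
                result.AppendLine(line.Key);
            }
        }
        return result.ToString();
    }
}
```

In Button1_Click, the pairs would be built from each line's text and bounds as reported by the library, and the region would use the same coordinate units as those bounds.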

 

Sample:

http://www.syncfusion.com/downloads/support/directtrac/general/ze/ExtractText1108197466
