ExtractText is missing |
At present, the Syncfusion.Pdf.Portable library does not support text extraction. However, we plan to implement this feature in Syncfusion.Pdf.Portable. It will be included in the 2017 Volume 2 Service Pack 1 release, which is expected at the end of May 2017. |
how can I get coordinates? |
This can be achieved with the FindText feature of the Syncfusion PDF library. At present, however, FindText is not supported on the ASP.NET Core platform. We plan to implement it in the Syncfusion.Pdf.Portable library. It will be included in the 2017 Volume 3 release, which is expected at the end of July 2017. |
Since ExtractText and text coordinates are not available in Syncfusion.Pdf.Portable, how can I access internal PDF objects using Syncfusion.Pdf.Portable?
At the moment I can access the Page object, but I would like to access the obj ... endobj content.
Any information would be appreciated.
Cheers
Hi folks,
I tested ExtractText, but I don't get the text of any page. I tried ExtractText(), ExtractText(true), and ExtractText(false), and I only get escape characters. I tested several PDFs.
Should ExtractText return all of the page text separated by \r\n?
Cheers
Details |
Query |
I tested ExtractText, but I don't get the text of any page. I tried ExtractText(), ExtractText(true), and ExtractText(false), and I only get escape characters. I tested several PDFs. |
We regret to let you know that we could not reproduce the issue “Text extraction is not working properly” in an ASP.NET Core application. We have created a sample for your reference, which you can download from the following link.
We suspect that the reported issue is specific to the PDF document. Please share the PDF document in which you faced the issue. This will help us analyze the problem further and assist you better.
Note: Text extraction in .NET Core is available from version 1.4 of the .NET Standard library. Please reference the “Syncfusion.Compression.NETStandard14” and “Syncfusion.Pdf.NETStandard14” packages to extract text from a PDF document at the application level. |
Should ExtractText return all of the page text separated by \r\n? |
The escape sequence “\r\n” marks the line breaks in the text content of the pages of the PDF document. |
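For reference, here is a minimal sketch of page-by-page extraction. It assumes the PdfLoadedDocument and ExtractText API from the “Syncfusion.Pdf.NETStandard14” package mentioned in the note above. The file name "input.pdf" and the class name are placeholders for illustration only, not the sample referred to in this thread.

```csharp
using System;
using System.IO;
using Syncfusion.Pdf;
using Syncfusion.Pdf.Parsing;

class ExtractTextSketch
{
    static void Main()
    {
        // "input.pdf" is a placeholder path (assumption), not a file from this thread.
        using (FileStream stream = new FileStream("input.pdf", FileMode.Open, FileAccess.Read))
        {
            // Load the existing PDF document from the stream.
            PdfLoadedDocument document = new PdfLoadedDocument(stream);

            for (int i = 0; i < document.Pages.Count; i++)
            {
                // ExtractText is called on one page at a time.
                PdfPageBase page = document.Pages[i];
                string text = page.ExtractText();

                // Split on "\r\n" to get the individual lines of the page.
                string[] lines = text.Split(new[] { "\r\n" }, StringSplitOptions.RemoveEmptyEntries);
                Console.WriteLine("Page {0}: {1} line(s)", i + 1, lines.Length);
            }

            // Close the document and release its resources.
            document.Close(true);
        }
    }
}
```

If the line count is zero for every page, the extracted string probably contains only line breaks. That would match the “only escape characters” symptom, and it may indicate that the document stores its text in a form the extractor cannot decode (for example, scanned images).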
I can't send you the PDF; it contains personal information.
Please close this thread. I will use another provider's component.