PDF Loaded Document - not reading in spaces
I'm using PdfLoadedDocument and then page.extractText(true) to read a PDF document that contains only text.
While the text is all there, the spaces between words are frequently missing.
For instance 'One Two Three' becomes 'OneTwoThree'.
I'm wondering if there is another option that I can use to retrieve the spaces between words.
9 Replies
1 reply marked as answer
Akshaya Arivoli
Syncfusion Team
September 4, 2020 11:23 AM UTC
Hi Dave,
Thank you for contacting Syncfusion support.
We were unable to reproduce the reported issue with the details provided. Could you please share the PDF document and the Essential Studio version in which you are seeing the issue? These details will help us investigate further and assist you better.
Regards,
Akshaya
dave Jacob
September 6, 2020 11:02 PM UTC
Attached is the pdf document that I am using. Please let me know if I can provide any other information.
Attachment: SamplePage_66f1cdff.zip
Thanks,
Dave
Akshaya Arivoli
Syncfusion Team
September 7, 2020 12:40 PM UTC
Hi Dave,
We were able to reproduce the reported issue with the provided document using PdfLoadedDocument. However, it can be resolved using our EJ2 PDF Viewer server library, which allows you to extract the text from a page along with its bounds. Could you please try extracting the text from the provided PDF document using the link below:
Note: We have embedded the Pdfium rendering engine in our EJ2 PDF Viewer for robust rendering, so the text extraction result will be more accurate when compared to the PdfLoadedDocument.
Please try it and get back to us with more details if you need further assistance.
Regards,
Akshaya
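For readers who only have per-character bounds to work with, the idea of recovering spaces from geometry can be illustrated with a minimal sketch. This is not Syncfusion's implementation or API: the `Glyph` tuple layout, the `rebuild_spaces` function, and the `gap_ratio` threshold are all hypothetical, and the sketch assumes the glyphs belong to a single left-to-right line.

```python
from typing import List, Tuple

# Hypothetical glyph record: (character, left x, right x) on one text line.
Glyph = Tuple[str, float, float]


def rebuild_spaces(glyphs: List[Glyph], gap_ratio: float = 0.3) -> str:
    """Insert a space wherever the horizontal gap between two neighbouring
    glyphs exceeds gap_ratio times the average glyph width."""
    if not glyphs:
        return ""
    # Order glyphs left to right by their left edge.
    glyphs = sorted(glyphs, key=lambda g: g[1])
    avg_width = sum(x1 - x0 for _, x0, x1 in glyphs) / len(glyphs)
    out = [glyphs[0][0]]
    for prev, cur in zip(glyphs, glyphs[1:]):
        gap = cur[1] - prev[2]
        # Skip insertion when a real space glyph is already present.
        already_spaced = prev[0].isspace() or cur[0].isspace()
        if gap > gap_ratio * avg_width and not already_spaced:
            out.append(" ")
        out.append(cur[0])
    return "".join(out)


# Six glyphs, each 5 units wide, with a 3-unit gap after "One".
glyphs = [("O", 0, 5), ("n", 5, 10), ("e", 10, 15),
          ("T", 18, 23), ("w", 23, 28), ("o", 28, 33)]
print(rebuild_spaces(glyphs))  # One Two
```

The threshold is a judgment call: a value that is too low splits kerned letters, while one that is too high misses narrow word gaps, so it would likely need tuning per font.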
Julia Kohl
June 1, 2021 09:49 AM UTC
Hi!
I am currently facing the same issue in my Flutter app.
Is there a solution that can be used inside my app too?
Thanks
Julia
Anand Panchamoorthi
Syncfusion Team
June 2, 2021 12:43 PM UTC
Hi Julia Kohl,
We are able to reproduce the reported issue, “White space missing while extracting text from particular PDF document”, on our end. We will provide further validation details on 4th June 2021.
With Regards,
Anand Panchamoorthi
Anand Panchamoorthi
Syncfusion Team
June 4, 2021 10:35 AM UTC
Hi Julia Kohl,
The reported issue, “White space missing while extracting text from particular PDF document”, has been validated and confirmed as a defect. We will include the fix in our upcoming weekly release, which is planned for 22nd June 2021.
The status of the bug can be tracked through the link below: https://www.syncfusion.com/feedback/25966/white-space-missing-while-extracting-text-from-particular-pdf-document
With Regards,
Anand Panchamoorthi
Anand Panchamoorthi
Syncfusion Team
June 22, 2021 09:50 AM UTC
Hi Julia Kohl,
As the 2021 Volume 2 main release is planned for 25th June 2021, there will be no weekly release today (June 22nd, 2021). We will include the fix in the next weekly release, which will be available once the main release has rolled out.
Until then, we appreciate your patience.
With Regards,
Anand Panchamoorthi
Anand Panchamoorthi
Syncfusion Team
June 25, 2021 07:35 AM UTC
Hi Julia Kohl,
We appreciate your patience.
The next weekly release is planned for 6th July 2021. We will include the fix in that release.
With Regards,
Anand Panchamoorthi
Anand Panchamoorthi
Syncfusion Team
July 6, 2021 11:58 AM UTC
Hi Julia Kohl,
Thanks for your patience.
We have fixed the issue 'White space missing while extracting text from particular PDF document' and included the fix in the weekly release https://pub.dev/packages/syncfusion_flutter_pdf/versions/19.2.46-beta.
Please try the sample https://www.syncfusion.com/downloads/support/forum/166012/ze/ExtractTextFromPDF-2100639782 on your end and let us know if you need further assistance.
With Regards,
Anand Panchamoorthi
Marked as answer
- 4 Participants
DJ dave Jacob
- Sep 3, 2020 06:03 PM UTC
- Jul 6, 2021 11:58 AM UTC