PDF Loaded Document - not reading in spaces

I'm using the PDFLoadedDocument and then page.extractText(true) to read a pdf document that only contains text.
While the text is all there the spaces between words are frequently missing.
For instance 'One Two Three' becomes 'OneTwoThree'.
I'm wondering if there is another option that I can use to retrieve the spaces between words.

9 Replies 1 reply marked as answer

AA Akshaya Arivoli Syncfusion Team September 4, 2020 11:23 AM UTC

Hi Dave , 

Thank you for contacting Syncfusion support.  

We were unable to reproduce the reported issue with the provided details, so can you please share us the PDF document and Essential Studio version in which you are facing the issue. These details will be helpful for us to investigate further and assist you better. 

Regards, 
Akshaya  



DJ dave Jacob September 6, 2020 11:02 PM UTC

Attached is the pdf document that I am using. Please let me know if I can provide any other information. 

Thanks, 

Dave 

Attachment: SamplePage_66f1cdff.zip


AA Akshaya Arivoli Syncfusion Team September 7, 2020 12:40 PM UTC

Hi Dave,   
  
We were able to reproduce the reported issue with the provided document using PdfLoadedDocument. However, we can resolve it using Our EJ2 PDF Viewer server library. It allows you to extract the text from a page along with the bounds. So can please try extracting the text for the provided PDF document using the below link,    
  


  
Note: We have embedded the Pdfium rendering engine in our EJ2 PDF Viewer for robust rendering, so the text extraction result will be more accurate when compared to the PdfLoadedDocument.    
  
Please try it and revert us with more details if you need further assistance 
Regards,   
Akshaya   



JK Julia Kohl June 1, 2021 09:49 AM UTC

Hi!

I am currently facing the same issue within my flutter app. 
Is there a solution that can be used inside my app too?

Thanks
Julia


AP Anand Panchamoorthi Syncfusion Team June 2, 2021 12:43 PM UTC

Hi Julia Kohl, 

We are able to reproduce the reported issue with “White space missing while extracting text from particular PDF document” in our end. And we will provide further validation details on 4th June 2021. 

With Regards, 
Anand Panchamoorthi 



AP Anand Panchamoorthi Syncfusion Team June 4, 2021 10:35 AM UTC

Hi Julia Kohl, 

The reported issue with “White space missing while extracting text from particular PDF document” has been validated and confirmed as a defect. We will include the fix in our upcoming weekly release which is planned on 22nd June 2021. 


With Regards,  
Anand Panchamoorthi 



AP Anand Panchamoorthi Syncfusion Team June 22, 2021 09:50 AM UTC

Hi Julia Kohl, 

As 2021 Volume 2 Main Release has been planned on 25th June 2021, we will not be having weekly release on today (June 22nd, 2021). So, We will include our fix in upcoming weekly release which is available after once Main release rolled out. 

Until then we will appreciating your patience. 

With Regards, 
Anand Panchamoorthi 



AP Anand Panchamoorthi Syncfusion Team June 25, 2021 07:35 AM UTC

Hi Julia Kohl,

We are appreciating your patience.

Next weekly release has planned for 6th July 2021. We will include the fix in upcoming weekly release which is available on July 6th.

With Regards,
Anand Panchamoorthi


AP Anand Panchamoorthi Syncfusion Team July 6, 2021 11:58 AM UTC

Hi Julia Kohl, 

Thanks for your patience. 

We have fixed an issue with 'White space missing while extracting text from particular PDF document' and included the fix in weekly release https://pub.dev/packages/syncfusion_flutter_pdf/versions/19.2.46-beta

Please try the sample https://www.syncfusion.com/downloads/support/forum/166012/ze/ExtractTextFromPDF-2100639782 in your end and let us know if you need further assistance in this. 

With Regards, 
Anand Panchamoorthi 


Marked as answer
Loader.
Up arrow icon