Hello,
I need to extract text from pdf Invoices.
But often I get System.FormatException.
My code:
using (FileStream oFs = new FileStream(item.FilePath, FileMode.Open, FileAccess.Read))
{
loadedDocument = new PdfLoadedDocument(oFs);
PdfLoadedPageCollection loadedPages = loadedDocument.Pages;
foreach (PdfLoadedPage loadedPage in loadedPages)
{
extractedText += loadedPage.ExtractText();
}
loadedDocument.Close(true);
}
if (extractedText.Trim().Length > 0) {
DocumentTextRecognition dt = new DocumentTextRecognition() {Id = item.IdDocumentText };
dt.Flgread = true;
dt.Updated = DateTime.Now;
dt.Textread = extractedText.Trim();
_backgroundService.UpdateDocumentText(dt).Wait();
}
StackTrace:
at System.Number.ParseSingle(String value, NumberStyles options, NumberFormatInfo numfmt)
at Syncfusion.Pdf.PdfPageBase.RenderText(PdfRecordCollection recordCollection, PdfPageResources m_pageResources)
at Syncfusion.Pdf.PdfPageBase.ExtractText()
MessageException:
Input string was not in a correct format
Thanks,
David