PDF Extract to Image - Cannot extract parts of PDF document

Hi,

I'm trying to export each page of a PDF as an individual image. The export works for most PDFs; however, for certain documents the resulting images are missing some of the content.
I have attached a PDF that shows the problem.

Here is the code:
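        // Requires: System.Collections.Generic, System.Drawing.Imaging, System.IO, Syncfusion.Pdf.Parsing
        public List<byte[]> ExtractPagesAsImages(byte[] file)
        {
            var ldoc = new PdfLoadedDocument(file);
            var result = new List<byte[]>();
            // Render every page, from the first index to the last, as an image.
            var images = ldoc.ExportAsImage(0, ldoc.Pages.Count - 1);
            foreach (var image in images)
            {
                using (var memory = new MemoryStream())
                {
                    // Encode the rendered page as PNG and keep the raw bytes.
                    image.Save(memory, ImageFormat.Png);
                    result.Add(memory.ToArray());
                }

                // The bytes are already copied, so the image can be released.
                image.Dispose();
            }

            // Release the resources held by the loaded document.
            ldoc.Close(true);
            return result;
        }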

Attachment: Doc_7c7ba6a2f0434bada17397bcf3935249_1430c4e1.zip

1 Reply

Priyanga Elangovan, Syncfusion Team, December 4, 2017 10:30 AM UTC

Hi Lee,

Thank you for contacting Syncfusion support.

A support incident has been created under your account to track the status of the reported issue, “Some contents are missing while exporting the PDF document as images”. Please log on to our support website to check for further updates:

https://www.syncfusion.com/account/login?ReturnUrl=%2fsupport%2fdirecttrac%2fincidents 

Regards,
Priyanga.E
 

