You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@pdfbox.apache.org by "Sean Bridges (JIRA)" <ji...@apache.org> on 2009/05/12 22:30:45 UTC

[jira] Updated: (PDFBOX-466) error parsing files generated by crystal reports

     [ https://issues.apache.org/jira/browse/PDFBOX-466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Bridges updated PDFBOX-466:
--------------------------------

    Attachment: patch

This patch fixes the issue.  Crystal reports adds a space after xref and startxref.  trim()'ing the lines before comparison makes it work.

> error parsing files generated by crystal reports
> ------------------------------------------------
>
>                 Key: PDFBOX-466
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-466
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>            Reporter: Sean Bridges
>             Fix For: 0.8.0-incubator
>
>         Attachments: patch
>
>
> This is with the latest from svn, Revision: 773978
> From a sample of 13304 pdf documents generated in a very wide variety of ways, I got 200 exceptions with the stack trace,
> Caused by: java.io.IOException: expected='obj' actual='000' org.apache.pdfbox.io.PushBackInputStream@1049d3
> 	at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:471)
> 	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:169)
> 	at message_analyzer.extractor.PDFExtractor.getContent(PDFExtractor.java:32)
> 	... 2 more
> I can't give an example file, but the pdfs are all generated by crystal reports.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.