You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Johan van der Knijff (JIRA)" <ji...@apache.org> on 2013/12/17 13:21:08 UTC

[jira] [Created] (PDFBOX-1812) Illegal characters in XML output

Johan van der Knijff created PDFBOX-1812:
--------------------------------------------

             Summary: Illegal characters in XML output
                 Key: PDFBOX-1812
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1812
             Project: PDFBox
          Issue Type: Bug
          Components: Preflight
    Affects Versions: 2.0.0
         Environment: Bug reproduced under Win 7, Ubuntu
            Reporter: Johan van der Knijff
             Fix For: 2.0.0


When running Preflight in XML mode, the latest Preflight version (I used the JAR from build #747) sometimes produces output that contains characters that are illegal in XML. This can cause unexpected behavior if such files are further processed with tools that expect well-formed XML.  See attached PDFs, which all result in illegal characters in the description of a 1.0 Syntax error, Error: Expected a long type. Output of older versions of Preflight didn't contain these illegal characters; instead they would give something like *actual='/O'*, *actual='Pages'*. etc. So I suppose this must have been caused by a fairly recent change.

[NOTE: can't see how to add attachments here, if I can't get this working I will create a Git repo with the example files and provide a link here]



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)