You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "William Fausser (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2012/01/17 17:23:39 UTC

[jira] [Issue Comment Edited] (PDFBOX-1204) OCR generated PDF/A has problems with preflight validation

    [ https://issues.apache.org/jira/browse/PDFBOX-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13184937#comment-13184937 ] 

William Fausser edited comment on PDFBOX-1204 at 1/17/12 4:22 PM:
------------------------------------------------------------------


Hi Eric,
With issues 1110 and 1200 getting fixed, I reran my test on the boyd.pdf   and still get a  preflight validation error below:

/home/fausser/boyd.pdf is not valid, error(s) :
3.3.1 : Glyph error, CID 95 is missing from the Composite Font format "HiddenHorzOCR"
7.2 : Error on MetaData, ModificationDate present in the document catalog dictionary doesn't match with XMP information

OCR generated PDFs are a big part of the way PDF/As get generated and if these bugs get cleared up, I think the preflight product wiill
become usable as a valid PDF/A validator.

Regards,
Bill

                
      was (Author: bfausser):
    
Hi Eric,
With issues 110 and 1200 getting fixed, I reran my test on the boyd.pdf   and still get a  preflight validation error below:

/home/fausser/boyd.pdf is not valid, error(s) :
3.3.1 : Glyph error, CID 95 is missing from the Composite Font format "HiddenHorzOCR"
7.2 : Error on MetaData, ModificationDate present in the document catalog dictionary doesn't match with XMP information

Regards,
Bill

                  
> OCR generated PDF/A  has problems with preflight validation
> -----------------------------------------------------------
>
>                 Key: PDFBOX-1204
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1204
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Preflight
>    Affects Versions: 1.7.0
>            Reporter: William Fausser
>         Attachments: boyd.pdf
>
>
> /home/fausser/boyd.pdf is not valid, error(s):
> 2.1.2:Invalid Graphis object, The Info entry of a OutputIntent dictionary is missing
> 3.3.1: Glyph error, CID 95 is missing from the Composite Font format "HiddenHorzOCR"
> 7.2:Error on MetaData, ModificationDate present in the document catalog dictionary doesn't match with XMP information
> Passes as a valid PDF/A with commercial validators Adobe Acrobat 10.x and Callas

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira