Posted to dev@pdfbox.apache.org by "Maruan Sahyoun (JIRA)" <ji...@apache.org> on 2013/05/13 08:27:40 UTC

[jira] [Commented] (PDFBOX-1598) Could not parse predefined CMAP file for UCS2 Encoding

    [ https://issues.apache.org/jira/browse/PDFBOX-1598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13655791#comment-13655791 ] 

Maruan Sahyoun commented on PDFBOX-1598:
----------------------------------------

Hi James,

Adobe Reader and other viewers have issues displaying the content too, so IMHO the PDF is corrupt. E.g., Adobe Reader displays only dots instead of text, and it shows an error message when the PDF is opened.

Please give it a try and close the issue if you agree.

BR
Maruan
                
> Could not parse predefined CMAP file for UCS2 Encoding
> ------------------------------------------------------
>
>                 Key: PDFBOX-1598
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1598
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.8.1
>         Environment: Ubuntu 12.04
>            Reporter: James Sullivan
>         Attachments: PDFExampleError.pdf
>
>
> To reproduce, run from the command line: pdfbox ExtractText -console PDFExampleError.pdf
> org.apache.pdfbox.pdmodel.font.PDCIDFont determineEncoding
> SEVERE: Error: Could not parse predefined CMAP file for 'æ¢x-í§sO-UCS2'
> The name is garbled, but it may be the UniJIS-UCS2-H encoding for Japanese.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira