You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2010/09/01 14:02:53 UTC

[jira] Commented: (PDFBOX-805) Extratced ascii text in CJK document is malformed

    [ https://issues.apache.org/jira/browse/PDFBOX-805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905004#action_12905004 ] 

Andreas Lehmkühler commented on PDFBOX-805:
-------------------------------------------

It is always a good idea to embed all used fonts to the pdf. Otherwise one can't be sure that all needed fonts are installed on your destination platform. E.g. here on my german WinXP the acrobat reader doesn't show anything. Please, recreate the pdf with embedded fonts if possible?

> Extratced ascii text in CJK document is malformed
> -------------------------------------------------
>
>                 Key: PDFBOX-805
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-805
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>    Affects Versions: 1.2.1
>            Reporter: Keiji Suzuki
>         Attachments: cjk.pdf, CMapParser.java.patch
>
>
> When I run ExtractText with CJK PDF document with ascii text, the only ascii text is malformed. This does not occur in version 1.1.0.
> I can fix it with the attached patch. I attach an example pdf.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.