You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@pdfbox.apache.org by "Juraj Lonc (JIRA)" <ji...@apache.org> on 2012/08/27 12:29:07 UTC

[jira] [Commented] (PDFBOX-1397) Some characters are displayed by default font (not by correct font)

    [ https://issues.apache.org/jira/browse/PDFBOX-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13442352#comment-13442352 ] 

Juraj Lonc commented on PDFBOX-1397:
------------------------------------

It seems that embedded font does not contain "cmap" table (it is defined in pdf object instead). JRE is not able to process such font and automatically makes substitution.

Do you think this would be a good solution?
1. extract that embedded font
2. process "cmap" defined in pdf object
3. make copy of that font with cmap included
4. replace font's object stream in pdf

so JRE would get font that is correct.
                
> Some characters are displayed by default font (not by correct font)
> -------------------------------------------------------------------
>
>                 Key: PDFBOX-1397
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1397
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.7.1
>            Reporter: Juraj Lonc
>         Attachments: adobe_reader.png, makcene-print.pdf, pdbox_output.png
>
>
> I have created simple test pdf which contains some central european characters.
> Adobe Reader displays text OK.
> But this code
> BufferedImage bufferedImage = pdPage.convertToImage();
> renders some characters with default font (not the one set in content).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira