You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "John Hewson (JIRA)" <ji...@apache.org> on 2014/10/10 23:51:33 UTC

[jira] [Updated] (PDFBOX-212) PDF Document cut German Umlauts

     [ https://issues.apache.org/jira/browse/PDFBOX-212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Hewson updated PDFBOX-212:
-------------------------------
    Affects Version/s: 1.2.1

> PDF Document cut German Umlauts
> -------------------------------
>
>                 Key: PDFBOX-212
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-212
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Writing
>    Affects Versions: 1.2.1
>            Priority: Minor
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1587745
> Originally submitted by kajiro on 2006-10-31 01:05.
> I use the class TextToPDF for create a PDF Document
> from a text file. That operates correctly with a simply
> text. But when i use german umlauts in the text like
> ä,ö,ü or ß the PDF Document cut this letters. 
> Attached is a sample document contaning four words with
> incorrectly umlauts! 
> [attachment on SourceForge]
> http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1587745&file_id=200742
> bsp.pdf (application/pdf), 958 bytes
> Umlauts are incorrect
> [comment on SourceForge]
> Originally sent by benlitchfield.
> Logged In: YES 
> user_id=601708
> Originator: NO
> To the anonymous poster, did you mean for both PDF links to be the same?
> Ben
> [comment on SourceForge]
> Originally sent by nobody.
> Logged In: NO 
> For PDF file, which contains accented Latin1
> characters:
>     http://acl.ldc.upenn.edu//P/P06/P06-2052.pdf
> I get a u with umlauts converted into "currency1u"
> (look at the first name on the first page).
> For the following file containing Japanese characters:
>      http://acl.ldc.upenn.edu//P/P06/P06-2052.pdf
> I get error:
>      java.io.IOException: Unknown encoding for 'H'
> I also can't seem to cut and past the form.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)