Posted to dev@pdfbox.apache.org by "John Hewson (JIRA)" <ji...@apache.org> on 2014/10/11 03:36:33 UTC

[jira] [Closed] (PDFBOX-212) PDF Document cut German Umlauts

     [ https://issues.apache.org/jira/browse/PDFBOX-212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Hewson closed PDFBOX-212.
------------------------------
    Resolution: Invalid

I'm closing this issue as invalid because it mixes several unrelated problems: it covers parsing and rendering as well as writing new PDFs, and it covers both TrueType and Type1 fonts, which are entirely separate. There are also no sample PDFs.

> PDF Document cut German Umlauts
> -------------------------------
>
>                 Key: PDFBOX-212
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-212
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Writing
>    Affects Versions: 1.2.1
>            Priority: Minor
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1587745
> Originally submitted by kajiro on 2006-10-31 01:05.
> I use the class TextToPDF to create a PDF document
> from a text file. That works correctly with simple
> text. But when I use German umlauts in the text, such as
> ä, ö, ü or ß, these letters are cut from the PDF document.
> Attached is a sample document containing four words with
> incorrect umlauts!
> [attachment on SourceForge]
> http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1587745&file_id=200742
> bsp.pdf (application/pdf), 958 bytes
> Umlauts are incorrect
> [comment on SourceForge]
> Originally sent by benlitchfield.
> Logged In: YES 
> user_id=601708
> Originator: NO
> To the anonymous poster, did you mean for both PDF links to be the same?
> Ben
> [comment on SourceForge]
> Originally sent by nobody.
> Logged In: NO 
> For a PDF file that contains accented Latin-1
> characters:
>     http://acl.ldc.upenn.edu//P/P06/P06-2052.pdf
> I get a u with an umlaut converted into "currency1u"
> (look at the first name on the first page).
> For the following file containing Japanese characters:
>      http://acl.ldc.upenn.edu//P/P06/P06-2052.pdf
> I get error:
>      java.io.IOException: Unknown encoding for 'H'
> I also can't seem to cut and paste the form.
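
The original report does not say which encoding the input text file used. As a minimal sketch only, the Java below shows how the four characters from the report (ä, ö, ü, ß) differ in byte length between ISO-8859-1 and UTF-8, and how reading text with a different charset than it was written in changes those characters before any PDF library sees them. Whether this played any part in PDFBOX-212 is an assumption and is not confirmed anywhere in this thread. The class name UmlautSample, the helper roundTrip, and the four sample words are hypothetical, and the code does not call PDFBox at all.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class UmlautSample {
    // Four hypothetical sample words (Käse, schön, müde, Straße). The \u escapes
    // keep this source file independent of the editor's own encoding.
    public static final String SAMPLE = "K\u00e4se sch\u00f6n m\u00fcde Stra\u00dfe";

    // Encode the text with one charset and decode it with another, which is
    // what happens when a text file is read with the wrong charset.
    public static String roundTrip(String text, Charset written, Charset read) {
        return new String(text.getBytes(written), read);
    }

    public static void main(String[] args) {
        // Each of the four special characters is one byte in ISO-8859-1
        // and two bytes in UTF-8.
        System.out.println("ISO-8859-1 bytes: "
                + SAMPLE.getBytes(StandardCharsets.ISO_8859_1).length);
        System.out.println("UTF-8 bytes:      "
                + SAMPLE.getBytes(StandardCharsets.UTF_8).length);

        // Matching charsets preserve the text.
        String same = roundTrip(SAMPLE,
                StandardCharsets.ISO_8859_1, StandardCharsets.ISO_8859_1);
        System.out.println("matching charsets preserved text: " + same.equals(SAMPLE));

        // Mismatched charsets turn each umlaut into two unrelated characters.
        String mixed = roundTrip(SAMPLE,
                StandardCharsets.UTF_8, StandardCharsets.ISO_8859_1);
        System.out.println("mismatched charsets preserved text: " + mixed.equals(SAMPLE));
    }
}
```

A text file produced this way, with its encoding stated, would also be a reasonable attachment for a narrower follow-up issue, since the closing comment notes the lack of sample files.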


