You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Volker Kunert (Jira)" <ji...@apache.org> on 2022/05/21 19:58:00 UTC

[jira] [Commented] (PDFBOX-4951) Sequences with combining letters are rendered incorrectly

    [ https://issues.apache.org/jira/browse/PDFBOX-4951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17540490#comment-17540490 ] 

Volker Kunert commented on PDFBOX-4951:
---------------------------------------

See also 
 * [https://github.com/LibrePDF/OpenPDF/issues/442 |https://github.com/LibrePDF/OpenPDF/issues/442]Accented Letters composed of Unicode base letter and combining accent are rendered incorrectly.
 * [https://github.com/LibrePDF/OpenPDF/wiki/Accents,-DIN-91379,-non-Latin-scripts] Accents, DIN 91379, non Latin scripts
 * [https://github.com/danfickle/openhtmltopdf/issues/777 |https://github.com/danfickle/openhtmltopdf/issues/777]Wrong Position of Accents for Sequences of DIN 91379

> Sequences with combining letters are rendered incorrectly
> ---------------------------------------------------------
>
>                 Key: PDFBOX-4951
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4951
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 2.0.21
>            Reporter: Volker Kunert
>            Priority: Major
>         Attachments: DIN_SPEC_91379_Sequences-aa.pdf, DIN_SPEC_91379_Sequences-ab.pdf, DIN_SPEC_91379_Sequences-ac.pdf, DIN_SPEC_91379_Sequences.txt, DefaultScriptProcessor.java, DoGlyphLayoutDinSpec91379.pdf, DoGlyphLayoutDinSpec91379Form.pdf, DoGlyphPositionBengali.pdf, ExamplePdfboxFopPos-By-Tilman.pdf, ExamplePdfboxFopPos.java, ExamplePdfboxFopPos.pdf, ExamplePdfboxFopPosForm.java, ExamplePdfboxFopPosForm.pdf, TestPdfbox.java, TestPdfboxFop2.java, TestPdfboxFop2.pdf, TestPdfboxJava2D.java, TestPdfboxJava2D.pdf, patch-2020-10-02.txt, pdfbox.patch, pdfbox.pdf, screenshot-1.png
>
>
> Accented Letters composed of Unicode base letter and combining accent are rendered wrong. E.g. with 0041 030B LATIN CAPITAL LETTER A WITH COMBINING DOUBLE ACUTE ACCENT the accent appears at the right hand side of the letter A, not above the letter A.
> The position is wrong for most of the sequences defined in the following spec:
> DIN SPEC 91379: Characters in Unicode for the electronic processing of names and data 
>  exchange in Europe; with digital attachment
>  [https://www.xoev.de/downloads-2316#StringLatin]
>  [https://www.din.de/de/wdc-beuth:din21:301228458]
>  
> The correct rendering should look like the output of hb-view 2.6.8, see files DIN_SPEC_91379_Sequences*.pdf.
> The output of PDFBox is appended in pdfbox.pdf, which is created by running TestPdfbox.java. The sequences are read from file DIN_SPEC_91379_Sequences.txt.
>  
> Font used for testing: NotoSansMono-Regular.ttf, see [https://www.google.com/get/noto/] 
> download: [https://noto-website-2.storage.googleapis.com/pkgs/NotoSansMono-hinted.zip]
>  See also FOP-2969
>  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org