You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2014/10/23 22:55:34 UTC

[jira] [Updated] (PDFBOX-2449) Character missing in text extraction

     [ https://issues.apache.org/jira/browse/PDFBOX-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tilman Hausherr updated PDFBOX-2449:
------------------------------------
    Attachment: 267739.pdf

> Character missing in text extraction
> ------------------------------------
>
>                 Key: PDFBOX-2449
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2449
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.8.8
>            Reporter: Tilman Hausherr
>         Attachments: 267739.pdf
>
>
> The attached file brings this text extraction:
> 1.8.6:
> For safe! clean! abundant water?in our homes!
> rivers! lakes! and streams?is one of our
> 1.8.7:
> For safe! clean! abundant water?in our homes!
> rivers! lakes! and streams?is one of our
> 1.8.8:
> For safe! clean! abundant water?n our homes!
> rivers! lakes! and streams?s one of our
> 2.0:
> For safe! clean! abundant water–in our homes!
> rivers! lakes! and streams–is one of our
> AR:
> For safe! clean! abundant water–in our homes!
> rivers! lakes! and streams–is one of our
> So the "i" has been lost in the 1.8.8 version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)