Posted to dev@pdfbox.apache.org by "Hayk Hayryan (JIRA)" <ji...@apache.org> on 2015/04/07 15:30:12 UTC

[jira] [Updated] (PDFBOX-2749) Annotations character bounding boxes size 3 times higher than expected

     [ https://issues.apache.org/jira/browse/PDFBOX-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hayk Hayryan updated PDFBOX-2749:
---------------------------------
    Attachment: RESULT.pdf

> Annotations character bounding boxes size 3 times higher than expected
> ----------------------------------------------------------------------
>
>                 Key: PDFBOX-2749
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2749
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.8.4
>            Reporter: Hayk Hayryan
>            Priority: Critical
>         Attachments: RESULT.pdf
>
>
> After text extraction, the character bounding boxes are 3 times higher than expected. For example, see the first few character bounding boxes below:
> [90.1,46,6.64,40.06],[96.7,46,5.09,40.06],[101.79,46,5.8,40.06].
> The values are x, y, width, height. The widths of the characters are between 5 and 7 pixels, but the heights are 40.06 pixels. The actual height of each line of text appears to be about 12 pixels. An example PDF document is attached.
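
The discrepancy described above can be checked with a few lines of plain Java. This is only a minimal sketch: the class BoundingBoxCheck and its methods are hypothetical helpers, not part of PDFBox, and the 12-pixel expected height is the reporter's approximate figure rather than a measured value. It parses the boxes in the [x,y,width,height] form quoted in the report and compares each reported height with the expected line height.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of PDFBox: parses boxes printed as [x,y,width,height].
class BoundingBoxCheck {

    // Matches one bracketed group such as [90.1,46,6.64,40.06].
    private static final Pattern BOX = Pattern.compile("\\[([^\\]]+)\\]");

    // Returns one double[4] {x, y, width, height} per bracketed group found in the text.
    static double[][] parseBoxes(String text) {
        List<double[]> boxes = new ArrayList<>();
        Matcher m = BOX.matcher(text);
        while (m.find()) {
            String[] parts = m.group(1).split(",");
            if (parts.length != 4) {
                throw new IllegalArgumentException("expected x,y,width,height: " + m.group());
            }
            double[] box = new double[4];
            for (int i = 0; i < 4; i++) {
                box[i] = Double.parseDouble(parts[i].trim());
            }
            boxes.add(box);
        }
        return boxes.toArray(new double[0][]);
    }

    // Ratio of the reported height (index 3) to the height the reporter expected.
    static double heightRatio(double[] box, double expectedHeight) {
        return box[3] / expectedHeight;
    }

    public static void main(String[] args) {
        // Values copied from the report above.
        String reported = "[90.1,46,6.64,40.06],[96.7,46,5.09,40.06],[101.79,46,5.8,40.06]";
        double expectedLineHeight = 12.0; // assumption: the reporter's approximate figure
        for (double[] box : parseBoxes(reported)) {
            System.out.printf("width=%.2f height=%.2f ratio=%.2f%n",
                    box[2], box[3], heightRatio(box, expectedLineHeight));
        }
    }
}
```

With the three boxes from the report, every ratio comes out a little above 3, which matches the "3 times higher" description in the summary.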



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
