You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Maxime Veron (JIRA)" <ji...@apache.org> on 2017/06/28 12:03:00 UTC
[jira] [Created] (PDFBOX-3841) OpenText - Exstream PDFs textual
informations are recognized with erroneous font size
Maxime Veron created PDFBOX-3841:
------------------------------------
Summary: OpenText - Exstream PDFs textual informations are recognized with erroneous font size
Key: PDFBOX-3841
URL: https://issues.apache.org/jira/browse/PDFBOX-3841
Project: PDFBox
Issue Type: Bug
Affects Versions: 1.8.13, 1.8.12
Environment: Windows/Linux
Reporter: Maxime Veron
Attachments: 436bc378-aa42-43e5-a4de-a0bbaf233f79-4.PDF vII.PDF
I'm creating this bug report because I observed a particular behavior in PDFBox parsing of text positions.
I used the command line tool (PrintTextPositions) to generate the text positions and remove any code logic our application could have over PDFBox and here is an extract of the result :
String[17.279999,18.800049 fs=*50.0 *xscale=12.0 height=6.918 space=3.0 width=3.9960938]-
String[21.6,18.800049 fs=*50.0 *xscale=12.0 height=6.918 space=3.0 width=5.999998]*
String[28.08,18.800049 fs=*50.0 *xscale=12.0 height=6.918 space=3.0 width=3.9960918]-
String[31.679998,18.800049 fs=*50.0 *xscale=12.0 height=6.918 space=3.0 width=3.000002]
The font size is reported as 50.0 (which is indeed contained in the PDF as 50) but Adobe and other PDF viewers do not seem to take into account this font of size 50 to render the text selection. Should we rely on another metric from the text position, or could this font size be reported relative to the correct appearance the text should have?
Best regards
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org