You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Manuel Aristaran (JIRA)" <ji...@apache.org> on 2013/10/23 23:44:41 UTC
[jira] [Created] (PDFBOX-1755) Wrong widthOfSpace
Manuel Aristaran created PDFBOX-1755:
----------------------------------------
Summary: Wrong widthOfSpace
Key: PDFBOX-1755
URL: https://issues.apache.org/jira/browse/PDFBOX-1755
Project: PDFBox
Issue Type: Bug
Components: Text extraction
Affects Versions: 1.8.3
Environment: Java 7, JRuby
Reporter: Manuel Aristaran
Attachments: tabla_subsidios.pdf
For some TextPositions in this particular PDF (after being processed with PDFStreamEngine), the getWidthOfSpace returns 0.
I've traced the bug to `processEncodedText` in `PDFStreamEngine`: when `spaceWidthText` is converted to display units, `textMatrix.getValue(0,0)` returns 0. Being a factor in the conversion expression, sets the result to 0.
float spaceWidthDisp = spaceWidthText * fontSizeText * horizontalScalingText * textMatrix.getValue(0, 0)
* ctm.getValue(0, 0);
--
This message was sent by Atlassian JIRA
(v6.1#6144)