You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Roman (JIRA)" <ji...@apache.org> on 2017/03/10 04:12:38 UTC

[jira] [Created] (PDFBOX-3715) Text Stripper trims last spaces - regression of 2.0

Roman created PDFBOX-3715:
-----------------------------

             Summary: Text Stripper trims last spaces - regression of 2.0
                 Key: PDFBOX-3715
                 URL: https://issues.apache.org/jira/browse/PDFBOX-3715
             Project: PDFBox
          Issue Type: Bug
            Reporter: Roman


When migrated from 1.8 to 2.0, we realized that some spaces are disappeared. Please see attached PDF. Disappeared spaces are shown as blue boxes in it. Those spaces WERE present in 1.8 version.

Our App overrides *PDFTextStripper* class, implements *writePage()* method, and uses *charactersByArticle* property, which is actually a list of all *TextPosition* objects existing for every character from document.

Some trailing spaces are disappeared from it. In the same time, those spaces are present in PDF via explicit declaration. For example, these piece of attached PDF contains the space right after "contents" word:

{code}
[( the content)-7(s )-2(of t)...]TJ
{code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org