You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by Jason Wood <jw...@comcast.net> on 2011/06/20 04:09:01 UTC
Error with setSortByPosition
Are there any known issues involving the setSortByPosition method of
PDFTextStripper to cause characters to be handled incorrectly, when
enabled? When using the 1.5.0 version of pdfbox, there is a PDF
document which correctly returns its text with positional sorting
disabled, but enabling it causes the text output to be incorrect.
Curiously, this same document is sorted correctly using version 1.3.1 of
pdfbox.
Also tested was the trunk code with sorting enabled, but the result was
even worse.