You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by Jason Wood <jw...@comcast.net> on 2011/06/20 04:09:01 UTC

Error with setSortByPosition

Are there any known issues involving the setSortByPosition method of 
PDFTextStripper to cause characters to be handled incorrectly, when 
enabled?  When using the 1.5.0 version of pdfbox, there is a PDF 
document which correctly returns its text with positional sorting 
disabled, but enabling it causes the text output to be incorrect.  
Curiously, this same document is sorted correctly using version 1.3.1 of 
pdfbox.
Also tested was the trunk code with sorting enabled, but the result was 
even worse.