You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by Jeroen Verhagen <je...@gmail.com> on 2010/02/25 11:11:45 UTC

individual character extracted?

Hi all,

I'm using pdfbox (0.7.3) for some time now just fine. We extended
PDFTextStripper so that the method showCharacter() will give us text
fragments from a pdf.

However my client changed the format of his pdf and now individual
characters instead of sentences are recognized as fragments causing
all context to be lost. Could somebody please tell me what's causing
this?

Thanks for any help.

-- 

regards,

Jeroen