Posted to users@pdfbox.apache.org by Renaud Billen <re...@nic.be> on 2015/01/10 14:49:41 UTC

Re: Content of pdf moved around - SOLVED

Wow, I thought the sort option would sort the text alphabetically, so I hadn't tried it, but it did the trick… :)
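
For anyone finding this thread later: the sort option orders the extracted text by its position on the page rather than by the order in which it is stored in the PDF. In PDFBox that means calling setSortByPosition(true) on the PDFTextStripper before getText, or passing -sort to the ExtractText command-line tool. The sketch below only illustrates the idea and does not use PDFBox itself: the Fragment class, the inReadingOrder method and all coordinates are invented for the example, and the real implementation is more elaborate than a plain top-to-bottom, left-to-right sort.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

public class SortByPositionSketch {

    /** Hypothetical stand-in for a piece of text and where it sits on the page. */
    static final class Fragment {
        final String text;
        final double x; // distance from the left edge of the page
        final double y; // distance from the top edge of the page

        Fragment(String text, double x, double y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    /** Returns the fragment texts ordered top to bottom, then left to right. */
    static List<String> inReadingOrder(List<Fragment> fragments) {
        List<Fragment> sorted = new ArrayList<>(fragments); // leave the input untouched
        sorted.sort(Comparator.comparingDouble((Fragment f) -> f.y)
                              .thenComparingDouble(f -> f.x));
        List<String> texts = new ArrayList<>();
        for (Fragment f : sorted) {
            texts.add(f.text);
        }
        return texts;
    }

    public static void main(String[] args) {
        // Assumption: the list order mimics the unsorted output quoted in this thread;
        // the coordinates are made up so that "Type:" and "Ouverture:" sit above "Titulaire:".
        List<Fragment> storedOrder = Arrays.asList(
            new Fragment("Nom:", 50, 120),
            new Fragment("Titulaire:", 50, 180),
            new Fragment("Resp.:", 50, 200),
            new Fragment("Co-Resp.:", 50, 220),
            new Fragment("Type:", 50, 140),
            new Fragment("Ouverture:", 50, 160),
            new Fragment("Client", 50, 240));

        for (String line : inReadingOrder(storedOrder)) {
            System.out.println(line);
        }
    }
}
```

With these made-up coordinates, the labels are printed in the order they appear on the page, so Type: and Ouverture: come out before Titulaire:, as in the Adobe Reader output.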

Anyway, thanks a lot for your help,
Renaud

> On 10 Jan 2015, at 14:20, Andreas Lehmkuehler <an...@lehmi.de> wrote:
> 
> Hi,
> 
> On 10.01.2015 at 14:04, Renaud Billen wrote:
>> Hello,
>> 
>> I have a small issue with text extraction from some PDFs: some words come out in a different order than they appear on the page.
>> 
>> With the PDF attached to this mail, if I use "Save as Text" from Adobe Reader, I get:
>> 
>> Référence: LIX-673LIX-6737
>> 
>> 
>> Nom: The test company
>> 
>> 
>> Type:
>> Ouverture: 24/04/2007
>> 
>> Titulaire: BD
>> Resp.: LIX
>> Co-Resp.: BB
>> Client
>> 
>> 
>> 
>> 
>> But with PDFBox I get:
>> 
>> Référence: LIX-6737
>> Nom: The test company
>> Titulaire: BD
>> Resp.: LIX
>> Co-Resp.: BB
>> Type:
>> Ouverture: 24/04/2007
>> Client
>> 
>> 
>> Could you tell me whether anything can be done to solve this problem?
> Is the sort option activated?
> 
>> Thanks,
>> Renaud
> 
> BR
> Andreas Lehmkühler