You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by Renaud Billen <re...@nic.be> on 2015/01/10 14:49:41 UTC
Re: Content of pdf moved around - SOLVED
Wow thought the sort option would make an alphabetical sort, so I haven’t tried it, but it did the trick… :)
Anyway thanks a lot for you help,
Renaud
> Le 10 janv. 2015 à 14:20, Andreas Lehmkuehler <an...@lehmi.de> a écrit :
>
> Hi,
>
> Am 10.01.2015 um 14:04 schrieb Renaud Billen:
>> Hello,
>>
>> I have a little issue with the extraction of the text of some pdfs, where some words are switching order with others..
>>
>> With the pdf attached to this mail, if I use "save as text » from adobe reader, I get :
>>
>> Référence: LIX-673LIX-6737
>>
>>
>> Nom: The test company
>>
>>
>> Type:
>> Ouverture: 24/04/2007
>>
>> Titulaire: BD
>> Resp.: LIX
>> Co-Resp.: BB
>> Client
>>
>>
>>
>>
>> But with pdfbox I get :
>>
>> Référence: LIX-6737
>> Nom: The test company
>> Titulaire: BD
>> Resp.: LIX
>> Co-Resp.: BB
>> Type:
>> Ouverture: 24/04/2007
>> Client
>>
>>
>> Could you tell me if something can be done to solve this problem?
> Is the sort option activated?
>
>> Thanks,
>> Renaud
>
> BR
> Andreas Lehmkühler