You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by Shyam Sundar <sw...@gmail.com> on 2016/05/31 10:00:29 UTC

Numbers get reversed sometimes during conversion

Hi,

I have come across an issue wherein while trying to covert PDFs (mainly of
RTL languages) into TXT, the numbers get reversed.

Please check the attached file, '2005' in heading has become '5002'.

It happens with the latest version too. Is this a bug ?

This is a PDF/A format by the way. Hope it is fully supported.

Thanks in advance.

Re: Numbers get reversed sometimes during conversion

Posted by Andreas Lehmkühler <an...@lehmi.de>.
Hi,

> Shyam Sundar <sw...@gmail.com> hat am 31. Mai 2016 um 12:00
> geschrieben:
> 
> 
> Hi,
> 
> I have come across an issue wherein while trying to covert PDFs (mainly of
> RTL languages) into TXT, the numbers get reversed.
> 
> Please check the attached file, '2005' in heading has become '5002'.
The file didn't make it due to some restrictions.

> It happens with the latest version too. Is this a bug ?
Do you use the sorting option?

BR
Andreas
> This is a PDF/A format by the way. Hope it is fully supported.
> 
> Thanks in advance.
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
> For additional commands, e-mail: users-help@pdfbox.apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org


Re: Numbers get reversed sometimes during conversion

Posted by Shyam Sundar <sw...@gmail.com>.
Thanks Andreas !

On Thu, Jun 2, 2016 at 3:45 PM, Andreas Lehmkühler <an...@lehmi.de> wrote:

> > Shyam Sundar <sw...@gmail.com> hat am 2. Juni 2016 um 09:18
> > geschrieben:
> >
> >
> > Hi,
> >
> > Wondering if you got a chance to check this ...
>
> First thing to be done in such cases is to do the "Adobe Reader test". It
> fails,
> the text can't be extracted using Acrobat Reader, so we are better ;-)
> Anyway,
> mixed LTR/RTL text is always tricky to handle, see PDFBOX-2252
>
> No solution so far, if there is any at all.
>
> BR
> Andreas
> >
> > Thanks.
> >
> > On Wed, Jun 1, 2016 at 1:48 AM, Shyam Sundar <sw...@gmail.com>
> wrote:
> >
> > > Hi Andreas,
> > >
> > > I have just uploaded the files at below location -
> > >
> > >
> > >
> https://ftp.emc.com/action/login?domain=ftp.emc.com&username=7rdPxvIJU&password=mKymXA2KyB
> > >
> > > I tried both, but whether I sort or not doesn't make any difference in
> the
> > > output.
> > >
> > > Thanks,
> > > Shyam
> > >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
> For additional commands, e-mail: users-help@pdfbox.apache.org
>
>

Re: Numbers get reversed sometimes during conversion

Posted by Andreas Lehmkühler <an...@lehmi.de>.
> Shyam Sundar <sw...@gmail.com> hat am 2. Juni 2016 um 09:18
> geschrieben:
> 
> 
> Hi,
> 
> Wondering if you got a chance to check this ...

First thing to be done in such cases is to do the "Adobe Reader test". It fails,
the text can't be extracted using Acrobat Reader, so we are better ;-) Anyway,
mixed LTR/RTL text is always tricky to handle, see PDFBOX-2252

No solution so far, if there is any at all.

BR
Andreas
> 
> Thanks.
> 
> On Wed, Jun 1, 2016 at 1:48 AM, Shyam Sundar <sw...@gmail.com> wrote:
> 
> > Hi Andreas,
> >
> > I have just uploaded the files at below location -
> >
> >
> > https://ftp.emc.com/action/login?domain=ftp.emc.com&username=7rdPxvIJU&password=mKymXA2KyB
> >
> > I tried both, but whether I sort or not doesn't make any difference in the
> > output.
> >
> > Thanks,
> > Shyam
> >

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org


Re: Numbers get reversed sometimes during conversion

Posted by Shyam Sundar <sw...@gmail.com>.
Hi,

Wondering if you got a chance to check this ...

Thanks.

On Wed, Jun 1, 2016 at 1:48 AM, Shyam Sundar <sw...@gmail.com> wrote:

> Hi Andreas,
>
> I have just uploaded the files at below location -
>
>
> https://ftp.emc.com/action/login?domain=ftp.emc.com&username=7rdPxvIJU&password=mKymXA2KyB
>
> I tried both, but whether I sort or not doesn't make any difference in the
> output.
>
> Thanks,
> Shyam
>

Re: Numbers get reversed sometimes during conversion

Posted by Shyam Sundar <sw...@gmail.com>.
Hi Andreas,

I have just uploaded the files at below location -

https://ftp.emc.com/action/login?domain=ftp.emc.com&username=7rdPxvIJU&password=mKymXA2KyB

I tried both, but whether I sort or not doesn't make any difference in the
output.

Thanks,
Shyam