You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by "Czech, Christian" <c....@elo.com> on 2012/05/15 16:29:44 UTC

Problems with text extraction

Hi,

where could I send the PDF documents  that have problems with text extraction?

Thanks

Christian

Mit freundlichen Grüßen

Christian Czech
Software-Entwicklung

ELO Digital Office GmbH
Heilbronner Str. 150, D-70191 Stuttgart
Tel.:      +49 (0) 711 806089-0
Fax:       +49 (0) 711 806089-39
E-Mail:  c.czech@elo.com<mailto:c.czech@elo.com;>
Web:     www.elo.com<http://www.elo.com>

[Beschreibung: ELOpress]<http://www.etracker.de/lnkcnt.php?et=9J3MuV&url=http://elo.com/wcm/de/service/elo-buecher&lnkname=email_press_de>
Alle ELO Bücher finden Sie hier<http://www.etracker.de/lnkcnt.php?et=9J3MuV&url=http://elo.com/wcm/de/service/elo-buecher&lnkname=email_press_de>
________________________________
[Beschreibung: C:\Dokumente und Einstellungen\Czech\Anwendungsdaten\Microsoft\Signatures\icon_thinkgreen.gif]  Please think before you print.


________________________________

ELO Digital Office GmbH
Firmensitz: Heilbronner Strasse 150, 70191 Stuttgart
Fon: +49 711 806089-0, Fax: +49 711 806089-19, Web: www.elo.com
Geschäftsführer: Karl Heinz Mosbach, Matthias Thiele
BW-Bank, Konto-Nr. 2089782, BLZ 600 501 01
Registergericht Stuttgart HRB 15059 - USt-IdNr.: DE812471516

AW: Problems with text extraction

Posted by "Czech, Christian" <c....@elo.com>.
Hi,

thank you very mach for you answer. But the sort-option isn't the solution of this problem.
I'll sent my problem with attachment one more time.

Thanks

Christian

Mit freundlichem Gruß

Christian Czech
Software Entwicklung
____________________________________
ELO Digital Office GmbH
Heilbronnerstr. 150, D-70191 Stuttgart
Tel. +49 (0) 711/806089-0
Fax: +49 (0) 711/806089-19
E- Mail: c.czech@elo.com; Internet: www.elo.com


-----Ursprüngliche Nachricht-----
Von: Andreas Lehmkuehler [mailto:andreas@lehmi.de]
Gesendet: Dienstag, 15. Mai 2012 17:40
An: users@pdfbox.apache.org
Betreff: Re: Problems with text extraction

Hi,

Am 15.05.2012 16:29, schrieb Czech, Christian:
> Hi,
>
> where could I send the PDF documents that have problems with text extraction?
We are using JIRA [1] as bugtracker. Our infra people just updated our instance and there were some bug reports about attachments, so probably it won't work until they'll fix it.

I saw your former mails. The extracted text seems to be unsorted. Did you ever try to use the sort-option? This may solve your issue.

Otherwise check the already existing issues first, before you create a new one to avoid duplicates.

>
> Thanks
>
> Christian

BR
Andreas Lehmkühler

[1] https://issues.apache.org/jira/browse/PDFBOX


________________________________

ELO Digital Office GmbH
Firmensitz: Heilbronner Strasse 150, 70191 Stuttgart
Fon: +49 711 806089-0, Fax: +49 711 806089-19, Web: www.elo.com
Geschäftsführer: Karl Heinz Mosbach, Matthias Thiele
BW-Bank, Konto-Nr. 2089782, BLZ 600 501 01
Registergericht Stuttgart HRB 15059 - USt-IdNr.: DE812471516

Re: Problems with text extraction

Posted by Andreas Lehmkuehler <an...@lehmi.de>.
Hi,

Am 15.05.2012 16:29, schrieb Czech, Christian:
> Hi,
>
> where could I send the PDF documents that have problems with text extraction?
We are using JIRA [1] as bugtracker. Our infra people just updated our instance 
and there were some bug reports about attachments, so probably it won't work 
until they'll fix it.

I saw your former mails. The extracted text seems to be unsorted. Did you ever 
try to use the sort-option? This may solve your issue.

Otherwise check the already existing issues first, before you create a new one 
to avoid duplicates.

>
> Thanks
>
> Christian

BR
Andreas Lehmkühler

[1] https://issues.apache.org/jira/browse/PDFBOX