Posted to users@pdfbox.apache.org by "Czech, Christian" <c....@elo.com> on 2012/05/15 16:29:44 UTC
Problems with text extraction
Hi,
where could I send the PDF documents that have problems with text extraction?
Thanks
Christian
Kind regards
Christian Czech
Software Development
ELO Digital Office GmbH
Heilbronner Str. 150, D-70191 Stuttgart
Tel.: +49 (0) 711 806089-0
Fax: +49 (0) 711 806089-39
E-Mail: c.czech@elo.com
Web: www.elo.com
________________________________
ELO Digital Office GmbH
Registered office: Heilbronner Strasse 150, 70191 Stuttgart
Phone: +49 711 806089-0, Fax: +49 711 806089-19, Web: www.elo.com
Managing directors: Karl Heinz Mosbach, Matthias Thiele
BW-Bank, account no. 2089782, bank code 600 501 01
Commercial register: Stuttgart HRB 15059 - VAT ID: DE812471516
Re: Problems with text extraction
Posted by "Czech, Christian" <c....@elo.com>.
Hi,
thank you very much for your answer, but the sort option doesn't solve this problem.
I'll send my problem with the attachment once more.
Thanks
Christian
Kind regards
Christian Czech
Software Development
____________________________________
ELO Digital Office GmbH
Heilbronnerstr. 150, D-70191 Stuttgart
Tel. +49 (0) 711/806089-0
Fax: +49 (0) 711/806089-19
E-Mail: c.czech@elo.com; Internet: www.elo.com
-----Original Message-----
From: Andreas Lehmkuehler [mailto:andreas@lehmi.de]
Sent: Tuesday, 15 May 2012 17:40
To: users@pdfbox.apache.org
Subject: Re: Problems with text extraction
Hi,
On 15.05.2012 16:29, Czech, Christian wrote:
> Hi,
>
> where could I send the PDF documents that have problems with text extraction?
We use JIRA [1] as our bug tracker. Our infra people just updated our instance, and there have been some bug reports about attachments, so attaching files probably won't work until they fix it.
I saw your earlier mails. The extracted text seems to be unsorted. Have you tried the sort option? It may solve your issue.
Otherwise, check the existing issues before you create a new one, to avoid duplicates.
>
> Thanks
>
> Christian
BR
Andreas Lehmkühler
[1] https://issues.apache.org/jira/browse/PDFBOX
Re: Problems with text extraction
Posted by Andreas Lehmkuehler <an...@lehmi.de>.
Hi,
On 15.05.2012 16:29, Czech, Christian wrote:
> Hi,
>
> where could I send the PDF documents that have problems with text extraction?
We use JIRA [1] as our bug tracker. Our infra people just updated our instance,
and there have been some bug reports about attachments, so attaching files
probably won't work until they fix it.
I saw your earlier mails. The extracted text seems to be unsorted. Have you
tried the sort option? It may solve your issue.
Otherwise, check the existing issues before you create a new one, to avoid
duplicates.
>
> Thanks
>
> Christian
BR
Andreas Lehmkühler
[1] https://issues.apache.org/jira/browse/PDFBOX
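
For readers following this thread: the sort option mentioned above is exposed in PDFBox as PDFTextStripper.setSortByPosition(true) and as the -sort switch of the ExtractText command-line tool. The sketch below is not PDFBox code. It is a simplified, standard-library-only illustration of the idea behind such an option: text fragments arrive in content-stream order and are reordered top to bottom, then left to right. The class SortByPositionSketch, the Fragment type, and the sample coordinates are all hypothetical and exist only for this example.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class SortByPositionSketch {

    /** Hypothetical text fragment: its text plus a position on the page. */
    static final class Fragment {
        final String text;
        final double x; // distance from the left edge of the page
        final double y; // distance from the top edge of the page

        Fragment(String text, double x, double y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    /** Orders fragments top to bottom, then left to right, and joins them into lines. */
    static String joinSorted(List<Fragment> fragments) {
        List<Fragment> sorted = new ArrayList<>(fragments);
        sorted.sort(Comparator.comparingDouble((Fragment f) -> f.y)
                              .thenComparingDouble(f -> f.x));

        StringBuilder out = new StringBuilder();
        Fragment previous = null;
        for (Fragment f : sorted) {
            if (previous != null) {
                // Simplification: fragments with an identical y are treated as one line.
                out.append(f.y == previous.y ? ' ' : '\n');
            }
            out.append(f.text);
            previous = f;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Fragments in an arbitrary order, as they might appear in a content stream.
        List<Fragment> fragments = new ArrayList<>();
        fragments.add(new Fragment("line", 60, 120));
        fragments.add(new Fragment("world", 50, 100));
        fragments.add(new Fragment("second", 10, 120));
        fragments.add(new Fragment("Hello", 10, 100));

        System.out.println(joinSorted(fragments));
    }
}
```

A real extractor has to cope with slightly different baselines, rotated text, and multiple columns, which is one reason why sorting by position alone does not fix every extraction problem, as reported earlier in this thread.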