You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by Andreas Lehmkühler <an...@lehmi.de> on 2010/05/31 17:34:29 UTC

Is PDFToTextTask still needed?

Hi,

I was wondering if the example PDFToTextTask [1] is still needed. 
It is the only part of PDFBox which depends on ant. It is a simple 
wrapper, which enables a PDF2Text conversion as task within ant.
If we think about minimizing dependencies, IMO this is one we can
remove without any substantial loss.

WDYT??


BR
Andreas Lehmkühler

P.S.: To avoid missunderstandings, I don't want to remove the ant building
skript included in PDFBox ;-)

[1] http://svn.apache.org/repos/asf/pdfbox/trunk/pdfbox/src/main/java/org/apache/pdfbox/ant/PDFToTextTask.java

Re: Is PDFToTextTask still needed?

Posted by Andreas Lehmkuehler <an...@lehmi.de>.
Hi,

Jukka Zitting schrieb:
> Hi,
> 
> 2010/5/31 Andreas Lehmkühler <an...@lehmi.de>:
>> I was wondering if the example PDFToTextTask [1] is still needed.
>> It is the only part of PDFBox which depends on ant. It is a simple
>> wrapper, which enables a PDF2Text conversion as task within ant.
>> If we think about minimizing dependencies, IMO this is one we can
>> remove without any substantial loss.
> 
> How about, instead of removing the functionality, we move it into a
> separate pdfbox-ant component that depends on the main pdfbox jar? We
> could/should do the same also for the Lucene integration stuff.
Yes, that's also an alternative, probably a better one.

BR
Andreas Lehmkühler

Re: Is PDFToTextTask still needed?

Posted by Jukka Zitting <ju...@gmail.com>.
Hi,

2010/5/31 Andreas Lehmkühler <an...@lehmi.de>:
> I was wondering if the example PDFToTextTask [1] is still needed.
> It is the only part of PDFBox which depends on ant. It is a simple
> wrapper, which enables a PDF2Text conversion as task within ant.
> If we think about minimizing dependencies, IMO this is one we can
> remove without any substantial loss.

How about, instead of removing the functionality, we move it into a
separate pdfbox-ant component that depends on the main pdfbox jar? We
could/should do the same also for the Lucene integration stuff.

BR,

Jukka Zitting

Re: Is PDFToTextTask still needed?

Posted by Johannes Koch <jo...@fit.fraunhofer.de>.
Andreas Lehmkühler schrieb:
> Hi,
> 
> I was wondering if the example PDFToTextTask [1] is still needed. 
> It is the only part of PDFBox which depends on ant. It is a simple 
> wrapper, which enables a PDF2Text conversion as task within ant.
> If we think about minimizing dependencies, IMO this is one we can
> remove without any substantial loss.
> 
> WDYT??

No objection from my side.

-- 
Johannes Koch
Fraunhofer Institute for Applied Information Technology FIT
Web Compliance Center
Schloss Birlinghoven, D-53757 Sankt Augustin, Germany
Phone: +49-2241-142628    Fax: +49-2241-142065