Posted to user@tika.apache.org by "Allison, Timothy B." <ta...@mitre.org> on 2016/05/02 14:19:13 UTC

RE: is it possible to batch extract text from pdf files within a tree of folders within a zip file ?

>> While PDFBox is a part of TIKA and the two projects are kindof "best friends forever"
Thank you, Tilman! :)


-----Original Message-----
From: Tilman Hausherr [mailto:THausherr@t-online.de] 
Sent: Saturday, April 30, 2016 5:24 PM
To: users@pdfbox.apache.org
Subject: Re: is it possible to batch extract text from pdf files within a tree of folders within a zip file ?

On 30.04.2016 at 19:46, David Green wrote:
> You may gather that I am new to this.
> My original zip files containing PDF files are on my F: drive. I want
> the unpacked text files saved in an identical directory structure on
> my G: drive. I have tried:
>      java -jar tika-app.X.Y.jar -J -t -i <f:> -o <g:>
> which resulted in "syntax error".
> Can you please suggest what I'm doing wrong?

You're on the wrong mailing list. This is the PDFBox mailing list. While PDFBox is a part of TIKA and the two projects are kindof "best friends forever", this doesn't mean that PDFBox users all know how to use TIKA.

However, I suspect that you actually typed the "<" and ">" characters. In usage examples they only mark a placeholder that you replace with your own value; they are not part of the command. So your command line would probably be

java -jar tika-app.X.Y.jar -J -t -i f: -o g:
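
To illustrate the placeholder point, here is a rough sketch of a POSIX shell helper that prints the command above and refuses arguments that still carry the angle brackets. The name tika_batch_cmd is made up for this example and is not part of Tika; the flags are simply copied from the command in this thread. Most shells, cmd.exe included, treat "<" and ">" as redirection operators, which is a likely source of the "syntax error".

```shell
# Hypothetical helper (not part of Tika): print the batch command for an
# input and an output directory, rejecting arguments that still contain
# the "<" or ">" placeholder marks from the usage text.
tika_batch_cmd() {
  for arg in "$1" "$2"; do
    case "$arg" in
      *"<"*|*">"*)
        echo "remove the < and > around: $arg" >&2
        return 1
        ;;
    esac
  done
  # Flags are copied verbatim from the command quoted in this thread.
  printf 'java -jar tika-app.X.Y.jar -J -t -i %s -o %s\n' "$1" "$2"
}

tika_batch_cmd f: g:
```

Calling it with '<f:>' (quoted, so the shell passes it through) returns a non-zero status instead of printing a command.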



Tilman



