Posted to java-user@lucene.apache.org by Stephane James Vaucher <va...@cirano.qc.ca> on 2004/04/20 18:08:25 UTC
Using Runtime.exec to extract text [Was: Bridge with OO]
In case you don't know: when using Runtime.exec() on Windows, you need to
consume the output streams of the application, or it will block. This is
not the case on Linux.
http://www.javaworld.com/javaworld/jw-12-2000/jw-1229-traps.html
In short:
Because some native platforms only provide limited buffer size for
standard input and output streams, failure to promptly write the input
stream or read the output stream of the subprocess may cause the
subprocess to block, and even deadlock.
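
A minimal sketch of what "consuming the output streams" can look like, with
one thread per stream so that neither pipe fills up while the other is being
read. The names ExecDrain, StreamGobbler, Result and run are made up for
illustration, and everything is buffered in memory, which assumes the
extracted text is reasonably small:

```java
import java.io.ByteArrayOutputStream;
import java.io.File;
import java.io.IOException;
import java.io.InputStream;

public class ExecDrain {

    /** Copies one stream into memory on its own thread (hypothetical helper). */
    public static final class StreamGobbler extends Thread {
        private final InputStream in;
        private final ByteArrayOutputStream buffer = new ByteArrayOutputStream();

        StreamGobbler(InputStream in) {
            this.in = in;
        }

        @Override
        public void run() {
            byte[] chunk = new byte[4096];
            int n;
            try {
                // Keep reading until the child closes its end of the pipe.
                while ((n = in.read(chunk)) != -1) {
                    buffer.write(chunk, 0, n);
                }
            } catch (IOException e) {
                // Stream closed underneath us; keep whatever was read so far.
            }
        }

        String text() {
            // Decoded with the platform default charset.
            return buffer.toString();
        }
    }

    /** Exit code plus captured stdout and stderr of the child process. */
    public static final class Result {
        public final int exitCode;
        public final String stdout;
        public final String stderr;

        Result(int exitCode, String stdout, String stderr) {
            this.exitCode = exitCode;
            this.stdout = stdout;
            this.stderr = stderr;
        }
    }

    public static Result run(String... command) throws IOException, InterruptedException {
        Process process = Runtime.getRuntime().exec(command);
        // We send nothing to the child, so close its stdin right away.
        process.getOutputStream().close();

        // Start both readers BEFORE waiting, otherwise waitFor() can hang
        // once the child has filled one of its output buffers.
        StreamGobbler out = new StreamGobbler(process.getInputStream());
        StreamGobbler err = new StreamGobbler(process.getErrorStream());
        out.start();
        err.start();

        int exitCode = process.waitFor();
        out.join();
        err.join();
        return new Result(exitCode, out.text(), err.text());
    }

    public static void main(String[] args) throws Exception {
        // Demo command: the running JVM's own launcher, so no extra tool is needed.
        String javaBin = System.getProperty("java.home")
                + File.separator + "bin" + File.separator + "java";
        Result r = run(javaBin, "-version");
        System.out.println("exit code: " + r.exitCode);
        System.out.println("stdout: " + r.stdout);
        System.out.println("stderr: " + r.stderr);
    }
}
```

For a converter such as antiword, the same run(...) call would take the
tool's path and the document path as arguments instead of the demo command.
Note that this only addresses blocking on full output buffers; a tool that
hangs for other reasons would still need some kind of timeout.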
HTH,
sv
On Tue, 20 Apr 2004, Argyn wrote:
> I have the same requirement. I used antiword, xlhtml and ppthtml on win2k. I
> called them with Runtime.exec(). There are still problems: all three hang
> sometimes. Otherwise, it worked. I indexed several hundred thousand
> files in development mode. I never got into production.
>
> Argyn
>
>
> On Mon, 19 Apr 2004 16:53:41 -0400 (EDT), Stephane James Vaucher
> <va...@cirano.qc.ca> wrote:
>
> > Actually, the objective would be to use OO to extract text from MSOffice
> > formats. If I read your code correctly, it should only work with OO,
> > as the docs are in XML.
> >
> > Thanks for the code for OO docs though,
> > sv
> >
> > On Mon, 19 Apr 2004, Mario Ivankovits wrote:
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>