Posted to java-user@lucene.apache.org by "Natarajan.T" <na...@crimsonlogic.co.in> on 2004/07/21 16:37:12 UTC
Use of Converters or Parsers
Hi Guys,
I have a small question: if the Lucene 1.4 APIs can directly index all
document types (PPT, PDF, Word, etc.), why do we need converters or parsers?
Thanks,
Natarajan.
RE: Use of Converters or Parsers
Posted by "Natarajan.T" <na...@crimsonlogic.co.in>.
Ok Thanks.
-----Original Message-----
From: Otis Gospodnetic [mailto:otis_gospodnetic@yahoo.com]
Sent: Wednesday, July 21, 2004 9:33 PM
To: Lucene Users List
Subject: Re: Use of Converters or Parsers
Lucene cannot parse the document formats you mentioned; it only indexes
the text you hand to it. You need third-party parsers to extract that text
first. For example, POI parses Excel and MS Word documents, and PDFBox
parses PDF.
Otis
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
Re: Use of Converters or Parsers
Posted by Otis Gospodnetic <ot...@yahoo.com>.
Lucene cannot parse the document formats you mentioned; it only indexes
the text you hand to it. You need third-party parsers to extract that text
first. For example, POI parses Excel and MS Word documents, and PDFBox
parses PDF.
Otis
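To make the division of labour concrete, here is a rough sketch of the "extract text first, then index" flow. Everything in it is hypothetical and uses only the JDK: TextExtractor stands in for a format-specific parser such as POI or PDFBox, TinyIndex stands in for a Lucene index, and the ".toy" format is invented for the demo. It deliberately shows no real Lucene, POI, or PDFBox calls, since those differ by version.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

public class ExtractThenIndex {

    /** Hypothetical stand-in for a format-specific parser (the role POI or PDFBox plays). */
    interface TextExtractor {
        String extract(byte[] rawFile);
    }

    /** Toy stand-in for a Lucene index: maps each lowercased term to the ids of documents containing it. */
    static class TinyIndex {
        private final Map<String, Set<String>> postings = new HashMap<>();

        /** Indexes plain text only; it knows nothing about file formats. */
        void add(String docId, String plainText) {
            for (String term : plainText.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
                if (!term.isEmpty()) {
                    postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
                }
            }
        }

        Set<String> search(String term) {
            return postings.getOrDefault(term.toLowerCase(Locale.ROOT), new TreeSet<>());
        }
    }

    /** Demo extractors: plain text passes through; the invented ".toy" format has its tags stripped. */
    static Map<String, TextExtractor> demoExtractors() {
        Map<String, TextExtractor> extractors = new HashMap<>();
        extractors.put("txt", raw -> new String(raw, StandardCharsets.UTF_8));
        extractors.put("toy", raw -> new String(raw, StandardCharsets.UTF_8).replaceAll("<[^>]*>", " "));
        return extractors;
    }

    /** Picks an extractor by file extension, extracts the text, then hands it to the index. */
    static void indexFile(TinyIndex index, Map<String, TextExtractor> extractors,
                          String fileName, byte[] rawFile) {
        String ext = fileName.substring(fileName.lastIndexOf('.') + 1).toLowerCase(Locale.ROOT);
        TextExtractor extractor = extractors.get(ext);
        if (extractor == null) {
            // No parser for this format: the index cannot make sense of the raw bytes.
            throw new IllegalArgumentException("No parser registered for ." + ext);
        }
        index.add(fileName, extractor.extract(rawFile));
    }

    public static void main(String[] args) {
        TinyIndex index = new TinyIndex();
        Map<String, TextExtractor> extractors = demoExtractors();
        indexFile(index, extractors, "notes.txt",
                "Lucene indexes plain text".getBytes(StandardCharsets.UTF_8));
        indexFile(index, extractors, "report.toy",
                "<b>Quarterly</b> report text".getBytes(StandardCharsets.UTF_8));
        System.out.println(index.search("text"));
    }
}
```

In a real application the extractor for each format would wrap the corresponding third-party library, and the extracted string would be added to a Lucene document instead of the toy index.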