You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Maurice Coyle <ma...@ucd.ie> on 2003/07/07 18:06:40 UTC

lucene handling different document formats

could anyone tell me if there's some sort of repository somewhere that
contains parsers for document types such as .doc, .pdf, .xls?  or how i'd
begin to go about thinking to write one (tutorials etc much appreciated)

thanks,
maurice