Posted to dev@nutch.apache.org by Rohit Kulkarni <ro...@gmail.com> on 2005/04/04 05:36:49 UTC

Does Nutch have these features?

Hello everyone, 

I would like to know whether the following features are present in Nutch:
1) PostScript file parsing support
2) MS Excel file parsing support
3) Whether a search restricted by file type (pdf, ps, xls, ppt, doc, etc.)
can be given as a query (similar to filetype: in Google). If yes, what
syntax should be used?
4) Whether a search within the URL can be given as a query, for example
matching the word "research" in URLs like www.ibm.com/research or
www.research.ibm.com (similar to inurl: in Google). If yes, what syntax
should be used?
5) Whether a search within the page title can be done (similar to
intitle: or allintitle: in Google). If yes, what syntax should be used?

I would highly appreciate your answers to these questions.

Thanks in advance,

Regards,
Rohit