Posted to user@nutch.apache.org by WebDev Freak <we...@gmail.com> on 2006/09/26 02:34:54 UTC
Subcollections / Index Filters Questions
I just started using Nutch to index HTML, text, Microsoft documents, and PDFs.
Our application is Struts-based, and we query Nutch directly from it. The
query I have right now basically searches the whole site. I am trying to
figure out two things:
1. How to create search collections, i.e. search the entire site, the Computer
section, the Audio section, the TV section, etc. I've found a bit of info on
subcollections and index filters. I am pressed for time and trying to avoid
extending Nutch. Is there any way to create such collections in Nutch?
2. How would I programmatically access these collections from within my
application? I.e. search just on Computers, Computer Peripherals, or
Storage Devices?
Any feedback, or links to ways to do this, would be appreciated.
Thanks,