Posted to user@nutch.apache.org by WebDev Freak <we...@gmail.com> on 2006/09/26 02:34:54 UTC

Subcollections / Index Filters Questions

I just started using Nutch to index HTML, text, Microsoft documents, and PDFs.
Our application is Struts-based, and we are querying Nutch directly from our
application. The query I have right now basically searches the whole site.
I am trying to figure out two things:

1. How to create search collections, i.e. search the entire site, the Computer
section, the Audio section, the TV section, etc. I've found a bit of info on
subcollections and index filters. I am pressed for time and trying to avoid
extending Nutch. Is there any way to create such collections in Nutch?

2. How would I programmatically access these collections from within my
application? I.e., search just on Computers, or Computer Peripherals, or
Storage Devices?

Any feedback, or links to ways to do this, would be appreciated.

Thanks,