Posted to user@nutch.apache.org by foobar3001 <fo...@yahoo.com> on 2008/05/27 00:54:42 UTC

Re: Searching in sub-section of site

I should add some additional information that could be important:

I am using Nutch 0.9 and testing the results from the command line, like so:

       $ nutch-0.9/bin/nutch org.apache.nutch.searcher.NutchBean "... [search terms] ..."

Also, in the nutch-site.xml file, I have enabled the site and url query
plugins, or at least I think I have. The relevant lines are these:

    <property>
        <name>plugin.includes</name>
        <value>protocol-http|urlfilter-regex|parse-(text|html|oo|pdf|msword|msexcel|mspowerpoint|zip|)|analysis-(fr|en|de)|index-basic|query-(basic|site|url)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
        ....
    </property>

As far as I can tell, the query-url and query-site plugins are specified.
However, even when I try "url:..." as part of the query string, it seems to
disregard the 'url:' portion and treat whatever comes after it as a
search term in the full text, rather than as something that limits the results
by the URL.
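
One way to sanity-check that the plugin.includes value really covers query-site and query-url is to test the plugin ids against the regular expression directly. The sketch below is only an illustration: Nutch itself is Java, and I am assuming that a plugin counts as enabled when its id matches the whole pattern. The helper name is_included is made up for this example, and the pattern is the one copied from the nutch-site.xml snippet above (it uses only regex syntax that behaves the same in Python and Java).

```python
import re

# plugin.includes value copied from the nutch-site.xml snippet in this message
PLUGIN_INCLUDES = (
    "protocol-http|urlfilter-regex|"
    "parse-(text|html|oo|pdf|msword|msexcel|mspowerpoint|zip|)|"
    "analysis-(fr|en|de)|index-basic|query-(basic|site|url)|"
    "summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)"
)


def is_included(plugin_id, includes=PLUGIN_INCLUDES):
    """Hypothetical helper. Assumption: a plugin is enabled only when its
    id matches the entire plugin.includes pattern."""
    return re.fullmatch(includes, plugin_id) is not None


if __name__ == "__main__":
    # "query-more" is listed only as an id the pattern should reject
    for plugin_id in ("query-basic", "query-site", "query-url", "query-more"):
        print(plugin_id, is_included(plugin_id))
```

If both query-site and query-url come back as matching, the pattern itself is probably not the problem, and the cause would have to be somewhere else (for example, which copy of the configuration the command-line tool actually reads).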


-- 
View this message in context: http://www.nabble.com/Searching-in-sub-section-of-site-tp17479657p17480344.html
Sent from the Nutch - User mailing list archive at Nabble.com.