Posted to user@nutch.apache.org by Julian Qian <ju...@gmail.com> on 2007/08/17 21:07:29 UTC

how to config nutch to know the index place

Hi,
   The tutorial says: "The webapp finds its indexes in ./crawl, relative to
where you start Tomcat."
    I have tried putting the crawl directory under webapps and under nutch-09,
but neither works.

   Then I found another article that said:

   1. In the web application deployment directory, open the
      '\WEB-INF\classes\nutch-site.xml' file in a text editor.
   2. Change the values of the tags as follows and save the changes.

           <property>
                <name>searcher.dir </name>
                <value>/home/tyrell/nutch-0.7/crawl.virtusa </value>
           </property>



     Since I am running this on Windows, I set
<value>c:\nutch-09\crawl</value>
   but it does not work. Please help me.
Thanks a lot
Julian
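
For reference, a minimal sketch of what such an override might look like in a complete nutch-site.xml on Windows. The path is taken from the message above and is only an assumption about where the crawl output lives. Forward slashes are used because Java accepts them in Windows paths, and the name and value are written without surrounding spaces in case the configuration loader does not trim them.

```xml
<?xml version="1.0"?>
<configuration>
  <property>
    <!-- No leading or trailing whitespace inside the tags. -->
    <name>searcher.dir</name>
    <!-- Assumed location of the crawl output; adjust to the real directory. -->
    <value>C:/nutch-09/crawl</value>
  </property>
</configuration>
```

The file has to be the copy the deployed webapp actually loads (WEB-INF\classes\nutch-site.xml under the deployment directory), and the servlet container probably needs a restart before the change is picked up.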

Re: how to config nutch to know the index place

Posted by Julian Qian <ju...@gmail.com>.
I have tested my index using netbean, so I know the index has content in it.
I am using Resin; here is the Resin log:
2007-08-17 15:09:03,299 INFO  NutchBean - creating new bean
2007-08-17 15:09:03,409 INFO  NutchBean - opening indexes in crawl/indexes
2007-08-17 15:09:03,737 INFO  Configuration - found resource common-terms.utf8 at file:/C:/resin-3.0.24/webapps/nutch-0.9/WEB-INF/classes/common-terms.utf8
2007-08-17 15:09:03,768 INFO  NutchBean - opening segments in crawl/segments
2007-08-17 15:09:03,862 INFO  SummarizerFactory - Using the first summarizer extension found: Basic Summarizer
2007-08-17 15:09:03,862 INFO  NutchBean - opening linkdb in crawl/linkdb
2007-08-17 15:09:03,909 INFO  NutchBean - query request from 127.0.0.1
2007-08-17 15:09:03,956 INFO  NutchBean - query: nutch
2007-08-17 15:09:03,956 INFO  NutchBean - lang: en
[15:09:04.034] resin-file: init
2007-08-17 15:09:04,049 INFO  NutchBean - searching for 20 raw hits
2007-08-17 15:09:04,284 INFO  NutchBean - total hits: 0
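
Note that the log opens crawl/indexes, crawl/segments and crawl/linkdb as relative paths, which is what a searcher.dir value of plain "crawl" would produce. A sketch of what the default entry in nutch-default.xml is believed to look like (reproduced from memory, so this is an assumption to verify against the local copy):

```xml
<!-- Assumed default from nutch-default.xml; check your own copy. -->
<property>
  <name>searcher.dir</name>
  <!-- Relative value: resolved against the directory the
       servlet container was started from. -->
  <value>crawl</value>
</property>
```

If the override in WEB-INF\classes\nutch-site.xml were taking effect, the log would presumably show the configured path instead of crawl/indexes, so it may be worth checking which copy of nutch-site.xml the webapp is reading.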


