You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Julian Qian <ju...@gmail.com> on 2007/08/17 21:07:29 UTC
how to config nutch to know the index place
Hi,
In the tutorial "The webapp finds its indexes in ./crawl, relative to
where you start Tomcat,"
I had put crawl under webapps and nutch-09, but none of them work.
Then I found in another article it said:
1.
In the web application deployment directory, open the
'\WEB-INF\classes\nutch-site.xml' file in a text editor.
2.
Change the values of the tags as follows and save the changes.
-
<property>
<name>searcher.dir </name>
<value>/home/tyrell/nutch-0.7/crawl.virtusa </value>
</property>
Since I run this in windows, I set
<value>c:\nutch-09\crawl</value>
But it won't work. Please help me.
Thanks a lot
Julian
Re: how to config nutch to know the index place
Posted by Julian Qian <ju...@gmail.com>.
I had test my index use netbean, so the index has content in it.
I am using resin, here is the resin log:
2007-08-17 15:09:03,299 INFO NutchBean - creating new bean
2007-08-17 15:09:03,409 INFO NutchBean - opening indexes in crawl/indexes
2007-08-17 15:09:03,737 INFO Configuration - found resource
common-terms.utf8 a
t file:/C:/resin-3.0.24/webapps/nutch-0.9/WEB-INF/classes/common-terms.utf8
2007-08-17 15:09:03,768 INFO NutchBean - opening segments in crawl/segments
2007-08-17 15:09:03,862 INFO SummarizerFactory - Using the first summarizer
ext
ension found: Basic Summarizer
2007-08-17 15:09:03,862 INFO NutchBean - opening linkdb in crawl/linkdb
2007-08-17 15:09:03,909 INFO NutchBean - query request from 127.0.0.1
2007-08-17 15:09:03,956 INFO NutchBean - query: nutch
2007-08-17 15:09:03,956 INFO NutchBean - lang: en
[15:09:04.034] resin-file: init
2007-08-17 15:09:04,049 INFO NutchBean - searching for 20 raw hits
2007-08-17 15:09:04,284 INFO NutchBean - total hits: 0
On 8/17/07, Julian Qian <ju...@gmail.com> wrote:
>
> Hi,
> In the tutorial "The webapp finds its indexes in ./crawl, relative to
> where you start Tomcat,"
> I had put crawl under webapps and nutch-09, but none of them work.
>
> Then I found in another article it said:
>
> 1.
>
> In the web application deployment directory, open the
> '\WEB-INF\classes\nutch-site.xml' file in a text editor.
> 2.
>
> Change the values of the tags as follows and save the changes.
> -
>
> <property>
> <name>searcher.dir </name>
> <value>/home/tyrell/nutch-0.7/crawl.virtusa </value>
> </property>
>
>
>
> Since I run this in windows, I set
> <value>c:\nutch-09\crawl</value>
> But it won't work. Please help me.
> Thanks a lot
> Julian
>