Posted to user@nutch.apache.org by Bill Goffe <go...@Oswego.EDU> on 2005/10/12 00:00:26 UTC
nutch-site.xml Not Invoked in Searches
I'm having some trouble with getting nutch-site.xml invoked when I perform
a search. However, it is invoked in fetches -- I see it in the Nutch
log and better yet, in the logs of sites I fetch I see the modified
value of http.agent.email.
But when I perform a search, I don't see the various boosts I also
modified in nutch-site.xml when I look at the explanation of search
results. This is even after copying nutch-0.7.1.war to the relevant Tomcat
directory as ROOT.war and then starting up Tomcat. (Is this needed after
each new round of fetching and indexing?)
However, I do see the boosts if I copy conf/nutch-site.xml into Tomcat's
webapps/ROOT/WEB-INF/classes. I did read the discussion of nutch-site.xml
in webapps/ROOT/WEB-INF/classes in the FAQ, but that only seems to deal
with the location of segments.
I'm using Nutch 0.7.1 and Tomcat 5.0.28. Any ideas? Do I indeed need to
copy nutch-site.xml to the relevant Tomcat directory each time I modify
it? Am I missing something that should be automatic?
- Bill
--
*------------------------------------------------------*
| Bill Goffe goffe@oswego.edu |
| Department of Economics voice: (315) 312-3444 |
| SUNY Oswego fax: (315) 312-5444 |
| 416 Mahar Hall <http://cook.rfe.org> |
| Oswego, NY 13126 |
*--------*------------------------------------------------------*-----------*
| "You're not federalized." |
| -- What FEMA staff told doctors on why they couldn't help dying |
| patients who were evacuated to the New Orleans airport after |
| Hurricane Katrina. "Going (Down) by the Book," John Tierney, New |
| York Times (Op-Ed section), September 17, 2005. |
*---------------------------------------------------------------------------*
Re: nutch-site.xml Not Invoked in Searches
Posted by Gal Nitzan <gn...@usa.net>.
Gal Nitzan wrote:
> Hi,
>
> You need to copy only once. After Tomcat extracts the files you may
> delete the war.
>
> Indeed, copy nutch-site.xml to WEB-INF/classes; that is where the
> searcher picks up its configuration from.
>
> Gal
P.S. Yes, after every indexing run you have to stop the server
(bin/nutch-daemon.sh stop server) and also stop Tomcat. I know it sounds
weird, but after no one answered me I looked it up in the code, and there
it was: when the servlet is initialized (init), it reads the file
system segments and index, and that is it. It does not look at them
again until it is restarted.
Gal
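
A minimal sketch of the restart sequence described in the P.S. above. The
"stop server" command is the one quoted there; the matching "start server"
line with a port and index directory, the Tomcat shutdown.sh/startup.sh
calls, and the function and variable names are assumptions for
illustration, so check them against your own installation. Setting
DRY_RUN=1 only prints the commands instead of running them.

```shell
#!/bin/sh
# run: execute a command, or only print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# restart_search NUTCH_HOME CATALINA_HOME PORT INDEX_DIR
# (hypothetical helper; all four arguments are assumed paths/values)
restart_search() {
    nutch_home="$1"; catalina_home="$2"; port="$3"; index_dir="$4"

    # Stop the search server and Tomcat so the old index is released.
    run "$nutch_home/bin/nutch-daemon.sh" stop server
    run "$catalina_home/bin/shutdown.sh"

    # Start both again; the servlet reads segments and index during init.
    # Tomcat shuts down asynchronously, so a short wait may be needed here.
    run "$nutch_home/bin/nutch-daemon.sh" start server "$port" "$index_dir"
    run "$catalina_home/bin/startup.sh"
}
```

Trying it first as DRY_RUN=1 restart_search "$NUTCH_HOME" "$CATALINA_HOME"
9999 /path/to/crawl shows the four commands without touching anything.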
Re: nutch-site.xml Not Invoked in Searches
Posted by Gal Nitzan <gn...@usa.net>.
Hi,
You need to copy only once. After Tomcat extracts the files you may
delete the war.
Indeed, copy nutch-site.xml to WEB-INF/classes; that is where the searcher
picks up its configuration from.
Gal
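
The copy step above could be scripted roughly as follows. This is only a
sketch: the function name deploy_site_conf and the default NUTCH_HOME and
CATALINA_HOME locations are hypothetical, and it assumes Tomcat has
already unpacked ROOT.war into webapps/ROOT.

```shell
#!/bin/sh
# Assumed install locations; adjust to your own setup.
NUTCH_HOME="${NUTCH_HOME:-$HOME/nutch-0.7.1}"
CATALINA_HOME="${CATALINA_HOME:-$HOME/jakarta-tomcat-5.0.28}"

# deploy_site_conf NUTCH_HOME CATALINA_HOME
# Copies conf/nutch-site.xml into the unpacked webapp's classes directory,
# which is where the searcher looks for its configuration.
deploy_site_conf() {
    src="$1/conf/nutch-site.xml"
    dest="$2/webapps/ROOT/WEB-INF/classes"

    [ -f "$src" ] || { echo "missing $src" >&2; return 1; }
    [ -d "$dest" ] || { echo "missing $dest (ROOT.war not unpacked yet?)" >&2; return 1; }

    cp "$src" "$dest/nutch-site.xml"
}
```

Run it as deploy_site_conf "$NUTCH_HOME" "$CATALINA_HOME" after each edit
to nutch-site.xml. Tomcat probably needs a restart afterwards for the
search webapp to reread the file.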