You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Bill Goffe <go...@Oswego.EDU> on 2005/10/12 00:00:26 UTC

nutch-site.xml Not Invoked in Searches

I'm having some trouble with getting nutch-site.xml invoked when I perform
a search. However, it is invoked in fetches -- I see it in the Nutch
log and better yet, in the logs of sites I fetch I see the modified
value of http.agent.email. 

But, when I perform a search I don't see the the various boosts I also
modified in nutch-site.xml when I look at the explanation of search
results. This is even after copying nutch-0.7.1.war to the relevant Tomcat
directory as ROOT.war and then starting up Tomcat. (Is this needed after
each new round of fetching and indexing?).

However, I do seem the boosts if I copy conf/nutch-site.xml into Tomcat's
webapps/ROOT/WEB-INF/classes. I did read the discussion of nutch-site.xml
in webapps/ROOT/WEB-INF/classes in the FAQ, but that only seems to deal
with the location of segments.

I'm using Nutch 0.7.1 and Tomcat 5.0.28. Any ideas? Do I indeed need to
copy nutch-site.xml to the relevant Tomcat directory each time I modify
it? Am I missing something that should be automatic?

         - Bill

-- 
         *------------------------------------------------------*
         | Bill Goffe                 goffe@oswego.edu          |
         | Department of Economics    voice: (315) 312-3444     |
         | SUNY Oswego                fax:   (315) 312-5444     |
         | 416 Mahar Hall             <http://cook.rfe.org>     |          
         | Oswego, NY  13126                                    |
*--------*------------------------------------------------------*-----------*
| "You're not federalized."                                                 |
|   -- What FEMA staff told doctors on why they couldn't help dying         |
|      patients who were evacuated to the New Orleans airport after         |
|      Hurricane Katrina. "Going (Down) by the Book," John Tierney, New     |
|      York Times (Op-Ed section), September 17, 2005.                      |
*---------------------------------------------------------------------------*

Re: nutch-site.xml Not Invoked in Searches

Posted by Gal Nitzan <gn...@usa.net>.
Gal Nitzan wrote:
> Hi,
>
> you need to copy only once. after tomcat extract the files you may 
> delete the war.
>
> Indeed, copy the site.xml to WEB-INF/classes that is where the 
> searcher pick up configuration from.
>
> Gal
P.S. Yes after every indexing you have to stop the server 
(bin/nutch-daemon.sh stop server) and also stop Tomcat. I know it sounds 
weird but after no one answered me, I looked it up in the code and there 
it was - When the servlet is initialized (init), it reads the file 
system segments and index and that is it.

Gal

>
> Bill Goffe wrote:
>> I'm having some trouble with getting nutch-site.xml invoked when I 
>> perform
>> a search. However, it is invoked in fetches -- I see it in the Nutch
>> log and better yet, in the logs of sites I fetch I see the modified
>> value of http.agent.email.
>> But, when I perform a search I don't see the the various boosts I also
>> modified in nutch-site.xml when I look at the explanation of search
>> results. This is even after copying nutch-0.7.1.war to the relevant 
>> Tomcat
>> directory as ROOT.war and then starting up Tomcat. (Is this needed after
>> each new round of fetching and indexing?).
>>
>> However, I do seem the boosts if I copy conf/nutch-site.xml into 
>> Tomcat's
>> webapps/ROOT/WEB-INF/classes. I did read the discussion of 
>> nutch-site.xml
>> in webapps/ROOT/WEB-INF/classes in the FAQ, but that only seems to deal
>> with the location of segments.
>>
>> I'm using Nutch 0.7.1 and Tomcat 5.0.28. Any ideas? Do I indeed need to
>> copy nutch-site.xml to the relevant Tomcat directory each time I modify
>> it? Am I missing something that should be automatic?
>>
>>          - Bill
>>
>>   
>
>
>
> .
>



Re: nutch-site.xml Not Invoked in Searches

Posted by Gal Nitzan <gn...@usa.net>.
Hi,

you need to copy only once. after tomcat extract the files you may 
delete the war.

Indeed, copy the site.xml to WEB-INF/classes that is where the searcher 
pick up configuration from.

Gal

Bill Goffe wrote:
> I'm having some trouble with getting nutch-site.xml invoked when I perform
> a search. However, it is invoked in fetches -- I see it in the Nutch
> log and better yet, in the logs of sites I fetch I see the modified
> value of http.agent.email. 
>
> But, when I perform a search I don't see the the various boosts I also
> modified in nutch-site.xml when I look at the explanation of search
> results. This is even after copying nutch-0.7.1.war to the relevant Tomcat
> directory as ROOT.war and then starting up Tomcat. (Is this needed after
> each new round of fetching and indexing?).
>
> However, I do seem the boosts if I copy conf/nutch-site.xml into Tomcat's
> webapps/ROOT/WEB-INF/classes. I did read the discussion of nutch-site.xml
> in webapps/ROOT/WEB-INF/classes in the FAQ, but that only seems to deal
> with the location of segments.
>
> I'm using Nutch 0.7.1 and Tomcat 5.0.28. Any ideas? Do I indeed need to
> copy nutch-site.xml to the relevant Tomcat directory each time I modify
> it? Am I missing something that should be automatic?
>
>          - Bill
>
>