Posted to user@nutch.apache.org by Mr Shore <sh...@gmail.com> on 2008/10/07 13:11:48 UTC

issue with search.jsp in nutch-0.9.war

I ran bin/nutch crawl successfully to generate an index called "crawl",
and I can search that index from the command line with "bin/nutch
org.apache.nutch.searcher.NutchBean test", which returns the following:
Total hits: 1
 0 20081007031237/http://localhost:81/1.txt
test

That is correct, since the "crawl" index is very small and exists only for
testing.
Next I switched to search.jsp in nutch-0.9.war. I put the war file in
$TOMCAT_HOME/webapps, where Tomcat unpacked it automatically, and then I modified
$TOMCAT_HOME/webapps/nutch-0.9/WEB-INF/classes/nutch-site.xml as follows:
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

<!-- Put site-specific property overrides in this file. -->

<configuration>
<property>
        <name>searcher.dir</name>
        <value>/root/crawl</value>
</property>
</configuration>
This points exactly at the "crawl" index,
but every query I submit returns 0 matches.
You can try it out at http://www3.jobirn.com:8080/nutch-0.9/
Any advice is greatly appreciated!
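
One quick way to confirm which value the web application would read is to parse the file and look up the property by name. The sketch below is illustrative only: it embeds a copy of the configuration shown above as a string so that it runs on its own, and read_property is a hypothetical helper, not part of Nutch. It also shows that a misspelled property name simply yields nothing.

```python
import xml.etree.ElementTree as ET

# Copy of the nutch-site.xml from the post, embedded so the sketch is self-contained.
# Note that the XML declaration must be the very first thing in the document.
SAMPLE = """<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
        <name>searcher.dir</name>
        <value>/root/crawl</value>
</property>
</configuration>
"""


def read_property(xml_text, name):
    """Return the <value> of the <property> whose <name> matches, or None."""
    root = ET.fromstring(xml_text)
    for prop in root.iter("property"):
        if (prop.findtext("name") or "").strip() == name:
            return (prop.findtext("value") or "").strip()
    return None


print(read_property(SAMPLE, "searcher.dir"))  # /root/crawl
print(read_property(SAMPLE, "seach.dir"))     # None (misspelled name)
```

To check a deployed copy instead, read the text of the nutch-site.xml under WEB-INF/classes and pass it to the same function.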
-- 
View this message in context: http://www.nabble.com/issue-with-search.jsp-in-nutch-0.9.war-tp19855907p19855907.html
Sent from the Nutch - User mailing list archive at Nabble.com.


Re: issue with search.jsp in nutch-0.9.war

Posted by Mr Shore <sh...@gmail.com>.
I'm using Fedora 9.
The last time I deployed Nutch, it was on Fedora 8.

[root@www3 classes]# cat /etc/issue
Fedora release 9 (Sulphur)
Kernel \r on an \m (\l)




Re: issue with search.jsp in nutch-0.9.war

Posted by Brian Ulicny <bu...@alum.mit.edu>.
What operating system are you using?  Are you using a shell on Windows?

Brian

-- 
  Brian Ulicny
  bulicny at alum dot mit dot edu
  home: 781-721-5746
  fax: 360-361-5746



Re: issue with search.jsp in nutch-0.9.war

Posted by Mr Shore <sh...@gmail.com>.
I don't think the path is the cause.
I checked the Tomcat log and found that it opened the right index,
that is, "crawl" in my case.
Thanks for your advice all the same.
Mr Shore
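
Beyond the log entry, it can help to check the directory itself while logged in as the user that Tomcat runs as. The following is a minimal sketch under stated assumptions: the subdirectory names (segments, plus either index or indexes) are assumed from the output that "bin/nutch crawl" typically produces in Nutch 0.9, and check_crawl_dir is a hypothetical helper, not a Nutch tool.

```python
import os

# Assumed layout of a crawl directory; adjust if your crawl output differs.
REQUIRED = ("segments",)
INDEX_CANDIDATES = ("index", "indexes")


def check_crawl_dir(path):
    """Return a list of problems found; an empty list means none were found."""
    if not os.path.isdir(path):
        return ["not a directory: " + path]
    problems = []
    # The current user needs read and traverse permission on the directory.
    if not os.access(path, os.R_OK | os.X_OK):
        problems.append("not readable by the current user: " + path)
    for name in REQUIRED:
        if not os.path.isdir(os.path.join(path, name)):
            problems.append("missing subdirectory: " + name)
    if not any(os.path.isdir(os.path.join(path, n)) for n in INDEX_CANDIDATES):
        problems.append("missing index directory (expected index or indexes)")
    return problems


if __name__ == "__main__":
    # /root/crawl is the value of searcher.dir from the original post.
    print(check_crawl_dir("/root/crawl"))
```

The result depends on which user runs the script, so running it as root and as the Tomcat user may give different answers.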



Re: issue with search.jsp in nutch-0.9.war

Posted by Brian Ulicny <bu...@alum.mit.edu>.
No, that's a relative path.  It says, look inside the subdirectory root
of the current directory from which you've deployed the .war file.

Generally, if you run the .war file from a different location than you
run your NutchBean, you will need to modify the path accordingly.

Brian
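
For reference, the difference between absolute and relative values can be sketched with Python's posixpath module, which applies POSIX path rules on any platform. This is only an illustration: /usr/local/tomcat is a hypothetical working directory, resolve is a hypothetical helper, and the sketch assumes that a relative value is resolved against the working directory of the Tomcat process, as described above. Under POSIX rules a value with a leading slash is treated as absolute.

```python
import posixpath


def resolve(configured, working_dir):
    """Show how a POSIX system would interpret a configured directory value."""
    if posixpath.isabs(configured):
        # Absolute values are used as-is, regardless of the working directory.
        return configured
    # Relative values are joined onto the working directory.
    return posixpath.normpath(posixpath.join(working_dir, configured))


# "/usr/local/tomcat" is a made-up working directory for illustration.
print(resolve("/root/crawl", "/usr/local/tomcat"))  # /root/crawl
print(resolve("root/crawl", "/usr/local/tomcat"))   # /usr/local/tomcat/root/crawl
print(resolve("./crawl", "/usr/local/tomcat"))      # /usr/local/tomcat/crawl
```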





Re: issue with search.jsp in nutch-0.9.war

Posted by Mr Shore <sh...@gmail.com>.
Do you mean the value of "searcher.dir" in nutch-site.xml?
It is already an absolute path;
in my case it's /root/crawl.
Mr Shore



Re: issue with search.jsp in nutch-0.9.war

Posted by Brian Ulicny <bu...@alum.mit.edu>.
Try putting in a non-relative path to your crawl directory.

Brian
