You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@nutch.apache.org by "ned@bcit" <ne...@yahoo.com> on 2007/12/07 22:12:55 UTC

Re: Where should I place directory "crawl" which include indhttp://www.nabble.com/help/Answer.jtp?id=17ex and db of fetching website?

Did you restart your JSP server after you placed the crawl file? 


张世勇 wrote:
> 
> Hi I'm a beginner for nutch and JSP. In these days I always tried to use
> nutch seach function via web server. My JSP server is resin-3.1.3. I have
> done according to "Nutch version 0.8.x tutorial" and all is OK. Now I
> placed nutch-0.9.war in resin's webapp directory, and moved crawl
> directory generated by nutch to .../nutch-0.9/crawl, I also modified the
> configuration file: nutch-default.xml located in directory
> .../WEB-INF/classes ...<property>
>   <name>searcher.dir</name>
>   <value>/crawl</value>
>   <description>
>   Path to root of crawl.  This directory is searched (in
>   order) for either the file search-servers.txt, containing a list of
>   distributed search servers, or the directory "index" containing
>   merged indexes, or the directory "segments" containing segment
>   indexes.
>   </description>
> </property>... But I have never get the search result when I tried to
> search keyword "apache" fromhttp://localhost:8080/nutch-0.9. and search
> result is alwayse 0. I have checked the crawl result by command:
> "bin/nutch org.apache.nutch.searcher.NutchBean apache". and it indicated
> that there should be many result for keyword "apache". Does anybody also
> encounter this problem before? who can tell me how I should do to resolve
> this problem? Thanks. Br Buck     
> 

-- 
View this message in context: http://www.nabble.com/Where-should-I-place-directory-%22crawl%22-which-include-index-and-db-of-fetching-website--tf4958753.html#a14220693
Sent from the Nutch - User mailing list archive at Nabble.com.

Re:Re: Where should I place directory "crawl" which include

Posted by 张世勇 <zs...@126.com>.

 indhttp://www.nabble.com/help/Answer.jtp?id=17ex and db of fetching

 website?
MIME-Version: 1.0
Content-Type: multipart/alternative; 
	boundary="----=_Part_2964_13394802.1197152084964"
X-Originating-IP: [192.168.208.35 (80.248.250.234)]
X-Priority: 3
X-Mailer: Coremail Webmail Server Version XTx build 071203(4346.1425.1420)
 Copyright (c) 2002-2007 www.mailtech.cn 126com

------=_Part_2964_13394802.1197152084964
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Hi ned

Yes, I have restarted the JSP server. I have also tried to place the crawl =
directory to root directory, so it real directory on linux should be \crawl=
, but still failed.

=E5=9C=A82007-12-08=EF=BC=8C"ned@bcit" <ne...@yahoo.com> =E5=86=99=E9=81=
=93=EF=BC=9A
>Did you restart your JSP server after you placed the crawl file?=20
>
>
>=E5=BC=A0=E4=B8=96=E5=8B=87 wrote:
>&gt;=20
>&gt; Hi I'm a beginner for nutch and JSP. In these days I always tried to =
use
>&gt; nutch seach function via web server. My JSP server is resin-3.1.3. I =
have
>&gt; done according to "Nutch version 0.8.x tutorial" and all is OK. Now I
>&gt; placed nutch-0.9.war in resin's webapp directory, and moved crawl
>&gt; directory generated by nutch to .../nutch-0.9/crawl, I also modified =
the
>&gt; configuration file: nutch-default.xml located in directory
>&gt; .../WEB-INF/classes ...&lt;property&gt;
>&gt;   &lt;name&gt;searcher.dir&lt;/name&gt;
>&gt;   &lt;value&gt;/crawl&lt;/value&gt;
>&gt;   &lt;description&gt;
>&gt;   Path to root of crawl.  This directory is searched (in
>&gt;   order) for either the file search-servers.txt, containing a list of
>&gt;   distributed search servers, or the directory "index" containing
>&gt;   merged indexes, or the directory "segments" containing segment
>&gt;   indexes.
>&gt;   &lt;/description&gt;
>&gt; &lt;/property&gt;... But I have never get the search result when I tr=
ied to
>&gt; search keyword "apache" fromhttp://localhost:8080/nutch-0.9. and sear=
ch
>&gt; result is alwayse 0. I have checked the crawl result by command:
>&gt; "bin/nutch org.apache.nutch.searcher.NutchBean apache". and it indica=
ted
>&gt; that there should be many result for keyword "apache". Does anybody a=
lso
>&gt; encounter this problem before? who can tell me how I should do to res=
olve
>&gt; this problem? Thanks. Br Buck    =20
>&gt;=20
>
>--=20
>View this message in context: http://www.nabble.com/Where-should-I-place-d=
irectory-%22crawl%22-which-include-index-and-db-of-fetching-website--tf4958=
753.html#a14220693
>Sent from the Nutch - User mailing list archive at Nabble.com.
>
>
------=_Part_2964_13394802.1197152084964
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Hi ned<br><br>Yes, I have restarted the JSP server. I have also tried to pl=
ace the crawl directory to root directory, so it real directory on linux sh=
ould be \crawl, but still failed.<br><br>=E5=9C=A82007-12-08=EF=BC=8C"ned@b=
cit" &lt;ned_Dew@yahoo.com&gt; =E5=86=99=E9=81=93=EF=BC=9A<br>&gt;Did you r=
estart your JSP server after you placed the crawl file? <br>&gt;<br>&gt;<br=
>&gt;=E5=BC=A0=E4=B8=96=E5=8B=87 wrote:<br>&gt;&amp;gt; <br>&gt;&amp;gt; Hi=
 I'm a beginner for nutch and JSP. In these days I always tried to use<br>&=
gt;&amp;gt; nutch seach function via web server. My JSP server is resin-3.1=
.3. I have<br>&gt;&amp;gt; done according to "Nutch version 0.8.x tutorial"=
 and all is OK. Now I<br>&gt;&amp;gt; placed nutch-0.9.war in resin's webap=
p directory, and moved crawl<br>&gt;&amp;gt; directory generated by nutch t=
o .../nutch-0.9/crawl, I also modified the<br>&gt;&amp;gt; configuration fi=
le: nutch-default.xml located in directory<br>&gt;&amp;gt; .../WEB-INF/clas=
ses ...&amp;lt;property&amp;gt;<br>&gt;&amp;gt;   &amp;lt;name&amp;gt;searc=
her.dir&amp;lt;/name&amp;gt;<br>&gt;&amp;gt;   &amp;lt;value&amp;gt;/crawl&=
amp;lt;/value&amp;gt;<br>&gt;&amp;gt;   &amp;lt;description&amp;gt;<br>&gt;=
&amp;gt;   Path to root of crawl.  This directory is searched (in<br>&gt;&a=
mp;gt;   order) for either the file search-servers.txt, containing a list o=
f<br>&gt;&amp;gt;   distributed search servers, or the directory "index" co=
ntaining<br>&gt;&amp;gt;   merged indexes, or the directory "segments" cont=
aining segment<br>&gt;&amp;gt;   indexes.<br>&gt;&amp;gt;   &amp;lt;/descri=
ption&amp;gt;<br>&gt;&amp;gt; &amp;lt;/property&amp;gt;... But I have never=
 get the search result when I tried to<br>&gt;&amp;gt; search keyword "apac=
he" fromhttp://localhost:8080/nutch-0.9. and search<br>&gt;&amp;gt; result =
is alwayse 0. I have checked the crawl result by command:<br>&gt;&amp;gt; "=
bin/nutch org.apache.nutch.searcher.NutchBean apache". and it indicated<br>=
&gt;&amp;gt; that there should be many result for keyword "apache". Does an=
ybody also<br>&gt;&amp;gt; encounter this problem before? who can tell me h=
ow I should do to resolve<br>&gt;&amp;gt; this problem? Thanks. Br Buck    =
 <br>&gt;&amp;gt; <br>&gt;<br>&gt;-- <br>&gt;View this message in context: =
http://www.nabble.com/Where-should-I-place-directory-%22crawl%22-which-incl=
ude-index-and-db-of-fetching-website--tf4958753.html#a14220693<br>&gt;Sent =
from the Nutch - User mailing list archive at Nabble.com.<br>&gt;<br>&gt;<b=
r>
------=_Part_2964_13394802.1197152084964--