Posted to user@nutch.apache.org by "ned@bcit" <ne...@yahoo.com> on 2007/12/07 22:12:55 UTC
Re: Where should I place directory "crawl" which include index and db of fetching website?
Did you restart your JSP server after you placed the crawl file?
张世勇 wrote:
>
> Hi, I'm a beginner with Nutch and JSP. For the past few days I have been
> trying to use Nutch's search function through a web server. My JSP server
> is resin-3.1.3. I followed the "Nutch version 0.8.x tutorial" and
> everything worked. I then placed nutch-0.9.war in Resin's webapp
> directory and moved the crawl directory generated by Nutch to
> .../nutch-0.9/crawl. I also modified the configuration file
> nutch-default.xml, located in .../WEB-INF/classes:
> ...
> <property>
>   <name>searcher.dir</name>
>   <value>/crawl</value>
>   <description>
>   Path to root of crawl. This directory is searched (in
>   order) for either the file search-servers.txt, containing a list of
>   distributed search servers, or the directory "index" containing
>   merged indexes, or the directory "segments" containing segment
>   indexes.
>   </description>
> </property>
> ...
> But I never get any search results when I search for the keyword
> "apache" from http://localhost:8080/nutch-0.9; the result count is
> always 0. I checked the crawl result with the command
> "bin/nutch org.apache.nutch.searcher.NutchBean apache", and it indicated
> that there should be many results for the keyword "apache". Has anybody
> encountered this problem before? Can anyone tell me how to resolve it?
> Thanks. Br Buck
>
--
View this message in context: http://www.nabble.com/Where-should-I-place-directory-%22crawl%22-which-include-index-and-db-of-fetching-website--tf4958753.html#a14220693
Sent from the Nutch - User mailing list archive at Nabble.com.
Re:Re: Where should I place directory "crawl" which include index and db of fetching website?
Posted by 张世勇 <zs...@126.com>.
Hi ned,
Yes, I have restarted the JSP server. I have also tried placing the crawl
directory in the root directory, so its real path on Linux should be \crawl,
but it still failed.
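One way to narrow this down is to check what actually exists at the path that searcher.dir resolves to. The following is only a minimal sketch in Python, not part of Nutch: the function name find_search_source is hypothetical, and the lookup order is taken solely from the searcher.dir description quoted above (search-servers.txt, then "index", then "segments"). Note that a value beginning with "/" such as /crawl is an absolute filesystem path on Linux, not a path inside the web application.

```python
import os
import tempfile


def find_search_source(searcher_dir):
    """Return the first search source found under searcher_dir, or None.

    Hypothetical helper: it follows the order in the searcher.dir
    description (search-servers.txt, then "index", then "segments").
    It does not call or reproduce any Nutch code.
    """
    root = os.path.abspath(searcher_dir)
    if os.path.isfile(os.path.join(root, "search-servers.txt")):
        return "search-servers.txt"
    for name in ("index", "segments"):
        if os.path.isdir(os.path.join(root, name)):
            return name
    return None


# Demonstration on a throwaway directory so the example runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    crawl = os.path.join(tmp, "crawl")
    os.makedirs(os.path.join(crawl, "segments"))
    print(find_search_source(crawl))  # segments
    os.makedirs(os.path.join(crawl, "index"))
    print(find_search_source(crawl))  # index
    with open(os.path.join(crawl, "search-servers.txt"), "w") as f:
        f.write("")
    print(find_search_source(crawl))  # search-servers.txt
    print(find_search_source(os.path.join(tmp, "missing")))  # None
```

Running find_search_source("/crawl") on the server, as the same user that runs the JSP server, would show whether any of the three expected entries is present at that location. A result of None would suggest the path, rather than the index itself, is the problem.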
On 2007-12-08, "ned@bcit" <ne...@yahoo.com> wrote:
>Did you restart your JSP server after you placed the crawl file?