You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Michael Fingerhut <Mi...@ircam.fr> on 2007/02/23 12:47:47 UTC
Installation problem
Hello everyone,
I have installed nutch 0.8.1 on Windows XP SP2, together with Apache Tomcat
6.0 and the Java SE Runtime environment (java version 1.6.0).
Installation went fine. I managed to index (through the command line, as
indicated in the tutorial) a few intranet pages and to do a successful
search through the bin/nutch command.
However, when I attempted the same search through Tomcat (which correctly
shows the nutch home page), it returned 0 hits (and no error messages).
The logs show:
2007-02-23 12:44:50,593 INFO NutchBean - creating new bean
2007-02-23 12:44:50,609 INFO NutchBean - opening indexes in crawl/indexes
2007-02-23 12:44:50,671 INFO Configuration - found resource
common-terms.utf8 at
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%206.0/webapps/nutch/WEB-INF/classes/common-terms.utf8
2007-02-23 12:44:50,687 INFO NutchBean - opening segments in crawl/segments
2007-02-23 12:44:50,703 INFO SummarizerFactory - Using the first summarizer
extension found: Basic Summarizer
2007-02-23 12:44:50,703 INFO NutchBean - opening linkdb in crawl/linkdb
2007-02-23 12:44:50,718 INFO NutchBean - query request from 127.0.0.1
2007-02-23 12:44:50,734 INFO NutchBean - query: ircam
2007-02-23 12:44:50,734 INFO NutchBean - lang: fr
2007-02-23 12:44:50,750 INFO NutchBean - searching for 20 raw hits
2007-02-23 12:44:50,812 INFO NutchBean - total hits: 0
2007-02-23 12:45:17,109 INFO NutchBean - query request from 127.0.0.1
2007-02-23 12:45:17,109 INFO NutchBean - query: ircam
2007-02-23 12:45:17,109 INFO NutchBean - lang: fr
2007-02-23 12:45:17,109 INFO NutchBean - searching for 20 raw hits
2007-02-23 12:45:17,109 INFO NutchBean - total hits: 0
Any idea why this works only from the command line and not with Tomcat?
Thanks!
MF
Re: Installation problem
Posted by Doğacan Güney <do...@gmail.com>.
Hi,
On 2/23/07, Michael Fingerhut <Mi...@ircam.fr> wrote:
> Hello everyone,
>
> I have installed nutch 0.8.1 on Windows XP SP2, together with Apache Tomcat
> 6.0 and the Java SE Runtime environment (java version 1.6.0).
>
> Installation went fine. I managed to index (through the command line, as
> indicated in the tutorial) a few intranet pages and to do a successful
> search through the bin/nutch command.
>
> However, when I attempted the same search through Tomcat (which correctly
> shows the nutch home page), it returned 0 hits (and no error messages).
>
> The logs show:
>
> 2007-02-23 12:44:50,593 INFO NutchBean - creating new bean
> 2007-02-23 12:44:50,609 INFO NutchBean - opening indexes in crawl/indexes
> 2007-02-23 12:44:50,671 INFO Configuration - found resource
> common-terms.utf8 at
> file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%206.0/webapps/nutch/WEB-INF/classes/common-terms.utf8
> 2007-02-23 12:44:50,687 INFO NutchBean - opening segments in crawl/segments
> 2007-02-23 12:44:50,703 INFO SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2007-02-23 12:44:50,703 INFO NutchBean - opening linkdb in crawl/linkdb
> 2007-02-23 12:44:50,718 INFO NutchBean - query request from 127.0.0.1
> 2007-02-23 12:44:50,734 INFO NutchBean - query: ircam
> 2007-02-23 12:44:50,734 INFO NutchBean - lang: fr
> 2007-02-23 12:44:50,750 INFO NutchBean - searching for 20 raw hits
> 2007-02-23 12:44:50,812 INFO NutchBean - total hits: 0
> 2007-02-23 12:45:17,109 INFO NutchBean - query request from 127.0.0.1
> 2007-02-23 12:45:17,109 INFO NutchBean - query: ircam
> 2007-02-23 12:45:17,109 INFO NutchBean - lang: fr
> 2007-02-23 12:45:17,109 INFO NutchBean - searching for 20 raw hits
> 2007-02-23 12:45:17,109 INFO NutchBean - total hits: 0
>
> Any idea why this works only from the command line and not with Tomcat?
Nutch has a configuration option called searcher.dir which by default
is "crawl" (like yours). When tomcat starts it searches for the
searcher.dir directory under your current path (unless searcher.dir is
absolute). So you have to start tomcat under the right directory or
give searcher.dir an absolute path.
>
> Thanks!
>
> MF
>
>
--
Doğacan Güney
[SOLVED] Installation problem
Posted by Michael Fingerhut <Mi...@ircam.fr>.
Dogacan Guney <do...@gmail.com> wrote:
> So you have to start tomcat under the right directory
I had done it, and it hadn't worked. I then reinstalled everything but that
time started Tomcat as a service (rather than from the command line from
within the right directory).
I then followed Dima Mazmanov <nu...@proservice.ge> and your advice to
insert the path in the nutch-site.xml file - it now works.
Thanks to both of you.
Michael
Re: Installation problem
Posted by Dima Mazmanov <nu...@proservice.ge>.
Hi,Michael.
Did you insert path to your database in nutch-site.xml file in Tomcat
ROOT directory?
You wrote 23 февраля 2007 г., 15:47:47:
> Hello everyone,
> I have installed nutch 0.8.1 on Windows XP SP2, together with Apache Tomcat
> 6.0 and the Java SE Runtime environment (java version 1.6.0).
> Installation went fine. I managed to index (through the command line, as
> indicated in the tutorial) a few intranet pages and to do a successful
> search through the bin/nutch command.
> However, when I attempted the same search through Tomcat (which correctly
> shows the nutch home page), it returned 0 hits (and no error messages).
> The logs show:
> 2007-02-23 12:44:50,593 INFO NutchBean - creating new bean
> 2007-02-23 12:44:50,609 INFO NutchBean - opening indexes in crawl/indexes
> 2007-02-23 12:44:50,671 INFO Configuration - found resource
> common-terms.utf8 at
> file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%206.0/webapps/nutch/WEB-INF/classes/common-terms.utf8
> 2007-02-23 12:44:50,687 INFO NutchBean - opening segments in crawl/segments
> 2007-02-23 12:44:50,703 INFO SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2007-02-23 12:44:50,703 INFO NutchBean - opening linkdb in crawl/linkdb
> 2007-02-23 12:44:50,718 INFO NutchBean - query request from 127.0.0.1
> 2007-02-23 12:44:50,734 INFO NutchBean - query: ircam
> 2007-02-23 12:44:50,734 INFO NutchBean - lang: fr
> 2007-02-23 12:44:50,750 INFO NutchBean - searching for 20 raw hits
> 2007-02-23 12:44:50,812 INFO NutchBean - total hits: 0
> 2007-02-23 12:45:17,109 INFO NutchBean - query request from 127.0.0.1
> 2007-02-23 12:45:17,109 INFO NutchBean - query: ircam
> 2007-02-23 12:45:17,109 INFO NutchBean - lang: fr
> 2007-02-23 12:45:17,109 INFO NutchBean - searching for 20 raw hits
> 2007-02-23 12:45:17,109 INFO NutchBean - total hits: 0
> Any idea why this works only from the command line and not with Tomcat?
> Thanks!
> MF
> __________ NOD32 2076 (20070222) Information __________
> This message was checked by NOD32 antivirus system.
> http://www.eset.com
--
Regards,
Dima mailto:nuther@proservice.ge