You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Michael Fingerhut <Mi...@ircam.fr> on 2007/02/23 12:47:47 UTC

Installation problem

Hello everyone,

I have installed nutch 0.8.1 on Windows XP SP2, together with Apache Tomcat 
6.0 and the Java SE Runtime environment (java version 1.6.0).

Installation went fine. I managed to index (through the command line, as 
indicated in the tutorial) a few intranet pages and to do a successful 
search through the bin/nutch command.

However, when I attempted the same search through Tomcat (which correctly 
shows the nutch home page), it returned 0 hits (and no error messages).

The logs show:

2007-02-23 12:44:50,593 INFO  NutchBean - creating new bean
2007-02-23 12:44:50,609 INFO  NutchBean - opening indexes in crawl/indexes
2007-02-23 12:44:50,671 INFO  Configuration - found resource 
common-terms.utf8 at 
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%206.0/webapps/nutch/WEB-INF/classes/common-terms.utf8
2007-02-23 12:44:50,687 INFO  NutchBean - opening segments in crawl/segments
2007-02-23 12:44:50,703 INFO  SummarizerFactory - Using the first summarizer 
extension found: Basic Summarizer
2007-02-23 12:44:50,703 INFO  NutchBean - opening linkdb in crawl/linkdb
2007-02-23 12:44:50,718 INFO  NutchBean - query request from 127.0.0.1
2007-02-23 12:44:50,734 INFO  NutchBean - query: ircam
2007-02-23 12:44:50,734 INFO  NutchBean - lang: fr
2007-02-23 12:44:50,750 INFO  NutchBean - searching for 20 raw hits
2007-02-23 12:44:50,812 INFO  NutchBean - total hits: 0
2007-02-23 12:45:17,109 INFO  NutchBean - query request from 127.0.0.1
2007-02-23 12:45:17,109 INFO  NutchBean - query: ircam
2007-02-23 12:45:17,109 INFO  NutchBean - lang: fr
2007-02-23 12:45:17,109 INFO  NutchBean - searching for 20 raw hits
2007-02-23 12:45:17,109 INFO  NutchBean - total hits: 0

Any idea why this works only from the command line and not with Tomcat?

Thanks!

MF 


Re: Installation problem

Posted by Doğacan Güney <do...@gmail.com>.
Hi,

On 2/23/07, Michael Fingerhut <Mi...@ircam.fr> wrote:
> Hello everyone,
>
> I have installed nutch 0.8.1 on Windows XP SP2, together with Apache Tomcat
> 6.0 and the Java SE Runtime environment (java version 1.6.0).
>
> Installation went fine. I managed to index (through the command line, as
> indicated in the tutorial) a few intranet pages and to do a successful
> search through the bin/nutch command.
>
> However, when I attempted the same search through Tomcat (which correctly
> shows the nutch home page), it returned 0 hits (and no error messages).
>
> The logs show:
>
> 2007-02-23 12:44:50,593 INFO  NutchBean - creating new bean
> 2007-02-23 12:44:50,609 INFO  NutchBean - opening indexes in crawl/indexes
> 2007-02-23 12:44:50,671 INFO  Configuration - found resource
> common-terms.utf8 at
> file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%206.0/webapps/nutch/WEB-INF/classes/common-terms.utf8
> 2007-02-23 12:44:50,687 INFO  NutchBean - opening segments in crawl/segments
> 2007-02-23 12:44:50,703 INFO  SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2007-02-23 12:44:50,703 INFO  NutchBean - opening linkdb in crawl/linkdb
> 2007-02-23 12:44:50,718 INFO  NutchBean - query request from 127.0.0.1
> 2007-02-23 12:44:50,734 INFO  NutchBean - query: ircam
> 2007-02-23 12:44:50,734 INFO  NutchBean - lang: fr
> 2007-02-23 12:44:50,750 INFO  NutchBean - searching for 20 raw hits
> 2007-02-23 12:44:50,812 INFO  NutchBean - total hits: 0
> 2007-02-23 12:45:17,109 INFO  NutchBean - query request from 127.0.0.1
> 2007-02-23 12:45:17,109 INFO  NutchBean - query: ircam
> 2007-02-23 12:45:17,109 INFO  NutchBean - lang: fr
> 2007-02-23 12:45:17,109 INFO  NutchBean - searching for 20 raw hits
> 2007-02-23 12:45:17,109 INFO  NutchBean - total hits: 0
>
> Any idea why this works only from the command line and not with Tomcat?

Nutch has a configuration option called searcher.dir which by default
is "crawl" (like yours). When tomcat starts it searches for the
searcher.dir directory under your current path (unless searcher.dir is
absolute). So you have to start tomcat under the right directory or
give searcher.dir an absolute path.

>
> Thanks!
>
> MF
>
>


-- 
Doğacan Güney

[SOLVED] Installation problem

Posted by Michael Fingerhut <Mi...@ircam.fr>.
Dogacan Guney <do...@gmail.com> wrote:

> So you have to start tomcat under the right directory

I had done it, and it hadn't worked. I then reinstalled everything but that 
time started Tomcat as a service (rather than from the command line from 
within the right directory).

I then followed Dima Mazmanov <nu...@proservice.ge> and your advice to 
insert the path in the nutch-site.xml file - it now works.

Thanks to both of you.

Michael 


Re: Installation problem

Posted by Dima Mazmanov <nu...@proservice.ge>.
Hi,Michael.

Did you insert path to your database in nutch-site.xml file in Tomcat
ROOT directory?
You wrote 23 февраля 2007 г., 15:47:47:

> Hello everyone,

> I have installed nutch 0.8.1 on Windows XP SP2, together with Apache Tomcat
> 6.0 and the Java SE Runtime environment (java version 1.6.0).

> Installation went fine. I managed to index (through the command line, as
> indicated in the tutorial) a few intranet pages and to do a successful
> search through the bin/nutch command.

> However, when I attempted the same search through Tomcat (which correctly
> shows the nutch home page), it returned 0 hits (and no error messages).

> The logs show:

> 2007-02-23 12:44:50,593 INFO  NutchBean - creating new bean
> 2007-02-23 12:44:50,609 INFO  NutchBean - opening indexes in crawl/indexes
> 2007-02-23 12:44:50,671 INFO  Configuration - found resource 
> common-terms.utf8 at 
> file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%206.0/webapps/nutch/WEB-INF/classes/common-terms.utf8
> 2007-02-23 12:44:50,687 INFO  NutchBean - opening segments in crawl/segments
> 2007-02-23 12:44:50,703 INFO  SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2007-02-23 12:44:50,703 INFO  NutchBean - opening linkdb in crawl/linkdb
> 2007-02-23 12:44:50,718 INFO  NutchBean - query request from 127.0.0.1
> 2007-02-23 12:44:50,734 INFO  NutchBean - query: ircam
> 2007-02-23 12:44:50,734 INFO  NutchBean - lang: fr
> 2007-02-23 12:44:50,750 INFO  NutchBean - searching for 20 raw hits
> 2007-02-23 12:44:50,812 INFO  NutchBean - total hits: 0
> 2007-02-23 12:45:17,109 INFO  NutchBean - query request from 127.0.0.1
> 2007-02-23 12:45:17,109 INFO  NutchBean - query: ircam
> 2007-02-23 12:45:17,109 INFO  NutchBean - lang: fr
> 2007-02-23 12:45:17,109 INFO  NutchBean - searching for 20 raw hits
> 2007-02-23 12:45:17,109 INFO  NutchBean - total hits: 0

> Any idea why this works only from the command line and not with Tomcat?

> Thanks!

> MF 



> __________ NOD32 2076 (20070222) Information __________

> This message was checked by NOD32 antivirus system.
> http://www.eset.com




-- 
Regards,
 Dima                          mailto:nuther@proservice.ge