Posted to dev@nutch.apache.org by bruce <be...@earthlink.net> on 2006/12/05 18:43:56 UTC

lucene/nutch investigation

anybody running lucene/nutch that i can talk to, to exchange information,
ideas.. i'm considering using lucene/nutch for a project, but i have zero
java experience. i'm around the cali/bay area.

the guy who was going to provide the java experience oversold his
expertise.. so i might have to bite the bullet on this one.

thanks

-bruce



RE: lucene/nutch investigation

Posted by Fuad Efendi <fu...@efendi.ca>.
Nutch is mostly an Internet search engine, although users experienced in
Java can develop their own protocol implementations as Nutch plugins, and
their own parsers in addition to HTML, PDF, even MPG...
It borrows many ideas from Google, including PageRank-style link analysis.
It is difficult to implement Nutch without 'commercial support'...




Re: lucene/nutch investigation

Posted by Phillip Rhodes <sp...@rhoderunner.com>.
Bruce,

I recently wrestled with this same issue.

Nutch is good if you need something crawled (e.g. an Apache web server or a
file system).
Lucene is good if you need to index something that can't be crawled
(e.g. a database).

While there are exceptions to the above, I would stick to that as a
general rule of thumb when deciding whether to use Lucene or Nutch in a
project.  Of course, an understanding of Lucene will probably help out
with Nutch.
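
To make the rule of thumb concrete: with a database there are no links to
follow, so your own code reads the rows and hands them to the indexer. The
sketch below is a toy inverted index in plain Java, not Lucene's API. The
class name ToyIndex and the sample rows are made up for illustration; a real
project would presumably read the rows over JDBC and pass them to Lucene
instead.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Toy inverted index: NOT Lucene's API, only an illustration of the idea.
public class ToyIndex {
    // term -> ids of the records that contain it
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    // Add one record (e.g. a row read from a database) under the given id.
    public void add(int id, String text) {
        for (String term : text.toLowerCase().split("\\W+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, k -> new TreeSet<>()).add(id);
            }
        }
    }

    // Return the ids of the records containing the term, in ascending order.
    public Set<Integer> search(String term) {
        return postings.getOrDefault(term.toLowerCase(), Collections.emptySet());
    }

    public static void main(String[] args) {
        // Hypothetical rows; in practice these would come from a database query.
        Map<Integer, String> rows = new LinkedHashMap<>();
        rows.put(1, "Term life insurance quote");
        rows.put(2, "Auto insurance for the Bay Area");
        rows.put(3, "Search engine basics");

        ToyIndex index = new ToyIndex();
        for (Map.Entry<Integer, String> row : rows.entrySet()) {
            index.add(row.getKey(), row.getValue());
        }

        // Print the ids of the rows that mention "insurance".
        System.out.println(index.search("insurance"));
    }
}
```

The point is only that the application pushes records into the index itself,
whereas a crawler like Nutch discovers and fetches the content for you.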

IMO.
Phillip





Re: lucene/nutch investigation

Posted by "Insurance Squared Inc." <gc...@insurancesquared.com>.
Hi Bruce,

This list is not only very active - it's full of people constantly 
giving helpful, instructive answers.  If you've got questions, this is 
the place.

I would say, based on my experience, that nutch is a) excellent and b) not 
for the faint of heart when it comes to java - you'll probably need someone 
who knows what they're doing even to get it installed.

g.

