You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by bruce <be...@earthlink.net> on 2006/12/05 18:43:56 UTC
lucene/nutch investigation
anybody running lucene/nutch that i can talk to, to exchange information,
ideas.. i'm considering using lucene/nutch for a project, but i have zero
java experience. i'm around the cali/bay area.
the guy who was going to be provide the java experience oversold his
expertise.. so i might have to bite the bullet on this one.
thanks
-bruce
RE: lucene/nutch investigation
Posted by Fuad Efendi <fu...@efendi.ca>.
Nutch is mostly an Internet Search Engine, although experienced in Java
users may develop own protocol implementations as plugins to Nutch, and own
parsers in addition to HTML, PDF, even MPG...
It repeats many ideas of Google, including PageRank
It is difficult to implement Nutch without 'commercial support'...
-----Original Message-----
From: Phillip Rhodes [mailto:spamsucks@rhoderunner.com]
Sent: Tuesday, December 05, 2006 2:42 PM
To: nutch-user@lucene.apache.org
Subject: Re: lucene/nutch investigation
Bruce,
I recently wrestled with this same issue.
Nutch is good if you need something crawled. (e.g. apache web server,
file system)
Lucene is good if you need to index something that can't be crawled
(e.g. database)
While there are exceptions to the above, I would stick to that as a
general rule of thumb when evaluating lucene or nutch to use in a
project. Of course, an understanding of lucene will probably help out
with nutch.
IMO.
Phillip
Insurance Squared Inc. wrote:
> Hi Bruce,
>
> This list is not only very active - it's full of people constantly
> giving helpful, instructive answers. If you've got questions, this is
> the place.
>
> I would say based on my experience that nutch is a) excellent and b)
> not for the faint of heart when it comes to java - you'll need someone
> who knows what they're doing probably even to get it installed.
>
> g.
>
>
> bruce wrote:
>
>> anybody running lucene/nutch that i can talk to, to exchange
>> information,
>> ideas.. i'm considering using lucene/nutch for a project, but i have
>> zero
>> java experience. i'm around the cali/bay area.
>>
>> the guy who was going to be provide the java experience oversold his
>> expertise.. so i might have to bite the bullet on this one.
>>
>> thanks
>>
>> -bruce
>>
>>
>>
>>
>
>
>
Re: lucene/nutch investigation
Posted by Phillip Rhodes <sp...@rhoderunner.com>.
Bruce,
I recently wrestled with this same issue.
Nutch is good if you need something crawled. (e.g. apache web server,
file system)
Lucene is good if you need to index something that can't be crawled
(e.g. database)
While there are exceptions to the above, I would stick to that as a
general rule of thumb when evaluating lucene or nutch to use in a
project. Of course, an understanding of lucene will probably help out
with nutch.
IMO.
Phillip
Insurance Squared Inc. wrote:
> Hi Bruce,
>
> This list is not only very active - it's full of people constantly
> giving helpful, instructive answers. If you've got questions, this is
> the place.
>
> I would say based on my experience that nutch is a) excellent and b)
> not for the faint of heart when it comes to java - you'll need someone
> who knows what they're doing probably even to get it installed.
>
> g.
>
>
> bruce wrote:
>
>> anybody running lucene/nutch that i can talk to, to exchange
>> information,
>> ideas.. i'm considering using lucene/nutch for a project, but i have
>> zero
>> java experience. i'm around the cali/bay area.
>>
>> the guy who was going to be provide the java experience oversold his
>> expertise.. so i might have to bite the bullet on this one.
>>
>> thanks
>>
>> -bruce
>>
>>
>>
>>
>
>
>
Re: lucene/nutch investigation
Posted by "Insurance Squared Inc." <gc...@insurancesquared.com>.
Hi Bruce,
This list is not only very active - it's full of people constantly
giving helpful, instructive answers. If you've got questions, this is
the place.
I would say based on my experience that nutch is a) excellent and b) not
for the faint of heart when it comes to java - you'll need someone who
knows what they're doing probably even to get it installed.
g.
bruce wrote:
> anybody running lucene/nutch that i can talk to, to exchange information,
> ideas.. i'm considering using lucene/nutch for a project, but i have zero
> java experience. i'm around the cali/bay area.
>
> the guy who was going to be provide the java experience oversold his
> expertise.. so i might have to bite the bullet on this one.
>
> thanks
>
> -bruce
>
>
>
>