Posted to user@nutch.apache.org by Michael Gang <mi...@gmail.com> on 2013/01/08 13:07:42 UTC

differences between nutch 1 and nutch 2

Hi all,

While trying to solve problems with Nutch 2, I search the Nutch
wiki and mailing list, but most of the responses are for Nutch 1.
My question is: in which areas did the interface of Nutch change?
Are the configuration files the same? Is the only difference in the
internal implementation?

Thanks,
David

Re: differences between nutch 1 and nutch 2

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Hi David,

The best resources we have for this can be found on the wiki. These explain
quite a bit about the respective Nutch tools (Injector, Generator, etc.)
and how they are implemented in 2.x.

http://wiki.apache.org/nutch/Nutch2Crawling

On Tue, Jan 8, 2013 at 4:07 AM, Michael Gang <mi...@gmail.com> wrote:

> My question is, in which area did the interface of nutch change.
>

The interface was deprecated after the release of Nutch 1.2 in favor of
using Apache Solr, mainly for indexing but also for the GUI you can put
on top of it.

> Are the configuration files the same.


Mostly. There are some differences: Nutch 1.x is a more intricate beast
with, dare I say it, more functionality (properties) for fine-tuning your
Nutch server. Sebastian provided us with an excellent page explaining the
differences in properties between the two [0]. It can be considered a
complete list of the current features.
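
To check which properties differ between two installs, one option is to
compare the property names in each version's nutch-default.xml. Below is a
minimal Python sketch, assuming both files use the Hadoop-style
<configuration>/<property>/<name> layout. The two sample documents and the
example.* property names are made up for illustration; only http.agent.name
is a real Nutch property.

```python
import xml.etree.ElementTree as ET


def property_names(xml_text):
    """Return the set of <name> values found in a Hadoop-style config document."""
    root = ET.fromstring(xml_text)
    names = set()
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name:
            names.add(name.strip())
    return names


def compare(conf_1x, conf_2x):
    """Return (names only in the first config, names only in the second)."""
    first, second = property_names(conf_1x), property_names(conf_2x)
    return sorted(first - second), sorted(second - first)


# Hypothetical stand-ins for the two nutch-default.xml files.
SAMPLE_1X = """<configuration>
  <property><name>http.agent.name</name><value></value></property>
  <property><name>example.only.in.1x</name><value>true</value></property>
</configuration>"""

SAMPLE_2X = """<configuration>
  <property><name>http.agent.name</name><value></value></property>
  <property><name>example.only.in.2x</name><value>true</value></property>
</configuration>"""

only_1x, only_2x = compare(SAMPLE_1X, SAMPLE_2X)
print("only in 1.x:", only_1x)
print("only in 2.x:", only_2x)
```

For real files, read each one into a string and pass both to compare().
This only compares property names, not default values or descriptions.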


> Is the only difference in the
> internal implementation ?
>

I would advise you to watch Julien's previous talks at Berlin Buzzwords and
the various presentations which are floating around. Essentially, they
explain the core differences between the 1.x trunk and the 2.x branch.


>
> Thanks,
> David
>

[0] http://wiki.apache.org/nutch/NutchPropertiesCompleteList

-- 
*Lewis*