Posted to user@nutch.apache.org by Richard Huang <ri...@gmail.com> on 2010/09/13 02:31:20 UTC

New to Nutch

Sorry for the SPAM.

I am new to Nutch. I want to set up a crawler, but I haven't figured out
how to get started. Could someone point me to a tutorial or demo so that I
can set up a quick environment and get a basic idea before I go deeper?

Sorry for the newbie question. I really appreciate your help.

Rg,
Richard

Re: New to Nutch

Posted by Israel <we...@gmail.com>.
I made a detailed tutorial:

http://www.box.net/shared/ph24ligzuk

It's great... yeah!





Re: New to Nutch

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Richard,

No worries! Glad to have you on board.

Here's a fairly decent tutorial on getting Nutch up and running quickly:

http://wiki.apache.org/nutch/NutchTutorial

The latest officially released version of Nutch is 1.1, available from:

http://www.apache.org/dyn/closer.cgi/nutch/

(NOTE: we are currently voting on releasing 1.2, see: [1])

The above tutorial is a bit out of date, but it should be enough to get you started at least, and then you can come back here and we'll fill in the gaps.

Cheers,
Chris

[1] http://s.apache.org/7od

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++