You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Bernd Fehling <be...@uni-bielefeld.de> on 2006/02/05 18:35:09 UTC
Installing nutch
Hi list,
I came across nutch while looking for search engines
and that nutch with its NDFS is very interesting to me.
A basic question:
Is it possible to install nutch with NDFS on a single machine
or do I need at least two maschines?
I followed the instructions from Stefan Groschupf which helped
a lot but still makes some trouble.
The installation and setup instructions are OK.
Before installing the user interface I tried to create a searchable
index. As far as I can see the "admin" command has been removed
from nutch version 0.8?
So I tried "quick tutorial for nutch 0.8" but this does not work.
Using "bin/nutch ndfs -mkdir urls" makes no directory.
How do I get a searchable index?
Best regards,
Bernd
Re: Installing nutch
Posted by Owen O'Malley <ow...@yahoo-inc.com>.
On Feb 5, 2006, at 9:35 AM, Bernd Fehling wrote:
>
> A basic question:
> Is it possible to install nutch with NDFS on a single machine
> or do I need at least two maschines?
Yes, it is possible. I just ran a Hadoop map/reduce example on a
single machine using Hadoop DFS. On a single node, I ran one instance
of all 4 servers (namenode, datanode, jobtracker, and tasktracker). I
was able to run a map/reduce application with reading the inputs from
DFS and writing the output to DFS.
Note that effectively, this configuration is only useful for testing
because you are wasting time using the distributed framework for a
single node. As a test, it was very useful. *smile*
Note that you do want to change the value of dfs.replication to 1.
I've never run the indexing part of Nutch, so I can't help you on that
side. The Hadoop framework works fine in that configuration.
-- Owen