You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Mohammad Monirul Hoque <im...@yahoo.com> on 2008/07/31 13:52:18 UTC
problem in putting urls in dfs
Hi,
After installing nutch on a single machine in a pseudo-distributed mode -i got problem while putting urls(which contains a file urllist.txt of seed urls ).
After starting the cluster when i try to upload a file in the DFS it shows
$ bin/hadoop dfs -put urls urls
put: No valid local directories in property: dfs.client.buffer.dir
after that when i type
$ bin/hadoop dfs -ls
Found 1 items
/user/nutch/urls <dir>
But i can't see the urls/urllist.txt file in DFS using the -cat command.
$ bin/hadoop dfs -cat urls/*
cat: urls/urllist.txt
which normally should show the content of urllist.txt.
Here is my hadoop-site.xml:
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!-- Put site-specific property overrides in this file. -->
<configuration>
<property>
<name>fs.default.name</name>
<value>sysmonitor:9000</value>
<description>
The name of the default file system. Either the literal string
"local" or a host:port for NDFS.
</description>
</property>
<property>
<name>mapred.job.tracker</name>
<value>sysmonitor:9001</value>
<description>
The host and port that the MapReduce job tracker runs at. If
"local", then jobs are run in-process as a single map and
reduce task.
</description>
</property>
<property>
<name>mapred.tasktracker.tasks.maximum</name>
<value>2</value>
<description>
The maximum number of tasks that will be run simultaneously by
a task tracker. This should be adjusted according to the heap size
per task, the amount of RAM available, and CPU consumption of each task.
</description>
</property>
<property>
<name>dfs.name.dir</name>
<value>/nutch/filesystem/name</value>
</property>
<property>
<name>dfs.data.dir</name>
<value>/nutch/filesystem/data</value>
</property>
<property>
<name>mapred.system.dir</name>
<value>/nutch/filesystem/mapreduce/system</value>
</property>
<property>
<name>mapred.local.dir</name>
<value>/nutch/filesystem/mapreduce/local</value>
</property>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
Tell me if i m missing something.
I m using nutch 0.9.
Thanks in advance
--monirul