Posted to dev@nutch.apache.org by Mohammad Monirul Hoque <im...@yahoo.com> on 2008/07/31 13:52:18 UTC

problem in putting urls in dfs

Hi,

After installing Nutch on a single machine in pseudo-distributed mode, I ran into a problem while putting the urls directory (which contains a file, urllist.txt, of seed URLs) into DFS.

After starting the cluster, when I try to upload the directory to DFS, it shows:

$ bin/hadoop dfs -put urls urls

put: No valid local directories in property: dfs.client.buffer.dir


After that, when I type

$ bin/hadoop dfs -ls

Found 1 items
/user/nutch/urls        <dir>

But I can't see the contents of urls/urllist.txt in DFS using the -cat command.

$ bin/hadoop dfs -cat urls/*
cat: urls/urllist.txt

This would normally show the content of urllist.txt.

Here is my hadoop-site.xml:

<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

<!-- Put site-specific property overrides in this file. -->

<configuration>
<property>
  <name>fs.default.name</name>
  <value>sysmonitor:9000</value>
  <description>
    The name of the default file system. Either the literal string 
    "local" or a host:port for NDFS.
  </description>
</property>
<property>
  <name>mapred.job.tracker</name>
  <value>sysmonitor:9001</value>
  <description>
    The host and port that the MapReduce job tracker runs at. If 
    "local", then jobs are run in-process as a single map and 
    reduce task.
  </description>
</property>
<property>
  <name>mapred.tasktracker.tasks.maximum</name>
  <value>2</value>
  <description>
    The maximum number of tasks that will be run simultaneously by
    a task tracker. This should be adjusted according to the heap size
    per task, the amount of RAM available, and CPU consumption of each task.
  </description>
</property>
<property>
  <name>dfs.name.dir</name>
  <value>/nutch/filesystem/name</value>
</property>
<property>
  <name>dfs.data.dir</name>
  <value>/nutch/filesystem/data</value>
</property>

<property>
  <name>mapred.system.dir</name>
  <value>/nutch/filesystem/mapreduce/system</value>
</property>
<property>
  <name>mapred.local.dir</name>
  <value>/nutch/filesystem/mapreduce/local</value>
</property>

<property>
  <name>dfs.replication</name>
  <value>1</value>
</property>
</configuration>
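
The error names dfs.client.buffer.dir, which the file above does not set. As a sketch only, and assuming the cause is that the default location for this property (which, as far as I know, derives from hadoop.tmp.dir) cannot be created or written by the user running the client, an explicit override placed inside <configuration> might look like the following. The path /nutch/filesystem/buffer is a hypothetical example, not taken from this setup; it would need to be on the local disk and writable by that user.

```xml
<!-- Hypothetical override; goes inside the existing <configuration> element. -->
<property>
  <name>dfs.client.buffer.dir</name>
  <!-- Assumed example path: a local directory the client user can write to. -->
  <value>/nutch/filesystem/buffer</value>
  <description>
    Local directory where the DFS client buffers data before sending
    it to a datanode.
  </description>
</property>
```

Whether this is the actual cause here is not confirmed; checking the ownership and permissions of the existing local directories would be a reasonable first step.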

Please tell me if I'm missing something.
I'm using Nutch 0.9.

Thanks in advance

--monirul