Posted to user@nutch.apache.org by "Meiping Wang(Amelia)" <me...@hengtiansoft.com> on 2013/02/01 07:56:10 UTC

Nutch Configuration Problems

Environment: Nutch 2.1, Cygwin (added to the PATH), Windows, Eclipse, JDK 1.6

I followed the NutchInEclipse tutorial on the Nutch Wiki, but the following error occurs when I run the Java project:

solrUrl is not set, indexing will be skipped...
crawl started in: crawl
rootUrlDir = urls
threads = 10
depth = 3
solrUrl=null
topN = 5
Injector: starting at 2013-02-01 14:45:43
Injector: crawlDb: crawl/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.io.IOException: Failed to set permissions of path: \tmp\hadoop-test\mapred\staging\test2083949620\.staging to 0700
   at org.apache.hadoop.fs.FileUtil.checkReturnValue(FileUtil.java:689)
   at org.apache.hadoop.fs.FileUtil.setPermission(FileUtil.java:662)
   at org.apache.hadoop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:509)
   at org.apache.hadoop.fs.RawLocalFileSystem.mkdirs(RawLocalFileSystem.java:344)
   at org.apache.hadoop.fs.FilterFileSystem.mkdirs(FilterFileSystem.java:189)
   at org.apache.hadoop.mapreduce.JobSubmissionFiles.getStagingDir(JobSubmissionFiles.java:116)
   at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:918)
   at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:912)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:396)
   at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1136)
   at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:912)
   at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:886)
   at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1323)
   at org.apache.nutch.crawl.Injector.inject(Injector.java:281)
   at org.apache.nutch.crawl.Crawl.run(Crawl.java:127)
   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
   at org.apache.nutch.crawl.Crawl.main(Crawl.java:55)

Can you give me some suggestions?
Thanks.
Best Regards
Amelia

RE: Nutch Configuration Problems

Posted by "Meiping Wang(Amelia)" <me...@hengtiansoft.com>.
Thanks. I'll try changing Hadoop.

Best Regards
Amelia (Meiping Wang)
MSN: ameliawmp@hotmail.com
ACS InSigma HengTian Software Ltd. 

-----Original Message-----
From: feng lu [mailto:amuseme.lu@gmail.com] 
Sent: February 1, 2013 16:53
To: user@nutch.apache.org
Subject: Re: Nutch Configuration Problems

On Fri, Feb 1, 2013 at 2:56 PM, Meiping Wang(Amelia) <
meipingwang@hengtiansoft.com> wrote:

> Failed to set permissions of path


Hi Meiping

see http://comments.gmane.org/gmane.comp.jakarta.lucene.hadoop.user/25837


-- 
Don't Grow Old, Grow Up... :-)
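
For readers hitting the same exception: the stack trace shows FileUtil.setPermission calling FileUtil.checkReturnValue, which suggests a permission call returned false and was turned into an IOException. The sketch below is not Hadoop's code. It is a minimal illustration of that pattern using only java.io.File, and the class name PermissionCheckSketch and the method setOwnerOnly are hypothetical. The assumption is that on Windows java.io.File cannot revoke read permission, so setReadable(false, ...) returns false and the check fails.

```java
import java.io.File;
import java.io.IOException;

// Hypothetical sketch; not Hadoop's FileUtil, only an illustration of the pattern.
public class PermissionCheckSketch {

    // Turns a "false" result from a java.io.File permission setter into an
    // IOException, with a message shaped like the one in the stack trace.
    static void checkReturnValue(boolean ok, File path, String mode) throws IOException {
        if (!ok) {
            throw new IOException("Failed to set permissions of path: " + path + " to " + mode);
        }
    }

    // Tries to make a directory owner-only (0700-style) with java.io.File calls:
    // first clear each permission for everyone, then grant it to the owner.
    static void setOwnerOnly(File dir) throws IOException {
        checkReturnValue(dir.setReadable(false, false), dir, "0700");
        checkReturnValue(dir.setReadable(true, true), dir, "0700");
        checkReturnValue(dir.setWritable(false, false), dir, "0700");
        checkReturnValue(dir.setWritable(true, true), dir, "0700");
        checkReturnValue(dir.setExecutable(false, false), dir, "0700");
        checkReturnValue(dir.setExecutable(true, true), dir, "0700");
    }

    public static void main(String[] args) {
        // Scratch directory under the JVM's temp dir (the name is arbitrary).
        File dir = new File(System.getProperty("java.io.tmpdir"),
                "staging-sketch-" + System.nanoTime());
        if (!dir.mkdirs()) {
            System.out.println("Could not create " + dir);
            return;
        }
        try {
            setOwnerOnly(dir);
            System.out.println("Permissions set on " + dir);
        } catch (IOException e) {
            // Reached when the platform refuses one of the permission changes.
            System.out.println(e.getMessage());
        } finally {
            dir.delete();
        }
    }
}
```

Running this on the affected Windows machine should show whether the platform refuses the permission change outside of Hadoop, which helps separate a Nutch configuration issue from a Hadoop/Windows compatibility issue.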
