Posted to user@nutch.apache.org by Revathi R <re...@persistent.co.in> on 2013/01/25 12:49:42 UTC

Installation of NUTCH on windows7

Hello,


I am trying to install Nutch on Windows 7.

I got an error like this:

D:\Nutch-1\apache-nutch-1.6-bin\NUTCH TEST\Nutch 1.6\win32\bin>nutch crawl "D:\Nutch-1\apache-nutch-1.6-bin\NUTCH TEST\URLs" -dir "D:\Nutch-1\apache-nutch-1.6-bin\NUTCH TEST\Nutch 1.6\win32\bin"
File Not Found
The system cannot find the file specified.
solrUrl is not set, indexing will be skipped...
crawl started in: D:/Nutch-1/apache-nutch-1.6-bin/NUTCH TEST/Nutch 1.6/win32/bin
rootUrlDir = D:/Nutch-1/apache-nutch-1.6-bin/NUTCH TEST/URLs
threads = 10
depth = 5
solrUrl=null
Injector: starting at 2013-01-25 15:46:14
Injector: crawlDb: D:/Nutch-1/apache-nutch-1.6-bin/NUTCH TEST/Nutch 1.6/win32/bin/crawldb
Injector: urlDir: D:/Nutch-1/apache-nutch-1.6-bin/NUTCH TEST/URLs
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.io.IOException: Failed to set permissions of path: \tmp\hadoop-revathi_ramanadham\mapred\staging\revathi_ramanadham818841982\.staging to 0700
        at org.apache.hadoop.fs.FileUtil.checkReturnValue(FileUtil.java:689)
        at org.apache.hadoop.fs.FileUtil.setPermission(FileUtil.java:662)
        at org.apache.hadoop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:509)
        at org.apache.hadoop.fs.RawLocalFileSystem.mkdirs(RawLocalFileSystem.java:344)
        at org.apache.hadoop.fs.FilterFileSystem.mkdirs(FilterFileSystem.java:189)
        at org.apache.hadoop.mapreduce.JobSubmissionFiles.getStagingDir(JobSubmissionFiles.java:116)
        at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:856)
        at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:850)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
        at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:850)
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:824)
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1261)
        at org.apache.nutch.crawl.Injector.inject(Injector.java:281)
        at org.apache.nutch.crawl.Crawl.run(Crawl.java:127)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:55)


Regards,
Revathi R.






--
View this message in context: http://lucene.472066.n3.nabble.com/Installation-of-NUTCH-on-windows7-tp4036210.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Installation of NUTCH on windows7

Posted by 王亮 <be...@gmail.com>.
Hello, I need help building Nutch on Windows 7 in Eclipse.

I have modified ivy.xml:
    <dependency org="org.apache.hadoop" name="hadoop-core"
      rev="0.20.2" conf="*->default">

But after the build, Ivy downloads hadoop-core-1.0.1.jar.

Why?


Sorry for my poor English.


2013/6/5 Aldrin Leal <al...@leal.eng.br>

> Install the Microsoft Hadoop Distro, http://www.microsoft.com/bigdata/, and
> see if hadoop jar gets any better please :)

Re: Installation of NUTCH on windows7

Posted by Aldrin Leal <al...@leal.eng.br>.
Install the Microsoft Hadoop Distro, http://www.microsoft.com/bigdata/, and
see if hadoop jar gets any better please :)

--
-- Aldrin Leal, <al...@leal.eng.br>
Master your EC2-fu! Get the latest ekaterminal public beta
http://www.ingenieux.com.br/products/ekaterminal/


On Wed, Jun 5, 2013 at 6:33 AM, Shobha <sh...@gmail.com> wrote:

> Changing the hadoop jar file to a lower version solved the issue
>
> I removed hadoop-core-1.0.3.jar from the lib folder and replaced it with
> hadoop-core-0.20.2.jar file
>
>
> It works for me

Re: Installation of NUTCH on windows7

Posted by Shobha <sh...@gmail.com>.
Changing the Hadoop jar file to a lower version solved the issue.

I removed hadoop-core-1.0.3.jar from the lib folder and replaced it with
hadoop-core-0.20.2.jar.


It works for me.
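
When swapping the jar as described, it may help to confirm which hadoop-core jar actually sits in the lib folder before and after the change. The following is only a minimal sketch: the class name HadoopJarCheck, the method hadoopCoreJars, and the default folder name "lib" are assumptions for illustration and are not part of Nutch.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper for illustration; it is not part of Nutch or Hadoop.
class HadoopJarCheck {

    // Returns the sorted names of all hadoop-core-*.jar files directly
    // inside libDir. Returns an empty list if libDir cannot be listed.
    static List<String> hadoopCoreJars(File libDir) {
        List<String> found = new ArrayList<>();
        String[] names = libDir.list(); // null if libDir is not a readable directory
        if (names != null) {
            for (String name : names) {
                if (name.startsWith("hadoop-core-") && name.endsWith(".jar")) {
                    found.add(name);
                }
            }
        }
        Collections.sort(found);
        return found;
    }

    public static void main(String[] args) {
        // "lib" is an assumed default; pass the real lib folder as an argument.
        File libDir = new File(args.length > 0 ? args[0] : "lib");
        List<String> jars = hadoopCoreJars(libDir);
        System.out.println("hadoop-core jars in " + libDir + ": " + jars);
        if (jars.size() > 1) {
            // The message above describes replacing the jar, not adding a second one.
            System.out.println("More than one hadoop-core version is present.");
        }
    }
}
```

On a current JDK this can be run directly as a single source file, passing the path of the Nutch lib folder as the only argument.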





Re: Installation of NUTCH on windows7

Posted by peterbarretto <pe...@gmail.com>.
Hi,

Changing the Hadoop jar file to a lower version solved the issue.

I removed hadoop-core-1.0.3.jar from the lib folder and replaced it with
hadoop-core-0.20.2.jar.


Sebastian Nagel wrote
> Hi,
> 
> that's a known problem with Hadoop on Windows / Cygwin:
> 
> https://issues.apache.org/jira/browse/HADOOP-7682
> 
> I don't know whether there is a reliable fix
> or a workaround, but you should search for the error
> - you are not alone ;-)
> 
> Sebastian

Re: Installation of NUTCH on windows7

Posted by Sebastian Nagel <wa...@googlemail.com>.
Hi,

that's a known problem with Hadoop on Windows / Cygwin:

https://issues.apache.org/jira/browse/HADOOP-7682

I don't know whether there is a reliable fix
or a workaround, but you should search for the error
- you are not alone ;-)

Sebastian
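
To check whether a given machine is affected, a small probe along the following lines may help. It is a sketch under an assumption that the thread itself does not confirm: that the FileUtil.setPermission and checkReturnValue frames in the stack trace fail because a java.io.File permission setter returns false. The java.io.File documentation does say that setReadable(false, ...) and setExecutable(false, ...) fail when the underlying file system does not implement that permission. The class name PermissionProbe and both helper methods are hypothetical and not part of Hadoop or Nutch.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;

// Hypothetical diagnostic class; it is not part of Nutch or Hadoop.
class PermissionProbe {

    // Attempts owner-only access (the "0700" in the error message) using
    // plain java.io.File calls. Returns false if any call reports failure.
    static boolean setOwnerOnly(File dir) {
        boolean ok = true;
        // Clear each permission for everybody, then grant it to the owner.
        // '&=' does not short-circuit, so all six calls are always made.
        ok &= dir.setReadable(false, false);
        ok &= dir.setReadable(true, true);
        ok &= dir.setWritable(false, false);
        ok &= dir.setWritable(true, true);
        ok &= dir.setExecutable(false, false);
        ok &= dir.setExecutable(true, true);
        return ok;
    }

    // Runs the probe against a fresh temporary directory.
    static boolean probeTempDir() {
        try {
            File dir = Files.createTempDirectory("perm-probe").toFile();
            dir.deleteOnExit();
            return setOwnerOnly(dir);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("owner-only permissions applied: " + probeTempDir());
    }
}
```

If the probe prints false, the local file system rejected at least one of the permission changes, which would be consistent with the failure reported above.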
