Posted to user@nutch.apache.org by Revathi R <re...@persistent.co.in> on 2013/01/25 12:49:42 UTC
Installation of NUTCH on windows7
Hello,
I am trying to install Nutch on Windows 7.
I got an error like this:
D:\Nutch-1\apache-nutch-1.6-bin\NUTCH TEST\Nutch 1.6\win32\bin>nutch crawl "D:\Nutch-1\apache-nutch-1.6-bin\NUTCH TEST\URLs" -dir "D:\Nutch-1\apache-nutch-1.6-bin\NUTCH TEST\Nutch 1.6\win32\bin"
File Not Found
The system cannot find the file specified.
solrUrl is not set, indexing will be skipped...
crawl started in: D:/Nutch-1/apache-nutch-1.6-bin/NUTCH TEST/Nutch 1.6/win32/bin
rootUrlDir = D:/Nutch-1/apache-nutch-1.6-bin/NUTCH TEST/URLs
threads = 10
depth = 5
solrUrl=null
Injector: starting at 2013-01-25 15:46:14
Injector: crawlDb: D:/Nutch-1/apache-nutch-1.6-bin/NUTCH TEST/Nutch 1.6/win32/bin/crawldb
Injector: urlDir: D:/Nutch-1/apache-nutch-1.6-bin/NUTCH TEST/URLs
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.io.IOException: Failed to set permissions of path: \tmp\hadoop-revathi_ramanadham\mapred\staging\revathi_ramanadham818841982\.staging to 0700
    at org.apache.hadoop.fs.FileUtil.checkReturnValue(FileUtil.java:689)
    at org.apache.hadoop.fs.FileUtil.setPermission(FileUtil.java:662)
    at org.apache.hadoop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:509)
    at org.apache.hadoop.fs.RawLocalFileSystem.mkdirs(RawLocalFileSystem.java:344)
    at org.apache.hadoop.fs.FilterFileSystem.mkdirs(FilterFileSystem.java:189)
    at org.apache.hadoop.mapreduce.JobSubmissionFiles.getStagingDir(JobSubmissionFiles.java:116)
    at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:856)
    at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:850)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
    at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:850)
    at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:824)
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1261)
    at org.apache.nutch.crawl.Injector.inject(Injector.java:281)
    at org.apache.nutch.crawl.Crawl.run(Crawl.java:127)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.Crawl.main(Crawl.java:55)
Regards,
Revathi R.
--
View this message in context: http://lucene.472066.n3.nabble.com/Installation-of-NUTCH-on-windows7-tp4036210.html
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Installation of NUTCH on windows7
Posted by 王亮 <be...@gmail.com>.
Hello, I need help building Nutch on Windows 7 in Eclipse.
I have modified ivy.xml:
<dependency org="org.apache.hadoop" name="hadoop-core"
rev="0.20.2" conf="*->default">
but after the build, Ivy still downloads hadoop-core-1.0.1.jar.
Why?
Sorry for my poor English.
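
One way to check which Hadoop jar a build actually puts on the classpath is to ask the JVM where a Hadoop class was loaded from. The following is only a sketch: the class name JarLocator and its methods are made up for illustration, and org.apache.hadoop.fs.FileUtil is used as the default only because it appears in the stack trace earlier in this thread. It does not explain why Ivy resolved a different revision; it only shows which jar ended up being used.

```java
import java.security.CodeSource;

// Hypothetical helper, not part of Nutch or Hadoop.
class JarLocator {
    /** Returns the location a class was loaded from, or a marker if none is reported. */
    static String locate(Class<?> cls) {
        CodeSource source = cls.getProtectionDomain().getCodeSource();
        if (source == null || source.getLocation() == null) {
            // JDK classes loaded by the bootstrap loader usually have no code source.
            return "(no code source reported)";
        }
        return source.getLocation().toString();
    }

    /** Looks the class up by name without initializing it. */
    static String locateByName(String className) {
        try {
            Class<?> cls = Class.forName(className, false, JarLocator.class.getClassLoader());
            return locate(cls);
        } catch (ClassNotFoundException e) {
            return "(class not on classpath)";
        }
    }

    public static void main(String[] args) {
        // Default class name taken from the stack trace in this thread.
        String name = args.length > 0 ? args[0] : "org.apache.hadoop.fs.FileUtil";
        System.out.println(name + " -> " + locateByName(name));
    }
}
```

Run it with the same classpath as the Nutch launch configuration in Eclipse; the printed location should name the hadoop-core jar that is really in use.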
2013/6/5 Aldrin Leal <al...@leal.eng.br>
> Install the Microsoft Hadoop Distro, http://www.microsoft.com/bigdata/, and
> see if hadoop jar gets any better please :)
Re: Installation of NUTCH on windows7
Posted by Aldrin Leal <al...@leal.eng.br>.
Install the Microsoft Hadoop Distro, http://www.microsoft.com/bigdata/, and
see if hadoop jar gets any better please :)
--
-- Aldrin Leal, <al...@leal.eng.br>
Master your EC2-fu! Get the latest ekaterminal public beta
http://www.ingenieux.com.br/products/ekaterminal/
On Wed, Jun 5, 2013 at 6:33 AM, Shobha <sh...@gmail.com> wrote:
> Changing the hadoop jar file to a lower version solved the issue
>
> I removed hadoop-core-1.0.3.jar from the lib folder and replaced it with
> hadoop-core-0.20.2.jar file
>
>
> It works for me
Re: Installation of NUTCH on windows7
Posted by Shobha <sh...@gmail.com>.
Changing the Hadoop jar file to a lower version solved the issue.
I removed hadoop-core-1.0.3.jar from the lib folder and replaced it with
hadoop-core-0.20.2.jar.
It works for me.
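
After swapping the jar by hand it may be worth confirming that only one hadoop-core jar is left in the lib folder. A minimal sketch, assuming the jars follow the hadoop-core-<version>.jar naming used above; the class name HadoopCoreJars and the default folder name "lib" are illustrative assumptions, not part of Nutch.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Nutch or Hadoop.
class HadoopCoreJars {
    private static final Pattern JAR = Pattern.compile("hadoop-core-(.+)\\.jar");

    /** Returns the version strings of all hadoop-core jars directly inside libDir, sorted as text. */
    static List<String> versions(Path libDir) throws IOException {
        List<String> found = new ArrayList<>();
        try (DirectoryStream<Path> jars = Files.newDirectoryStream(libDir, "hadoop-core-*.jar")) {
            for (Path jar : jars) {
                Matcher m = JAR.matcher(jar.getFileName().toString());
                if (m.matches()) {
                    found.add(m.group(1));
                }
            }
        }
        Collections.sort(found);
        return found;
    }

    public static void main(String[] args) throws IOException {
        Path lib = Paths.get(args.length > 0 ? args[0] : "lib");
        List<String> versions = versions(lib);
        System.out.println("hadoop-core versions in " + lib + ": " + versions);
        if (versions.size() > 1) {
            System.out.println("More than one hadoop-core jar found; the one actually loaded depends on classpath order.");
        }
    }
}
```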
--
View this message in context: http://lucene.472066.n3.nabble.com/Installation-of-NUTCH-on-windows7-tp4036210p4068287.html
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Installation of NUTCH on windows7
Posted by peterbarretto <pe...@gmail.com>.
Hi,
Changing the Hadoop jar file to a lower version solved the issue.
I removed hadoop-core-1.0.3.jar from the lib folder and replaced it with
hadoop-core-0.20.2.jar.
Sebastian Nagel wrote
> Hi,
>
> that's a known problem with Hadoop on Windows / Cygwin:
>
> https://issues.apache.org/jira/browse/HADOOP-7682
>
> I don't know whether there is a reliable fix
> or a workaround, but you should search for the error
> - you are not alone ;-)
>
> Sebastian
--
View this message in context: http://lucene.472066.n3.nabble.com/Installation-of-NUTCH-on-windows7-tp4036210p4036404.html
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Installation of NUTCH on windows7
Posted by Sebastian Nagel <wa...@googlemail.com>.
Hi,
that's a known problem with Hadoop on Windows / Cygwin:
https://issues.apache.org/jira/browse/HADOOP-7682
I don't know whether there is a reliable fix
or a workaround, but you should search for the error
- you are not alone ;-)
Sebastian
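
The stack trace shows FileUtil.setPermission calling FileUtil.checkReturnValue, which suggests that a permission call reported failure. The sketch below is not Hadoop's code; it is a hypothetical probe (the class name PermissionProbe is made up) that applies owner-only permissions, roughly what 0700 means, through the java.io.File setters and reports whether every call succeeded. The java.io.File documentation notes that clearing read or execute permission fails on file systems that do not implement that permission, so on Windows this probe is likely to print false, while on a POSIX file system it should print true.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;

// Hypothetical probe, not taken from Hadoop.
class PermissionProbe {
    /**
     * Tries to give the owner read/write/execute and everyone else nothing.
     * Returns true only if every java.io.File setter reported success.
     */
    static boolean setOwnerOnly(File dir) {
        boolean ok = true;
        // For each permission: clear it for everyone, then grant it to the owner only.
        ok &= dir.setReadable(false, false);
        ok &= dir.setReadable(true, true);
        ok &= dir.setWritable(false, false);
        ok &= dir.setWritable(true, true);
        ok &= dir.setExecutable(false, false);
        ok &= dir.setExecutable(true, true);
        return ok;
    }

    public static void main(String[] args) throws IOException {
        File dir = Files.createTempDirectory("staging-probe").toFile();
        System.out.println("os.name = " + System.getProperty("os.name"));
        System.out.println("owner-only permissions applied: " + setOwnerOnly(dir));
    }
}
```

If the probe prints false on the machine that runs Nutch, the failure is in the platform's permission support rather than in the crawl arguments.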