You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by "Håvard W. Kongsgård" <h....@niap.no> on 2006/07/20 10:34:03 UTC
Generate linkDb | hadoop/nutch 0.8
When I run “bin/nutch invertlinks linkdb segments” I get this error
Exception in thread "main" java.io.IOException: Input directory
/user/nutch/segments/parse_data in linux3:9000 is invalid.
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:212)
at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:316)
I have tried to create the directory segments/parse_data but still the
same error.
Re: Generate linkDb | hadoop/nutch 0.8
Posted by "Håvard W. Kongsgård" <h....@niap.no>.
Sami Siren wrote:
> try “bin/nutch invertlinks linkdb -dir segments”
>
> --
> Sami Siren
>
>
> Håvard W. Kongsgård wrote:
>
>> When I run “bin/nutch invertlinks linkdb segments” I get this error
>>
>> Exception in thread "main" java.io.IOException: Input directory
>> /user/nutch/segments/parse_data in linux3:9000 is invalid.
>>
>> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
>>
>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>>
>> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:212)
>>
>> at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:316)
>>
>>
>> I have tried to create the directory segments/parse_data but still
>> the same error.
>>
>>
>
>
Thanks it worked
Re: Generate linkDb | hadoop/nutch 0.8
Posted by Sami Siren <ss...@gmail.com>.
try “bin/nutch invertlinks linkdb -dir segments”
--
Sami Siren
Håvard W. Kongsgård wrote:
> When I run “bin/nutch invertlinks linkdb segments” I get this error
>
> Exception in thread "main" java.io.IOException: Input directory
> /user/nutch/segments/parse_data in linux3:9000 is invalid.
>
> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
>
> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>
> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:212)
>
> at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:316)
>
>
> I have tried to create the directory segments/parse_data but still the
> same error.
>
>
Re: Generate linkDb | hadoop/nutch 0.8
Posted by Zaheed Haque <za...@gmail.com>.
Hi:
Not sure if this is the case - are you running fetch with -noParsing
option being enabled? Then you need to run bin/nutch parse before you
run bin/nutch invertlinks.
Regards
Zaheed
On 7/20/06, "Håvard W. Kongsgård" <h....@niap.no> wrote:
> When I run "bin/nutch invertlinks linkdb segments" I get this error
>
> Exception in thread "main" java.io.IOException: Input directory
> /user/nutch/segments/parse_data in linux3:9000 is invalid.
>
> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
>
> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>
> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:212)
>
> at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:316)
>
>
> I have tried to create the directory segments/parse_data but still the
> same error.
>
>