You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by "Håvard W. Kongsgård" <h....@niap.no> on 2006/07/20 10:34:03 UTC

Generate linkDb | hadoop/nutch 0.8

When I run “bin/nutch invertlinks linkdb segments” I get this error

Exception in thread "main" java.io.IOException: Input directory 
/user/nutch/segments/parse_data in linux3:9000 is invalid.

at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)

at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)

at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:212)

at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:316)


I have tried to create the directory segments/parse_data but still the 
same error.


Re: Generate linkDb | hadoop/nutch 0.8

Posted by "Håvard W. Kongsgård" <h....@niap.no>.
Sami Siren wrote:
> try “bin/nutch invertlinks linkdb -dir segments”
>
> -- 
> Sami Siren
>
>
> Håvard W. Kongsgård wrote:
>
>> When I run “bin/nutch invertlinks linkdb segments” I get this error
>>
>> Exception in thread "main" java.io.IOException: Input directory 
>> /user/nutch/segments/parse_data in linux3:9000 is invalid.
>>
>> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
>>
>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>>
>> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:212)
>>
>> at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:316)
>>
>>
>> I have tried to create the directory segments/parse_data but still 
>> the same error.
>>
>>
>
>
Thanks it worked


Re: Generate linkDb | hadoop/nutch 0.8

Posted by Sami Siren <ss...@gmail.com>.
try “bin/nutch invertlinks linkdb -dir segments”

--
 Sami Siren


Håvard W. Kongsgård wrote:

> When I run “bin/nutch invertlinks linkdb segments” I get this error
>
> Exception in thread "main" java.io.IOException: Input directory 
> /user/nutch/segments/parse_data in linux3:9000 is invalid.
>
> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
>
> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>
> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:212)
>
> at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:316)
>
>
> I have tried to create the directory segments/parse_data but still the 
> same error.
>
>


Re: Generate linkDb | hadoop/nutch 0.8

Posted by Zaheed Haque <za...@gmail.com>.
Hi:

Not sure if this is the case - are you running fetch with -noParsing
option being enabled? Then you need to run bin/nutch parse before you
run bin/nutch invertlinks.

Regards
Zaheed

On 7/20/06, "Håvard W. Kongsgård" <h....@niap.no> wrote:
> When I run "bin/nutch invertlinks linkdb segments" I get this error
>
> Exception in thread "main" java.io.IOException: Input directory
> /user/nutch/segments/parse_data in linux3:9000 is invalid.
>
> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
>
> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>
> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:212)
>
> at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:316)
>
>
> I have tried to create the directory segments/parse_data but still the
> same error.
>
>