You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by prashant_nutch <pr...@in.v2solutions.com> on 2007/03/23 07:25:14 UTC

Merging WebDBs

Hi,
i face problem like this (Nutch 0.8.1)
after crawling, i have two webdb, i want to merge them and after merging i
created new WebDB (comibine both)
then i want searching on them.

i created new webdb under which i create two folder crawldb and segments
(which is combination of two webdb),
but now i want create Linkdb and index....
how this can be created......i use command like this in eclipse program
argument(Windows)

 invertlinks linkdb segments/*  
i got error like

INFO  crawl.LinkDb - LinkDb: starting
INFO  crawl.LinkDb - LinkDb: linkdb: invertlinks
INFO  crawl.LinkDb - LinkDb: adding segment: linkdb
INFO  crawl.LinkDb - LinkDb: adding segment: segments/*
ERROR mapred.JobClient - Input directory
E:/Data/prashant/Projects/DummyNutch/Nutch/linkdb/parse_data in local is
invalid.
thanks in advance for help
-- 
View this message in context: http://www.nabble.com/Merging-WebDBs-tf3451970.html#a9629093
Sent from the Nutch - User mailing list archive at Nabble.com.


Re: Merging WebDBs

Posted by Sami Siren <ss...@gmail.com>.
2007/3/23, prashant_nutch <pr...@in.v2solutions.com>:
>
> i created new webdb under which i create two folder crawldb and segments
> (which is combination of two webdb),
> but now i want create Linkdb and index....
> how this can be created......i use command like this in eclipse program
> argument(Windows)
>
> invertlinks linkdb segments/*
> i got error like
>
> INFO  crawl.LinkDb - LinkDb: starting
> INFO  crawl.LinkDb - LinkDb: linkdb: invertlinks
> INFO  crawl.LinkDb - LinkDb: adding segment: linkdb
> INFO  crawl.LinkDb - LinkDb: adding segment: segments/*
> ERROR mapred.JobClient - Input directory
> E:/Data/prashant/Projects/DummyNutch/Nutch/linkdb/parse_data in local is
> invalid.
> thanks in advance for help



LinkDb treats the parameter invertlinks as the path to linkdb (the 1st
parameter), remove it and the command should succeed.

--
 Sami Siren