You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by derevo <da...@inbox.ru> on 2007/04/13 15:43:12 UTC

How to add ney segment to index

Hi, 
nutch version 8.1 and  9.0 
I'm trying, this steps: 

#mkdir dmoz
#bin/nutch inject crawl/crawldb dmoz
(in dmoz file urls whith my target sites)
#bin/nutch generate crawl/crawldb crawl/segments
#s1=`ls -d crawl/segments/2* | tail -1`
#bin/nutch fetch $s1
#bin/nutch updatedb crawl/crawldb $s1
#bin/nutch invertlinks crawl/linkdb $s1
#bin/nutch index crawl/indexes crawl/crawldb crawl/linkdb $s1

ok, now some pages in index. 

Next, i try to add new segment to index 

#bin/nutch generate crawl/crawldb crawl/segments -topN 1000
#s2=`ls -d crawl/segments/2* | tail -1`
#bin/nutch fetch $s2
#bin/nutch updatedb crawl/crawldb $s2
#bin/nutch invertlinks crawl/linkdb $s2


#bin/nutch index crawl/indexes crawl/crawldb crawl/linkdb $s2
Exception in thread "main" java.io.IOException: Output directory
/root/SE/SE_java/nutch-0.8.1/crawl/indexes already exists.

In version 7.2 it's simple to add by segments, but i'm dont understand how
do this in 8.1 and 9.0 versions

10x


-- 
View this message in context: http://www.nabble.com/How-to-add-ney-segment-to-index-tf3571707.html#a9979218
Sent from the Nutch - User mailing list archive at Nabble.com.