You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2005/07/17 02:59:12 UTC

[Nutch Wiki] Trivial Update of "bin/nutch segslice" by RobPettengill

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by RobPettengill:
http://wiki.apache.org/nutch/bin/nutch_segslice

------------------------------------------------------------------------------
  
  Data is read sequentially from input segments, and appended to output segment until it reaches the target count of entries, at which point the next output segment is created, and so on.
  
- NOTE 1: this tool does NOT de-duplicate data - use SegmentMergeTool for that.
+ NOTE 1: this tool does NOT de-duplicate data - use !SegmentMergeTool for that.
  
  NOTE 2: this tool does NOT copy indexes. It is currently impossible to slice Lucene indexes. The proper procedure is first to create slices, and then to index them.