You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2019/01/21 11:03:33 UTC

[Nutch Wiki] Update of "IndexWriters" by SebastianNagel

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "IndexWriters" page has been changed by SebastianNagel:
https://wiki.apache.org/nutch/IndexWriters?action=diff&rev1=17&rev2=18

Comment:
CSV Indexer: add limitation that it can only be used in local mode

  || maxfieldlength || Max. length of a single field value in characters || 4096 ||
  || maxfieldvalues || Max. number of values of one field, useful for, e.g., the anchor texts field || 12 ||
  || header || Write CSV column headers || true ||
- || outpath || Output path / directory || csvindexwriter ||
+ || outpath || Output path / directory (local filesystem path, relative to current working directory)  || csvindexwriter ||
  
+ {{{#!wiki caution
+ The CSV indexer does not work in distributed mode, the output is written to the local filesystem, not to HDFS, see [[https://issues.apache.org/jira/browse/NUTCH-1541|NUTCH-1541]].
+ }}}
+