You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2019/01/21 11:03:33 UTC
[Nutch Wiki] Update of "IndexWriters" by SebastianNagel
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "IndexWriters" page has been changed by SebastianNagel:
https://wiki.apache.org/nutch/IndexWriters?action=diff&rev1=17&rev2=18
Comment:
CSV Indexer: add limitation that it can only be used in local mode
|| maxfieldlength || Max. length of a single field value in characters || 4096 ||
|| maxfieldvalues || Max. number of values of one field, useful for, e.g., the anchor texts field || 12 ||
|| header || Write CSV column headers || true ||
- || outpath || Output path / directory || csvindexwriter ||
+ || outpath || Output path / directory (local filesystem path, relative to current working directory) || csvindexwriter ||
+ {{{#!wiki caution
+ The CSV indexer does not work in distributed mode, the output is written to the local filesystem, not to HDFS, see [[https://issues.apache.org/jira/browse/NUTCH-1541|NUTCH-1541]].
+ }}}
+