You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2007/05/14 20:07:25 UTC
[Nutch Wiki] Update of "Upgrading from 0.8.x to 0.9" by NicolásLichtmaier
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by NicolásLichtmaier:
http://wiki.apache.org/nutch/Upgrading_from_0%2e8%2ex_to_0%2e9
The comment on the change is:
Removed backticks, which are wrong here.
------------------------------------------------------------------------------
* if you ran your fetcher in parsing mode, or if you already parsed the segments, then for each segment you need to first remove directories containing parsed content:
{{{
- for i in `segments/2006*`
+ for i in segments/2006* ; do
- do
(cd $i && rm -rf crawl_parse parse_data parse_text)
done
}}}
* then you can use 'mergesegs' command, which was modified to perform the conversion of remaining segment parts, e.g. like this:
{{{
mkdir converted
- for i in `segments/2006*`
+ for i in segments/2006* ; do
- do
nutch mergesegs converted $i
done
}}}
@@ -40, +38 @@
* finally, you will need to re-parse converted segments:
{{{
- for i in `converted/*`
+ for i in converted/* ; do
- do
nutch parse $i
done
}}}