Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2014/09/23 03:22:51 UTC
[Nutch Wiki] Trivial Update of "Nutch2Tutorial" by LewisJohnMcgibbney
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "Nutch2Tutorial" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/Nutch2Tutorial?action=diff&rev1=10&rev2=11
This document describes how to get Nutch 2.0 to use HBase as a storage backend for Gora.
* Grab the latest distribution of Nutch 2.X from [[http://www.apache.org/dyn/closer.cgi/nutch/|here]]
- * Install and configure HBase. You can get it [[http://archive.apache.org/dist/hbase/|here]] ('''N.B.''' Gora 0.3 uses HBase 0.90.4, however the setup is known to work with more recent versions of the HBase 0.90.x branch)
+ * Install and configure HBase. You can get it [[http://archive.apache.org/dist/hbase/hbase-0.94.14/|here]] ('''N.B.''' Gora 0.4 uses HBase 0.94.14; we therefore suggest you use this version if possible.)
* Specify the GORA backend in nutch-site.xml
{{{
@@ -22, +22 @@
{{{
<!-- Uncomment this to use HBase as Gora backend. -->
- <dependency org="org.apache.gora" name="gora-hbase" rev="0.3" conf="*->default" />
+ <dependency org="org.apache.gora" name="gora-hbase" rev="0.4" conf="*->default" />
}}}
* Ensure that HBaseStore is set as the default datastore in gora.properties
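A minimal sketch of what that last step might look like, assuming the file is the stock conf/gora.properties shipped with Nutch 2.X and that no other default datastore is already set there (neither detail is shown in this diff):
{{{
# Assumed location: conf/gora.properties
# Make HBaseStore the default Gora datastore
gora.datastore.default=org.apache.gora.hbase.store.HBaseStore
}}}
If another gora.datastore.default line is already present in the file, it would presumably need to be replaced or commented out so that only one default remains.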