You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/04/06 23:11:34 UTC

[Nutch Wiki] Update of "GettingNutchRunningWithUbuntu" by RandallLucas

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by RandallLucas:
http://wiki.apache.org/nutch/GettingNutchRunningWithUbuntu

------------------------------------------------------------------------------
  {{{
  root@db2:/opt# ./jdk-1_5_0_04-linux-amd64.bin
  }}}
+ 
+ ''You might also want to follow the instructions for Debian-izing the Sun JDK:'' http://plugindoc.mozdev.org/faqs/distronotes/ubuntu-x86.html#java-sun
  
  Let's put JAVA_HOME in our ~/.bash_profiles, and source said ~/.bash_profiles for root and nutch
  
@@ -110, +112 @@

  Follow the nutch tutorial (http://lucene.apache.org/nutch/tutorial.html) to build a index, or for a simple index:
  
  {{{
+ nutch@db2:~/nutch/trunk $ mkdir urls
- nutch@db2:~/nutch/trunk $ echo 'http://lucene.apache.org/nutch/' > urls
+ nutch@db2:~/nutch/trunk $ echo 'http://lucene.apache.org/nutch/' > urls/nutch
  nutch@db2:~/nutch/trunk $ perl -pi -e 's|MY.DOMAIN.NAME|lucene.apache.org/nutch|' \
    conf/crawl-urlfilter.txt
  nutch@db2:~/nutch/trunk $ bin/nutch crawl urls -dir crawl.test -depth 3