You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2011/07/14 22:39:06 UTC

[Nutch Wiki] Update of "NutchTutorialPre1.3" by JoeLencioni

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "NutchTutorialPre1.3" page has been changed by JoeLencioni:
http://wiki.apache.org/nutch/NutchTutorialPre1.3?action=diff&rev1=36&rev2=37

Comment:
Removing references to < 1.3 or >= 1.3

- ## page was copied from NutchTutorial
  == Requirements ==
   1. Java 1.4.x, either from Sun or IBM on Linux is preferred. Set NUTCH_JAVA_HOME to the root of your JVM installation. Nutch 0.9 requires Sun JDK 1.5 or higher.
   1. Apache's Tomcat 5.x. or higher.
@@ -12, +11 @@

  
  Try the following command:
  
+ {{{ bin/nutch }}} 
- {{{ bin/nutch (version < 1.3) }}} 
- 
- or
- 
- {{{runtime/local/bin/nutch (version >= 1.3) }}}
  
  This will display the documentation for the Nutch command script.
  
  Good! You are almost ready to crawl. You need to give your crawler a name. This is required.
  
-  1. Edit $NUTCH_HOME/conf/nutch-site.xml (or $NUTCH_HOME/runtime/local/conf/nutch-site.xml with version >= 1.3) and add
+  1. Edit $NUTCH_HOME/conf/nutch-site.xml and add
  
  {{{
  <property>
@@ -183, +178 @@

  
  Now we're ready to search!
  
- == Command Line Searching (version < 1.3)  ==
+ == Command Line Searching  ==
  Simplest way to verify the integrity of your crawl is to launch NutchBean from command line:
  
  {{{ bin/nutch org.apache.nutch.searcher.NutchBean apache }}}
  
  where ''apache'' is the search term (note that NutchBean will only search pages in the {{{crawl}}} directory, so if you named the crawl directory something else, NutchBean will not find any results). After you have verified that the above command returns results you can proceed to setting up the web interface.
  
- == Installing in Tomcat (version < 1.3) ==
+ == Installing in Tomcat ==
  To search you need to put the nutch war file into your servlet container. (If instead of downloading a Nutch release you checked the sources out of SVN, then you'll first need to build the war file, with the command {{{ant war}}}.)
  
  Assuming you've unpacked Tomcat as ~/local/tomcat, then the Nutch war file may be installed with the commands: