Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2010/08/02 18:07:23 UTC

[Nutch Wiki] Update of "NutchTutorial" by MatthewSimoneau

The "NutchTutorial" page has been changed by MatthewSimoneau.
The comment on this change is: This is a better way to install..
http://wiki.apache.org/nutch/NutchTutorial?action=diff&rev1=25&rev2=26

--------------------------------------------------

  
  Now we're ready to search!
  
- == Searching ==
+ == Command Line Searching ==
  
  The simplest way to verify the integrity of your crawl is to launch NutchBean from the command line:
  
@@ -189, +189 @@

  
  where ''apache'' is the search term (note that NutchBean will only search pages in the {{{crawl}}} directory, so if you named the crawl directory something else, NutchBean will not find any results). After you have verified that the above command returns results you can proceed to setting up the web interface. 
  
+ == Installing in Tomcat ==
  To search you need to put the nutch war file into your servlet container. (If instead of downloading a Nutch release you checked the sources out of SVN, then you'll first need to build the war file, with the command {{{ant war}}}.)
  
  Assuming you've unpacked Tomcat as ~/local/tomcat, then the Nutch war file may be installed with the commands:
  
- {{{ 
- rm -rf ~/local/tomcat/webapps/ROOT*
- cp nutch*.war ~/local/tomcat/webapps/ROOT.war 
- }}}
- 
- Alternative: it is NOT NECESSARY to remove the ROOT context nor is it desirable. It was noted above that the Tomcat Manager allows us to view and control our multiple applications. Removing ROOT would break this functionality.
- How to deploy nutch in http://localhost:8080/nutch/
  {{{ 
  mkdir ~/local/tomcat/webapps/nutch
  cp nutch*.war ~/local/tomcat/webapps/nutch/
@@ -207, +201 @@

  rm nutch-1.1.war;
  }}}
  
- '''The webapp finds its indexes in ./crawl, relative to where you start Tomcat, so use a command like:'''
+ The webapp finds its indexes in ./crawl, relative to where you start Tomcat, so use a command like:
  
  {{{ ~/local/tomcat/bin/catalina.sh start }}}
  
- Then visit http://localhost:8080/ and have fun!
+ Then visit: http://localhost:8080/nutch/
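
Since the webapp looks for its indexes in ./crawl relative to the directory Tomcat is started from, it may help to check for that directory before starting Tomcat. The following is only a minimal sketch of that idea: the function name start_from_crawl_parent and its two arguments are hypothetical and not part of Nutch or Tomcat, and the default catalina.sh path is simply the one assumed in the tutorial text above.

```shell
# Sketch: start Tomcat from the directory that holds crawl/, so that the
# webapp's relative ./crawl lookup resolves. The function name and its
# arguments are illustrative assumptions, not part of Nutch or Tomcat.
start_from_crawl_parent() {
  parent_dir="$1"                                      # directory containing crawl/
  catalina="${2:-$HOME/local/tomcat/bin/catalina.sh}"  # path used in the tutorial

  if [ ! -d "$parent_dir/crawl" ]; then
    echo "no crawl directory under $parent_dir" >&2
    return 1
  fi

  # Run in a subshell so the caller's working directory is left unchanged.
  ( cd "$parent_dir" && "$catalina" start )
}
```

For example, if the crawl was run from a (hypothetical) ~/nutch directory, calling start_from_crawl_parent ~/nutch would start Tomcat from there, and the function returns a non-zero status without starting anything if ~/nutch/crawl does not exist.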