You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2012/07/09 17:35:44 UTC

[Nutch Wiki] Update of "FAQ" by JulienNioche

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "FAQ" page has been changed by JulienNioche:
http://wiki.apache.org/nutch/FAQ?action=diff&rev1=133&rev2=134

  ==== I have two XML files, nutch-default.xml and nutch-site.xml, why? ====
  nutch-default.xml is the out of the box configuration for Nutch, and most configurations can (and should unless you know what your doing) stay as per. nutch-site.xml is where you make the changes that override the default settings.
  
+ === Compiling Nutch ===
+ ==== How do I compile Nutch? ====
+ Install ANT and call 'ant' on the command line from the directory containing the Nutch source code. Note : this won't work for the binary release for obvious reasons.
+ 
+ ==== How do I compile Nutch in Eclipse? ====
+ Nutch uses ANT+IVY to compile the code and manage the dependencies (see above). There are instructions on how to get Nutch working with Eclipse on [http://wiki.apache.org/nutch/RunNutchInEclipse] but the easiest way of doing is to use ANT for compiling and rely on Eclipse just for visualising the code. You can also debug with Eclipse using the remote debugging and setting e.g. "export NUTCH_OPTS=-Xdebug -agentlib:jdwp=transport=dt_socket,server=y,address=8000" prior to calling the nutch script in /runtime/local/bin.
  
  === Injecting ===
  ==== What happens if I inject urls several times? ====