Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2008/10/17 17:01:28 UTC

[Nutch Wiki] Update of "FrontPage" by DogacanGuney

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by DogacanGuney:
http://wiki.apache.org/nutch/FrontPage

------------------------------------------------------------------------------
  Please contribute your knowledge about Nutch here!
  
- == General Information ==
+  == General Information ==
-  * [http://www.nutch.org Nutch Website ]
+   * [http://www.nutch.org Nutch Website ]
-  * ["Features"]
+   * ["Features"]
-  * PublicServers running Nutch
+   * PublicServers running Nutch
-  * ["Presentations"] on Nutch
+   * ["Presentations"] on Nutch
-  * Press ["Articles"]
+   * Press ["Articles"]
-  * ["Evaluations"] of Search Quality
+   * ["Evaluations"] of Search Quality
-  * ["Help Wanted"] organizations hiring Nutch expertise
+   * ["Help Wanted"] organizations hiring Nutch expertise
-  * Commercial ["Support"] and developers for hire
+   * Commercial ["Support"] and developers for hire
-  * ["Mailing"] Lists
+   * ["Mailing"] Lists
-  * AcademicArticles that deal with Nutch
+   * AcademicArticles that deal with Nutch
  
- == Nutch Administration ==
+  == Nutch Administration ==
-  * DownloadingNutch
+   * DownloadingNutch
-  * HardwareRequirements
+   * HardwareRequirements
-  * '''[http://peterpuwang.googlepages.com/NutchGuideForDummies.htm Tutorial] -- Latest step-by-step installation guide for dummies: Nutch 0.9.'''
+   * '''[http://peterpuwang.googlepages.com/NutchGuideForDummies.htm Tutorial] -- Latest step-by-step installation guide for dummies: Nutch 0.9.'''
-  * [http://lucene.apache.org/nutch/tutorial.html Tutorial] -- A Step-by-Step guide to getting Nutch up and running.
+   * [http://lucene.apache.org/nutch/tutorial.html Tutorial] -- A Step-by-Step guide to getting Nutch up and running.
-  * NutchTutorial ''on the wiki''
+   * NutchTutorial ''on the wiki''
-  * ["Nutch - The Java Search Engine"] (Builds on the basic tutorials. Includes index maintenance scripts)
+   * ["Nutch - The Java Search Engine"] (Builds on the basic tutorials. Includes index maintenance scripts)
-  * [:NutchHadoopTutorial:Nutch Hadoop Tutorial] - How to set up Nutch and Hadoop over a cluster of machines
+   * [:NutchHadoopTutorial:Nutch Hadoop Tutorial] - How to set up Nutch and Hadoop over a cluster of machines
-  * [:Automating_Fetches_with_Python:Automating Fetches with Python] - How to automate the Nutch fetching process using Python
+   * [:Automating_Fetches_with_Python:Automating Fetches with Python] - How to automate the Nutch fetching process using Python
-  * [:Upgrading_Hadoop:Upgrading Hadoop Version in Nutch] - Basic steps for upgrading Hadoop in Nutch.
+   * [:Upgrading_Hadoop:Upgrading Hadoop Version in Nutch] - Basic steps for upgrading Hadoop in Nutch.
-  * ["FAQ"]
+   * ["FAQ"]
-  * [:CommandLineOptions:Commandline] options for 0.7.x
+   * [:CommandLineOptions:Commandline] options for 0.7.x
-  * [:08CommandLineOptions:Commandline] options for version 0.8
+   * [:08CommandLineOptions:Commandline] options for version 0.8
-  * OverviewDeploymentConfigs
+   * OverviewDeploymentConfigs
-  * NutchConfigurationFiles
+   * NutchConfigurationFiles
-  * GettingNutchRunningWithUtf8 - For support of non-ASCII characters (Chinese, German, Japanese, Korean).
+   * GettingNutchRunningWithUtf8 - For support of non-ASCII characters (Chinese, German, Japanese, Korean).
-  * GettingNutchRunningWithResin - Resin is a JSP/Servlet/EJB application server (alternative to tomcat).
+   * GettingNutchRunningWithResin - Resin is a JSP/Servlet/EJB application server (alternative to tomcat).
-  * GettingNutchRunningWithJetty
+   * GettingNutchRunningWithJetty
-  * GettingNutchRunningWithUbuntu
+   * GettingNutchRunningWithUbuntu
-  * GettingNutchRunningWithWindows
+   * GettingNutchRunningWithWindows
-  * GettingNutchRunningWithMacOsx
+   * GettingNutchRunningWithMacOsx
-  * GettingNutchRunningWithRedHatApplicationServer
+   * GettingNutchRunningWithRedHatApplicationServer
-  * GettingNutchRunningWithDebian
+   * GettingNutchRunningWithDebian
-  * GettingNutchRunningWithSocksProxy
+   * GettingNutchRunningWithSocksProxy
-  * ErrorMessages -- What they mean and suggestions for getting rid of them.
+   * ErrorMessages -- What they mean and suggestions for getting rid of them.
-  * SimpleMapReduceTutorial
+   * SimpleMapReduceTutorial
-  * SetupProxyForNutch - using Tinyproxy on Ubuntu
+   * SetupProxyForNutch - using Tinyproxy on Ubuntu
-  * CreateNewFilter - for example to add a category metadata to your index and be able to search for it
+   * CreateNewFilter - for example to add a category metadata to your index and be able to search for it
-  * UpgradeFrom07To08
+   * UpgradeFrom07To08
-  * ["Upgrading_from_0.8.x_to_0.9"]
+   * ["Upgrading_from_0.8.x_to_0.9"]
-  * RunNutchInEclipse for v0.8
+   * RunNutchInEclipse for v0.8
-  * ["RunNutchInEclipse0.9"] for v0.9
+   * ["RunNutchInEclipse0.9"] for v0.9
-  * ["Crawl"] - script to crawl (and possibly recrawl too)
+   * ["Crawl"] - script to crawl (and possibly recrawl too)
-  * IntranetRecrawl - script to recrawl a crawl
+   * IntranetRecrawl - script to recrawl a crawl
-  * MergeCrawl - script to merge 2 (or more) crawls 
+   * MergeCrawl - script to merge 2 (or more) crawls
-  * SearchOverMultipleIndexes - configuring nutch to enable searching over multiple indexes
+   * SearchOverMultipleIndexes - configuring nutch to enable searching over multiple indexes
-  * CrossPlatformNutchScripts
+   * CrossPlatformNutchScripts
-  * MonitoringNutchCrawls - techniques for keeping an eye on a nutch crawl's progress.
+   * MonitoringNutchCrawls - techniques for keeping an eye on a nutch crawl's progress.
-  * ["Nutch 0.9 Crawl Script Tutorial"]
+   * ["Nutch 0.9 Crawl Script Tutorial"]
-  * HttpAuthenticationSchemes - How to enable Nutch to authenticate itself using NTLM, Basic or Digest authentication schemes.
+   * HttpAuthenticationSchemes - How to enable Nutch to authenticate itself using NTLM, Basic or Digest authentication schemes.
-  * NonDefaultIntranetCrawlingOptions - Desirable options to add to your intranet crawling configuration.
+   * NonDefaultIntranetCrawlingOptions - Desirable options to add to your intranet crawling configuration.
-  * RunningNutchAndSolr - How to configure Nutch to crawl, but post to Solr for search/index
+   * RunningNutchAndSolr - How to configure Nutch to crawl, but post to Solr for search/index
  
- == Nutch Development ==
+  == Nutch Development ==
-  * [:Becoming_A_Nutch_Developer:Becoming a Nutch Developer] - Start developing and contributing to Nutch.
+   * [:Becoming_A_Nutch_Developer:Becoming a Nutch Developer] - Start developing and contributing to Nutch.
-  * PluginCentral -- How to write your own plugins and use other people's.
+   * PluginCentral -- How to write your own plugins and use other people's.
-  * InternalDocumentation -- How Nutch works.
+   * InternalDocumentation -- How Nutch works.
-  * [http://lucene.apache.org/nutch/apidocs/index.html JavaDocs] -- The !JavaDocs for Nutch.
+   * [http://lucene.apache.org/nutch/apidocs/index.html JavaDocs] -- The !JavaDocs for Nutch.
-  * [http://lucene.apache.org/nutch/version_control.html Nutch Version Control]
+   * [http://lucene.apache.org/nutch/version_control.html Nutch Version Control]
-  * MultiLingualSupport - ''In development''.
+   * MultiLingualSupport - ''In development''.
-  * FixingOpicScoring - ''In planning''.
+   * FixingOpicScoring - ''In planning''.
-  * HowToContribute
+   * HowToContribute
-  * TaskList -- Tasks for Nutch developers.
+   * TaskList -- Tasks for Nutch developers.
-  * ["Development"] -- More tasks for Nutch developers.
+   * ["Development"] -- More tasks for Nutch developers.
-  * ["Committer's_Rules"] -- Guidelines committers should follow when deciding which branch to commit patches to and when to commit.
+   * ["Committer's_Rules"] -- Guidelines committers should follow when deciding which branch to commit patches to and when to commit.
-  * ["Release_HOWTO"]
+   * ["Release_HOWTO"]
-  * ["Website Update HOWTO"]
+   * ["Website Update HOWTO"]
-  * ["Image Search Design"]
+   * ["Image Search Design"]
-  * ["NutchOSGi"]
+   * ["NutchOSGi"]
-  * ["StrategicGoals"]
+   * ["StrategicGoals"]
-  * ["IndexStructure"]
+   * ["IndexStructure"]
-  * ["Getting Started"]
+   * ["Getting Started"]
-  * JavaDemoApplication - A simple demonstration of how to use the Nutch API in a Java application
+   * JavaDemoApplication - A simple demonstration of how to use the Nutch API in a Java application
-  * InstallingWeb2
+   * InstallingWeb2
  
- == Nutch 2.0 ==
+  == Nutch 2.0 ==
-  * ["Nutch2Architecture"] -- Discussions on the Nutch 2.0 architecture.
+   * ["Nutch2Architecture"] -- Discussions on the Nutch 2.0 architecture.
  
- == Other Resources ==
+  == Other Resources ==
-  * [http://nutch.sourceforge.net/blog/cutting.html Doug's Weblog] -- He's the one who originally wrote Lucene and Nutch.
+   * [http://nutch.sourceforge.net/blog/cutting.html Doug's Weblog] -- He's the one who originally wrote Lucene and Nutch.
-  * [http://wiki.media-style.com/display/nutchDocu/Home Stefan's Nutch Documentation]
+   * [http://wiki.media-style.com/display/nutchDocu/Home Stefan's Nutch Documentation]
-  * [http://frutch.free.fr/wikini/ Frutch Wiki] -- French Nutch Wiki
+   * [http://frutch.free.fr/wikini/ Frutch Wiki] -- French Nutch Wiki
-  * The [http://nutch.sourceforge.net/cgi-bin/twiki/view/Main/Nutch Old Wiki]
+   * The [http://nutch.sourceforge.net/cgi-bin/twiki/view/Main/Nutch Old Wiki]
-  * ["Search_Theory"] Search Theory & White Papers
+   * ["Search_Theory"] Search Theory & White Papers
-  * [http://wiki.apache.org/nutch-data/attachments/FrontPage/attachments/Hadoop-Nutch%200.8%20Tutorial%2022-07-06%20%3CNavoni%20Roberto%3E Tutorial Hadoop+Nutch 0.8 night build Roberto Navoni 24-07-06]
+   * [http://wiki.apache.org/nutch-data/attachments/FrontPage/attachments/Hadoop-Nutch%200.8%20Tutorial%2022-07-06%20%3CNavoni%20Roberto%3E Tutorial Hadoop+Nutch 0.8 night build Roberto Navoni 24-07-06]
-  * [http://blog.foofactory.fi/ FooFactory] Nutch and Hadoop related posts
+   * [http://blog.foofactory.fi/ FooFactory] Nutch and Hadoop related posts
-  * [http://spinn3r.com Spinn3r] [http://spinn3r.com/opensource.php Open Source components] (our contribution to the crawling OSS community with more to come).
+   * [http://spinn3r.com Spinn3r] [http://spinn3r.com/opensource.php Open Source components] (our contribution to the crawling OSS community with more to come).
-  * [http://www.interadvertising.co.uk/blog/nutch_logos Larger / better quality Nutch logos] Re-created Nutch logos available in GIF, PNG & EPS in resolutions up to 1200 x 449
+   * [http://www.interadvertising.co.uk/blog/nutch_logos Larger / better quality Nutch logos] Re-created Nutch logos available in GIF, PNG & EPS in resolutions up to 1200 x 449