You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by le...@apache.org on 2014/03/17 02:34:51 UTC

svn commit: r1578202 - /nutch/site/publish/doap.rdf

Author: lewismc
Date: Mon Mar 17 01:34:50 2014
New Revision: 1578202

URL: http://svn.apache.org/r1578202
Log:
update DOAP

Modified:
    nutch/site/publish/doap.rdf

Modified: nutch/site/publish/doap.rdf
URL: http://svn.apache.org/viewvc/nutch/site/publish/doap.rdf?rev=1578202&r1=1578201&r2=1578202&view=diff
==============================================================================
--- nutch/site/publish/doap.rdf (original)
+++ nutch/site/publish/doap.rdf Mon Mar 17 01:34:50 2014
@@ -23,16 +23,12 @@
 -->
   <Project rdf:about="http://nutch.apache.org">
     <created>2011-07-21</created>
-    <license rdf:resource="http://usefulinc.com/doap/licenses/asl20" />
+    <license rdf:resource="http://www.apache.org/licenses/LICENSE-2.0" />
     <name>Apache Nutch</name>
     <homepage rdf:resource="http://nutch.apache.org" />
     <asfext:pmc rdf:resource="http://nutch.apache.org" />
     <shortdesc>Apache Nutch is an open source web-search software project.</shortdesc>
-    <description>Apache Nutch is an open source web-search software project. Stemming from Apache Lucene, it now builds on Apache Solr adding web-specifics, such as a crawler, a link-graph database and parsing support handled by Apache Tika for HTML and and array other document formats.
-
-Apache Nutch can run on a single machine, but gains a lot of its strength from running in a Hadoop cluster
-
-The system can be enhanced (eg other document formats can be parsed) using a highly flexible, easily extensible and thoroughly maintained plugin infrastructure.</description>
+    <description>Apache Nutch is a highly extensible and scalable open source web crawler software project. Stemming from Apache Lucene, the project has diversified and now comprises two codebases, namely: Nutch 1.x: A well matured, production ready crawler. 1.x enables fine grained configuration, relying on Apache Hadoop data structures, which are great for batch processing. Nutch 2.x: An emerging alternative taking direct inspiration from 1.x, but which differs in one key area; storage is abstracted away from any specific underlying data store by using Apache Gora for handling object to persistent mappings. This means we can implement an extremely flexibile model/stack for storing everything (fetch time, status, content, parsed text, outlinks, inlinks, etc.) into a number of NoSQL storage solutions. Being pluggable and modular of course has it's benefits, Nutch provides extensible interfaces such as Parse, Index and ScoringFilter's for custom implementations e.g. Apache Tika for p
 arsing. Additonally, pluggable indexing exists for Apache Solr, Elastic Search, etc. Nutch can run on a single machine, but gains a lot of its strength from running in a Hadoop cluster </description>
     <bug-database rdf:resource="http://issues.apache.org/jira/browse/NUTCH" />
     <mailing-list rdf:resource="http://nutch.apache.org/mailing_lists.html" />
     <download-page rdf:resource="http://www.apache.org/dyn/closer.cgi/nutch/" />
@@ -40,6 +36,48 @@ The system can be enhanced (eg other doc
     <category rdf:resource="http://projects.apache.org/category/web-framework" />
     <release>
       <Version>
+        <name>Apache Nutch 1.8</name>
+        <created>2014-03-17</created>
+        <revision>1.8</revision>
+      </Version>
+    </release>
+    <release>
+      <Version>
+        <name>Apache Nutch 2.2.1</name>
+        <created>2013-07-02</created>
+        <revision>2.2.1</revision>
+      </Version>
+    </release>
+    <release>
+      <Version>
+        <name>Apache Nutch 1.7</name>
+        <created>2013-06-24</created>
+        <revision>1.7</revision>
+      </Version>
+    </release>
+    <release>
+      <Version>
+        <name>Apache Nutch 2.2</name>
+        <created>2013-06-05</created>
+        <revision>2.2</revision>
+      </Version>
+    </release>
+    <release>
+      <Version>
+        <name>Apache Nutch 1.6</name>
+        <created>2012-12-06</created>
+        <revision>1.6</revision>
+      </Version>
+    </release>
+    <release>
+      <Version>
+        <name>Apache Nutch 2.1</name>
+        <created>2012-10-05</created>
+        <revision>1.5.1</revision>
+      </Version>
+    </release>
+    <release>
+      <Version>
         <name>Apache Nutch 1.5.1</name>
         <created>2012-07-10</created>
         <revision>1.5.1</revision>