You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mahout.apache.org by sr...@apache.org on 2010/10/20 12:57:43 UTC

svn commit: r1025493 - in /mahout/site: new_website/index.html src/documentation/content/xdocs/index.xml

Author: srowen
Date: Wed Oct 20 10:57:43 2010
New Revision: 1025493

URL: http://svn.apache.org/viewvc?rev=1025493&view=rev
Log:
Add 0.4 release news to old and new sites

Modified:
    mahout/site/new_website/index.html
    mahout/site/src/documentation/content/xdocs/index.xml

Modified: mahout/site/new_website/index.html
URL: http://svn.apache.org/viewvc/mahout/site/new_website/index.html?rev=1025493&r1=1025492&r2=1025493&view=diff
==============================================================================
--- mahout/site/new_website/index.html (original)
+++ mahout/site/new_website/index.html Wed Oct 20 10:57:43 2010
@@ -154,6 +154,36 @@
 
 		Interested in helping? See the Wiki or send us an email. Also note, we are just getting off the ground, so please be patient as we get the various infrastructure pieces in place.
 		<h2>Mahout News</h2>
+
+        <h3 class="boxed">21 October 2010 - Apache Mahout 0.4 released</h3>
+        <p>We are pleased to announce release 0.4 of Mahout. Virtually every corner of the project has changed,
+        and significantly, since 0.3. Developers are invited to use and depend on version 0.4 even as yet more
+        change is to be expected before the next release. Highlights include:
+          <ul>
+            <li>Model refactoring and CLI changes to improve integration and consistency</li>
+            <li>New ClusterEvaluator and CDbwClusterEvaluator offer new ways to evaluate clustering effectiveness</li>
+            <li>New Spectral Clustering and MinHash Clustering (still experimental)</li>
+            <li>New VectorModelClassifier allows any set of clusters to be used for classification</li>
+            <li>Map/Reduce job to compute the pairwise similarities of the rows of a matrix using a customizable similarity measure</li>
+            <li>Map/Reduce job to compute the item-item-similarities for item-based collaborative filtering</li>
+            <li>RecommenderJob has been evolved to a fully distributed item-based recommender</li>
+            <li>Distributed Lanczos SVD implementation</li>
+            <li>More support for distributed operations on very large matrices</li>
+            <li>Easier access to Mahout operations via the command line</li>
+            <li>New HMM based sequence classification from GSoC (currently as sequential version only and still experimental)</li>
+            <li>Sequential logistic regression training framework</li>
+            <li>New SGD classifier</li>
+            <li>Experimental new type of NB classifier, and feature reduction options for existing one</li>
+            <li>New vector encoding framework for high speed vectorization without a pre-built dictionary</li>
+            <li>Additional elements of supervised model evaluation framework</li>
+            <li>Promoted several pieces of old Colt framework to tested status (QR decomposition, in particular)</li>
+            <li>Can now save random forests and use it to classify new data</li>
+            <li>Many, many small fixes, improvements, refactorings and cleanup</li>
+          </ul>
+        <p>Details on what's included can be found in the <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310751&amp;styleName=Html&amp;version=12314281">release notes</a>.
+        Downloads are available from the <a href="http://www.apache.org/dyn/closer.cgi/lucene/mahout/">Apache Mirrors</a>.
+        </p>
+
 		
 		 <h3 class="boxed">29 March 2010 - Google Summer Of Code Projects</h3> <p>Its Summer of Code time again and ASF is accepting proposals from students.
         Mahout has a number of people willing to be mentors, so if you are a student interested in working on machine learning algorithms using Hadoop or

Modified: mahout/site/src/documentation/content/xdocs/index.xml
URL: http://svn.apache.org/viewvc/mahout/site/src/documentation/content/xdocs/index.xml?rev=1025493&r1=1025492&r2=1025493&view=diff
==============================================================================
--- mahout/site/src/documentation/content/xdocs/index.xml (original)
+++ mahout/site/src/documentation/content/xdocs/index.xml Wed Oct 20 10:57:43 2010
@@ -6,8 +6,8 @@
     </title>
   </header>
   <body>
-    <section id="Apache Lucene">
-      <title>Apache Lucene Mahout</title>
+    <section id="Apache Mahout">
+      <title>Apache Mahout</title>
       <p>Mahout's goal is to build scalable machine learning libraries. With scalable we mean:
         <ul><li>Scalable to reasonably large data sets. Our core algorithms for clustering, classfication and batch based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm. However we do not restrict contributions to Hadoop based implementations: Contributions that run on a single node or on a non-Hadoop cluster are welcome as well. The core libraries are highly optimized to allow for good performance also for non-distributed algorithms.</li>
             <li>Scalable to support your business case. Mahout is distributed under a commercially friendly Apache Software license.</li>
@@ -38,6 +38,42 @@
       </section>
 
       <section>
+        <title>21 October 2010 - Apache Mahout 0.4 released</title>
+        <p>We are pleased to announce release 0.4 of Mahout. Virtually every corner of the project has changed,
+        and significantly, since 0.3. Developers are invited to use and depend on version 0.4 even as yet more
+        change is to be expected before the next release.</p>
+        <p>
+          Highlights include:
+          <ul>
+            <li>Model refactoring and CLI changes to improve integration and consistency</li>
+            <li>New ClusterEvaluator and CDbwClusterEvaluator offer new ways to evaluate clustering effectiveness</li>
+            <li>New Spectral Clustering and MinHash Clustering (still experimental)</li>
+            <li>New VectorModelClassifier allows any set of clusters to be used for classification</li>
+            <li>Map/Reduce job to compute the pairwise similarities of the rows of a matrix using a customizable similarity measure</li>
+            <li>Map/Reduce job to compute the item-item-similarities for item-based collaborative filtering</li>
+            <li>RecommenderJob has been evolved to a fully distributed item-based recommender</li>
+            <li>Distributed Lanczos SVD implementation</li>
+            <li>More support for distributed operations on very large matrices</li>
+            <li>Easier access to Mahout operations via the command line</li> 
+            <li>New HMM based sequence classification from GSoC (currently as sequential version only and still experimental)</li>
+            <li>Sequential logistic regression training framework</li>
+            <li>New SGD classifier</li>
+            <li>Experimental new type of NB classifier, and feature reduction options for existing one</li>
+            <li>New vector encoding framework for high speed vectorization without a pre-built dictionary</li>
+            <li>Additional elements of supervised model evaluation framework</li>
+            <li>Promoted several pieces of old Colt framework to tested status (QR decomposition, in particular)</li>
+            <li>Can now save random forests and use it to classify new data</li>
+            <li>Many, many small fixes, improvements, refactorings and cleanup</li>
+          </ul>
+        </p>
+        <p>Details on what's included can be found in the <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310751&amp;styleName=Html&amp;version=12314281">release notes</a>.
+        </p>
+        <p>Downloads are available from the <a href="http://www.apache.org/dyn/closer.cgi/lucene/mahout/">Apache Mirrors</a>.
+        </p>
+
+      </section>
+
+      <section>
         <title>17 March 2010 - Apache Mahout 0.3 released</title>
         <p>The Apache Lucene project is pleased to announce the release of Apache Mahout 0.3.</p>
         <p>