You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@pdfbox.apache.org by le...@apache.org on 2013/05/13 00:08:15 UTC

svn commit: r1481660 - /pdfbox/cmssite/trunk/content/references.mdtext

Author: lehmi
Date: Sun May 12 22:08:15 2013
New Revision: 1481660

URL: http://svn.apache.org/r1481660
Log:
minor changes

Modified:
    pdfbox/cmssite/trunk/content/references.mdtext

Modified: pdfbox/cmssite/trunk/content/references.mdtext
URL: http://svn.apache.org/viewvc/pdfbox/cmssite/trunk/content/references.mdtext?rev=1481660&r1=1481659&r2=1481660&view=diff
==============================================================================
--- pdfbox/cmssite/trunk/content/references.mdtext (original)
+++ pdfbox/cmssite/trunk/content/references.mdtext Sun May 12 22:08:15 2013
@@ -9,6 +9,7 @@ Please file an [improvement issue](https
 | Project Name  | License | Project Description |
 |--|--|--|
 | [Alfresco](http://www.alfresco.org/) | LGPL - commercial services/support/training is available | Alfresco is an open source, open-standards content repository built by the most experienced content management team that includes the co-founder of Documentum.|
+| [Apache Nutch](http://nutch.apache.org/) | Apache License V2.0 | Apache Nutch is open source web-search software. It builds on Apache Lucene, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.|
 | [Apache Tika](http://tika.apache.org/) | Apache License V2.0 | Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.|
 | [Centric CRM](http://www.centriccrm.com/) | Free To Use But Restricted/Commercial | The Most Advanced Open Source CRM Software.|
 | [Canoo Webtest](http://webtest.canoo.com/webtest/manual/WebTestHome.html) | BSD Like | Free OpenSource tool for XP-style acceptance testing of Java-based Web applications.|
@@ -23,7 +24,6 @@ Please file an [improvement issue](https
 | [LuceGene](http://www.gmod.org/lucegene/) | Artistic License | LuceGene is an open-source document/object search and retrieval system specially tuned for bioinformatics text databases and documents.|
 | [Lutece](http://www.lutece.paris.fr/) | BSD-like | Lutece is a portal engine which allows you to easily create your websites or intranets based upon HTML,XML content.|
 | [MMBase Lucene Module](http://mmapps.sourceforge.net/lucenemodule/) | MPL | Lucenemodule is a plugin (module) for the MMBase content management system that enables Lucene full text search through it's content, and thanks to PDFBox also PDF content.|
-| [Nutch](http://lucene.apache.org/nutch/) | ASL | Nutch is open source web-search software. It builds on Lucene, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.|
 | [OpenCms](http://www.opencms.org/) | Custom | OpenCms is a professional level Open Source Website Content Management System.|
 | [OpenSearchServer](http://www.open-search-server.com/) | GPLv3 | An open source search engine and crawler based on best open source technologies. It is a modern search engine and a suite of high-powered full text search algorithms.|
 | [Orbeon PresentationServer](http://forge.objectweb.org/projects/ops) | LGPL | Orbeon PresentationServer (OPS) is an open source J2EE-based platform for XML-centric web applications. OPS is built around XHTML, XForms, XSLT, XML pipelines, and Web Services, which makes it ideal for applications that capture, process and present XML data. Commercial consulting/training/support is available through orbeon.|