You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-commits@hadoop.apache.org by Apache Wiki <wi...@apache.org> on 2011/09/30 15:34:05 UTC

[Hadoop Wiki] Update of "Distributions and Commercial Support" by SteveLoughran

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Distributions and Commercial Support" page has been changed by SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diff&rev1=41&rev2=42

Comment:
roll back some of the hype, remove personal "we" claims, use Apache name more thoroughly

   * [[http://aws.amazon.com/|Amazon Web Services]]
    * Amazon offers a version of Apache Hadoop on their EC2 infrastructure, sold as  [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]]. 
  
-  * [[http://www.cascading.org/|Cascading]] - Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on a Hadoop cluster.
+  * [[http://www.cascading.org/|Cascading]] - Cascading is a feature-rich API for defining and executing complex and fault tolerant data processing workflows on a Apache Hadoop cluster.
  
   * [[Cloudera]]: [[http://www.cloudera.com/downloads/|Cloudera's Distribution including Apache Hadoop]] currently includes:
    * [[http://www.cloudera.com/downloads/|Docs and Setup Guide]]
-   * Tested and integrated packages for related Hadoop projects (hive, pig, zookeeper, hbase, flume, sqoop, oozie, hue)
+   * Tested and integrated packages for related Hadoop projects (Apache hive, pig, zookeeper, hbase, flume, sqoop, oozie, hue)
    * Standard Linux service management for all Hadoop services
    * RPM and Debian packages for redhat / ubuntu based systems in binary and source form
     * Public YUM and APT repository for distribution and updates
@@ -34, +34 @@

      * High performance bare metal cloud with [[http://www.softlayer.com|Softlayer]] ([[mailto:info@cloudera.com|contact]])
  
   * [[http://www.cloudspace.com/|Cloudspace]]
-   * Cloudspace is a web technology consulting company, since 1996. Cloudspace uses Hadoop to scale client and internal projects on Amazon's EC2 and bare metal architectures.
+   * Cloudspace is a web technology consulting company, since 1996. Cloudspace uses Apache Hadoop to scale client and internal projects on Amazon's EC2 and bare metal architectures.
  
   * [[http://www.datameer.com|Datameer]]
-   * Datameer Analytics Solution (DAS) is the first Hadoop-based solution for big data analytics that includes data source integration, storage, an analytics engine and visualization.
+   * Datameer Analytics Solution (DAS) is a Hadoop-based solution for big data analytics that includes data source integration, storage, an analytics engine and visualization.
    * DAS Log File Aggregator is a plug-in to DAS that makes it easy to import large numbers of log files stored on disparate servers.
  
   * [[http://www.debian.org|Debian]]
@@ -53, +53 @@

    * Available as [[http://www.hstreaming.com/products/cloud/|cloud service]] and as a [[http://www.hstreaming.com/products/enterprise/|software license]].
  
   * [[http://www.alphaworks.ibm.com/tech/idah|IBM]]
-   * IBM now offers a repackaged version of Apache Hadoop that IBM supports on IBM JVMs.
+   * IBM offers a repackaged version of Apache Hadoop that IBM supports on IBM JVMs.
  
   * [[http://www.impetus.com/ |Impetus]]
    *Impetus' LADAP system is built for large enterprises and Websites to effectively derive intelligence out of raw data from discrete sources. With LADAP, an in-depth analysis can be undertaken on data from many different sources including social networks, to find out the patterns and structures within it. [[http://bigdata.impetus.com/big_data_analytics_platform# | More info about LADAP @Impetus]]
@@ -61, +61 @@

    *With a strong focus, established thought leadership and open source contributions in the area of Big Data analytics and consulting services, Impetus uses its Global Delivery Model to help technology businesses and enterprises evaluate and implement solutions tailored to their specific context, without being biased towards a particular solution. [[http://bigdata.impetus.com/# | More info about BigData @Impetus]]
  
   * [[http://www.karmasphere.com/|Karmasphere]]
-   * Distributes [[http://www.hadoopstudio.org/|Karmasphere Studio for Hadoop]], which allows cross-version development and management of Hadoop jobs in a familiar integrated development environment.
+   * Distributes [[http://www.hadoopstudio.org/|Karmasphere Studio for Hadoop]], which allows cross-version development and management of Apache Hadoop jobs in a familiar integrated development environment.
  
   * [[http://lucene.apache.org/mahout|Mahout]]
    * Another Apache project using Hadoop to build scalable machine learning algorithms like canopy clustering, k-means and many more.
@@ -69, +69 @@

   * [[http://mapr.com|MapR Technologies]]
    * MapR sells a high performance map-reduce framework based on Apache Hadoop that includes the standard eco-system components.  A significant amount of re-engineering of the file system and the map-reduce components allows significantly higher performance than standard Hadoop while eliminating Hadoop's single points of failure (the NameNode and JobTracker) and allowing full read-write access to the cluster file store via NFS.
  
-  * [[http://lucene.apache.org/nutch|Nutch]] - flexible web search engine software
+  * [[http://lucene.apache.org/nutch|Nutch]] - Apache Nutch: flexible web search engine software
  
   * [[http://pentaho.com|Pentaho]] – Open Source Business Intelligence
-   * Pentaho provides the only complete, end-to-end open  source BI alternative to proprietary offerings like Oracle, SAP and IBM.
+   * Pentaho provides a complete, end-to-end open-source BI alternative to proprietary offerings like Oracle, SAP and IBM.
-   * We provide an easy-to-use, graphical ETL tool that  is integrated with Hadoop for managing data and coordinating Hadoop related tasks in the broader context of your ETL and Business Intelligence workflow.
+   * Offers an easy-to-use, graphical ETL tool that  is integrated with Apache Hadoop for managing data and coordinating Hadoop related tasks in the broader context of your ETL and Business Intelligence workflow.
-   * We also provide Reporting and Analysis capabilities against big data in Hadoop.
+   * Provides Reporting and Analysis capabilities against big data in Hadoop.
    * Learn more at [[http://www.pentaho.com/hadoop/|http://www.pentaho.com/hadoop]].
  
   * [[http://www.pervasivedatarush.com|Pervasive Software]]
-   * We provide[[http://www.pervasivedatarush.com|Pervasive DataRush]], a parallel dataflow framework which improves performance of Hadoop and MapReduce jobs by exploiting fine-grained parallelism on multicore servers.  [[mailto:info@pervasivedatarush.com|(contact)]]
+   * Provides [[http://www.pervasivedatarush.com|Pervasive DataRush]], a parallel dataflow framework which improves performance of Apache Hadoop and MapReduce jobs by exploiting fine-grained parallelism on multicore servers.  [[mailto:info@pervasivedatarush.com|(contact)]]
   * [[http://www.platform.com|Platform Computing]]
-   *[[http://www.platform.com/mapreduce|Platform Computing]] provides an Enterprise Class MapReduce solution for Big Data Analytics with high scalability and fault tolerance. [[http://www.platform.com/products/mapreduce|Platform MapReduce]] provides unique scheduling capabilities and its architecture is based on almost two decades of distributed computing research and development. Based on the same low-latency distributed architecture deployed in the leading financial institutions on Wallstreet, the solution meets the needs of the most demanding enterprise customers. With comprehensive GUI management tools and commercial support available for HDFS, the solution also supports other distributed file systems. [[mailto:mapreduce@platform.com|(contact)]]
+   *[[http://www.platform.com/mapreduce|Platform Computing]] provides an Enterprise Class MapReduce solution for Big Data Analytics with high scalability and fault tolerance. [[http://www.platform.com/products/mapreduce|Platform MapReduce]] provides unique scheduling capabilities and its architecture is based on almost two decades of distributed computing research and development. Based on the same low-latency distributed architecture deployed in the leading financial institutions on Wallstreet, the solution meets the needs of the most demanding enterprise customers. With comprehensive GUI management tools and commercial support available for HDFS, the solution also supports other distributed file systems. 
   * [[http://www.sematext.com/|Sematext International]]
-   * Provides consulting services around Hadoop and HBase, along with large-scale search using Lucene, Solr, and Elastic Search.
+   * Provides consulting services around Apache Hadoop and Apache HBase, along with large-scale search using Apache Lucene, Apache Solr, and Elastic Search.
    * Runs the popular [[http://search-hadoop.com/|search-hadoop.com]] search service.
  
   * [[http://www.talend.com/|Talend]] - The Open Source Integration Company
-   * [[http://www.talend.com/products-data-integration/talend-integration-suite-mpx.php|Talend Integration Suite MPx]] includes support for Hadoop's distributed file system (HDFS) that provides high throughput access to application data.
+   * [[http://www.talend.com/products-data-integration/talend-integration-suite-mpx.php|Talend Integration Suite MPx]] includes support for Apache Hadoop's distributed file system (HDFS) that provides high throughput access to application data.
-   * As well as Hadoop's data warehouse infrastructure (Hive) that provides data summarization and ad-hoc querying.
+   * Supports Apache Hive for data summarization and ad-hoc querying.
  
   * [[http://thinkbiganalytics.com|Think Big Analytics]] - Flexible Data Solution Services
-   * Think Big Analytics offers expert consulting services specializing in Hadoop, MapReduce and related data processing architectures.
+   * Think Big Analytics offers expert consulting services specializing in Apache Hadoop, MapReduce and related data processing architectures.
-   * We offer 1 week Brainstorm Workshops and 6 week Deployment Iterations delivered collaboratively with our customers.
+   * Offers 1 week Brainstorm Workshops and 6 week Deployment Iterations delivered collaboratively with our customers.
  
- 
-  * [[http://tresata.com|Tresata]] - Hadoop powered Big Data as a Service Platform for Financial Services
+  * [[http://tresata.com|Tresata]] - Apache Hadoop powered Big Data as a Service Platform for Financial Services
-   * The 1st and only Big Data Analytics Platform for Financial Data powered by Hadoop
+   * Big Data Analytics Platform for Financial Data powered by Apache Hadoop
    * Provides industry leading data processing, analytics and visualization capabilities