Posted to common-commits@hadoop.apache.org by Apache Wiki <wi...@apache.org> on 2009/10/17 20:54:30 UTC

[Hadoop Wiki] Update of "FrontPage" by BarronGraham

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "FrontPage" page has been changed by BarronGraham.
http://wiki.apache.org/hadoop/FrontPage?action=diff&rev1=137&rev2=138

--------------------------------------------------

  = Apache Hadoop =
- 
- [[http://hadoop.apache.org/|Apache Hadoop]] is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named [[HadoopMapReduce| Map/Reduce]], where the application is divided into many small fragments of work, each of which may be executed or reexecuted on any node in the cluster. In addition, it provides a distributed file system ([[DFS|HDFS]]) that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both Map/Reduce and the distributed file system are designed so that node failures are automatically handled by the framework.
+ [[http://hadoop.apache.org/|Apache Hadoop]] is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named [[HadoopMapReduce|Map/Reduce]], where the application is divided into many small fragments of work, each of which may be executed or reexecuted on any node in the cluster. In addition, it provides a distributed file system ([[DFS|HDFS]]) that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both Map/Reduce and the distributed file system are designed so that node failures are automatically handled by the framework.
  
  == General Information ==
-  * [[http://hadoop.apache.org/|Official Apache Hadoop Website ]]: download, bug-tracking, mailing-lists, etc.
+  * [[http://hadoop.apache.org/|Official Apache Hadoop Website]]: download, bug-tracking, mailing-lists, etc.
-  * [[ProjectDescription| Overview]] of Apache Hadoop
+  * [[ProjectDescription|Overview]] of Apache Hadoop
   * [[FAQ]]
-  * [[HadoopIsNot| What Hadoop is not]]
+  * [[HadoopIsNot|What Hadoop is not]]
-  * [[Distribution| Distributions]] for Hadoop (RPMs, Debs, AMIs, etc)
+  * [[Distribution|Distributions]] for Hadoop (RPMs, Debs, AMIs, etc)
-  * [[HadoopPresentations| Presentations]], [[Books| books]], [[HadoopArticles| articles]] and [[Papers| papers]] about Hadoop
+  * [[HadoopPresentations|Presentations]], [[Books|books]], [[HadoopArticles|articles]] and [[Papers|papers]] about Hadoop
   * PoweredBy, a list of sites and applications powered by Apache Hadoop
   * Support
    * [[Help|Getting help from the hadoop community]].
@@ -18, +17 @@

   * Hadoop Community Events and Conferences
    * HadoopUserGroups (HUGs)
    * HadoopSummit
-  * [[http://developer.yahoo.com/hadoop/tutorial/|Yahoo! Hadoop Tutorial ]]: A thorough tutorial covering Hadoop setup, HDFS, and [[HadoopMapReduce| MapReduce]]
+  * [[http://developer.yahoo.com/hadoop/tutorial/|Yahoo! Hadoop Tutorial]]: A thorough tutorial covering Hadoop setup, HDFS, and [[HadoopMapReduce|MapReduce]]
   * [[http://www.cloudera.com/hadoop-training-basic|Cloudera Online Hadoop Training]]: Video lectures, exercises and a pre-configured [[http://www.cloudera.com/hadoop-training-virtual-machine|virtual machine]] to follow along. Sessions cover [[http://www.cloudera.com/hadoop-training-programming-with-hadoop|Hadoop]], [[http://www.cloudera.com/hadoop-training-mapreduce-algorithms|MapReduce]], [[http://www.cloudera.com/hadoop-training-hive-introduction|Hive]], [[http://www.cloudera.com/hadoop-training-pig-introduction|Pig]] and more.
  
  == User Documentation ==
@@ -26, +25 @@

   * GettingStartedWithHadoop (lots of details and explanation)
   * QuickStart (for those who just want it to work ''now'')
   * [[http://hadoop.apache.org/core/docs/current/commands_manual.html|Command Line Options]] for hadoop shell script.
-  * [[HadoopOverview| Hadoop Code Overview]]
+  * [[HadoopOverview|Hadoop Code Overview]]
-  * [[TroubleShooting| Troubleshooting]] What to do when things go wrong
+  * [[TroubleShooting|Troubleshooting]] What to do when things go wrong
  
   * Cluster setup
    * [[Running_Hadoop_On_Ubuntu_Linux_(Single-Node_Cluster)]] (tutorial on installing, configuring and running Hadoop on a single machine)
-   * [[Running_Hadoop_On_OS_X_10.5_64-bit_(Single-Node_Cluster)]] 
+   * [[Running_Hadoop_On_OS_X_10.5_64-bit_(Single-Node_Cluster)]]
    * HowToConfigure Hadoop software
-   * [[WebApp_URLs| WebApps for monitoring your system]]
+   * [[WebApp_URLs|WebApps for monitoring your system]]
-   * [[NameNodeFailover| How to handle name node failure]]
+   * [[NameNodeFailover|How to handle name node failure]]
-   * [[GangliaMetrics| How to get metrics into ganglia]]
+   * [[GangliaMetrics|How to get metrics into ganglia]]
-   * [[LargeClusterTips| Tips for managing a large cluster]]
+   * [[LargeClusterTips|Tips for managing a large cluster]]
-   * [[VirtualCluster| How to bring up a cluster of Virtual Machines]]
+   * [[VirtualCluster|How to bring up a cluster of Virtual Machines]]
-   * [[DiskSetup| Disk Setup: some suggestions]]
+   * [[DiskSetup|Disk Setup: some suggestions]]
-   * [[PerformanceTuning| Performance:]] getting extra throughput
+   * [[PerformanceTuning|Performance:]] getting extra throughput
-   * [[http://v-lad.org/Tutorials/Hadoop/00%20-%20Intro.html|Hadoop Windows/Eclipse Tutorial ]]: Tutorial on how to set up and configure a Hadoop development cluster for Windows and Eclipse.
+   * [[http://v-lad.org/Tutorials/Hadoop/00%20-%20Intro.html|Hadoop Windows/Eclipse Tutorial]]: Tutorial on how to set up and configure a Hadoop development cluster for Windows and Eclipse.
  
   * Map/Reduce
    * HadoopMapReduce
@@ -51, +50 @@

  
   * Examples
    * WordCount
-   * [[PythonWordCount| Python Word Count]]
+   * [[PythonWordCount|Python Word Count]]
-   * [[C++WordCount| C/C++ Word Count]]
+   * [[C++WordCount|C/C++ Word Count]]
    * [[Grep]]
    * [[Sort]]
    * RandomWriter
-   * [[HadoopDfsReadWriteExample| How to read from and write to HDFS]]
+   * [[HadoopDfsReadWriteExample|How to read from and write to HDFS]]
-   
+ 
   * Amazon
    * Running Hadoop on [[AmazonEC2]]
    * Running Hadoop with AmazonS3
  
   * Benchmarks
-   * [[HardwareBenchmarks| Hardware benchmarks]]
+   * [[HardwareBenchmarks|Hardware benchmarks]]
-   * [[DataProcessingBenchmarks| Data processing benchmarks]]
+   * [[DataProcessingBenchmarks|Data processing benchmarks]]
  
   * Sub-Projects
    * [[Hbase]], a Bigtable-like structured storage system for Hadoop HDFS
@@ -84, +83 @@

   * HowToContribute
   * HowToDevelopUnitTests
   * HowToSetupYourDevelopmentEnvironment
-  * [[CodeReviewChecklist| HowToCodeReview]]
+  * [[CodeReviewChecklist|HowToCodeReview]]
   * [[Jira]] usage guidelines
   * HowToCommit
   * HowToRelease
   * HudsonBuildServer
   * DevelopmentHints
   * ProjectSuggestions
-  * [[HadoopUnderIDEA| Building/Testing under IntelliJ IDEA]]
+  * [[HadoopUnderIDEA|Building/Testing under IntelliJ IDEA]]
  
  == Related Resources ==
- 
   * [[http://wiki.apache.org/nutch/NutchHadoopTutorial|Nutch Hadoop Tutorial]] (Useful for understanding Hadoop in an application context)
   * [[http://www.alphaworks.ibm.com/tech/mapreducetools|IBM MapReduce Tools for Eclipse]] (An Eclipse plug-in that simplifies the creation and deployment of MapReduce programs)
   * Hadoop IRC channel is #hadoop at irc.freenode.net.
@@ -104, +102 @@

   * [[http://lucene.apache.org/mahout|Mahout]], scalable Machine Learning algorithms using Hadoop
   * [[http://opensolaris.org/os/project/livehadoop/|Live Hadoop]] A three-node, distributed Hadoop cluster running on an !OpenSolaris live CD
   * [[https://rc.usf.edu/trac/hadoop/wiki/SGEIntegration|SGE Integration]] A guide on tight-integration of Hadoop on Sun Gridengine
+ 
  ----
  CategoryHomepage