You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-commits@hadoop.apache.org by Apache Wiki <wi...@apache.org> on 2010/06/20 16:07:04 UTC

[Hadoop Wiki] Update of "ZooKeeper/GSoCMonitoringAndWebInterface" by AndreiSavu

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "ZooKeeper/GSoCMonitoringAndWebInterface" page has been changed by AndreiSavu.
http://wiki.apache.org/hadoop/ZooKeeper/GSoCMonitoringAndWebInterface?action=diff&rev1=4&rev2=5

--------------------------------------------------

  == GSoC 2010: ZooKeeper Monitoring Recipes and Web-based Administrative Interface ==
- 
   * Student: [[http://andreisavu.ro/|Andrei Savu]] (savu.andrei at gmail dot com)
-  * Assigned mentor: Patrick Hunt (phunt at apache dot org) 
+  * Assigned mentor: Patrick Hunt (phunt at apache dot org)
  
  == Abstract ==
- 
- ZooKeeper is a complex distributed system. Understanding how well it is running is tremendously important. Patrick Hunt has created a [[http://github.com/phunt/zookeeper_dashboard|Django-based dashboard]] that allows some insight into how ZooKeeper is running. This is the foundation I'm going to build on. This project would capture much more information from ZooKeeper, adding hooks to retrieve it where necessary and visualize it in a appealing and useful way. I'm also going to provide a bunch of monitoring recipes for systems like: Ganglia, Nagios, Cacti. 
+ ZooKeeper is a complex distributed system. Understanding how well it is running is tremendously important. Patrick Hunt has created a [[http://github.com/phunt/zookeeper_dashboard|Django-based dashboard]] that allows some insight into how ZooKeeper is running. This is the foundation I'm going to build on. This project would capture much more information from ZooKeeper, adding hooks to retrieve it where necessary and visualize it in a appealing and useful way. I'm also going to provide a bunch of monitoring recipes for systems like: Ganglia, Nagios, Cacti.
  
  == Work In Progress ==
+  * monitoring for Cacti and Ganglia
+  * commit as zookeeper-monitoring as a contrib
- 
-  * monitoring script: [[http://github.com/andreisavu/zkstats|zkstats]] - parse output from mntr 4letter word
-  * monitoring script for nagios based on zkstats
   * [[https://issues.apache.org/jira/browse/ZOOKEEPER-175|ZOOKEEPER-175]]
   * [[https://issues.apache.org/jira/browse/ZOOKEEPER-757|ZOOKEEPER-757]]
   * [[https://issues.apache.org/jira/browse/ZOOKEEPER-613|ZOOKEEPER-613]]
-  * learn more about Ganglia and Cacti (I have previous experience with Nagios)
-  * init.d (or daemontools) scripts for zookeeper
  
  == Done ==
- 
+  * monitoring tools and recipes: [[http://github.com/andreisavu/zookeeper-monitoring|zookeeper-monitoring]] : Nagios
   * [[https://issues.apache.org/jira/browse/ZOOKEEPER-744|ZOOKEEPER-744]]
  
  == Milestones ==
+ === Community Bonding (starts: 26 April ends: 24 May) ===
+ Activities:
  
- === Community Bonding (starts: 26 April ends: 24 May) ===
- 
- Activities:
   * read mail lists archives
   * read source code
-  * discuss with the community members 
-  (monitoring and administration requirements, production stories)
+  * discuss with the community members  (monitoring and administration requirements, production stories)
   * discuss with the Adobe Hadoop / Hbase team about their specific monitoring requirements
  
  Expected results:
+ 
   * understand source code and the known bugs
   * understand how the software is used in production
   * understand monitoring requirements
-  * understand debugging requirements 
+  * understand debugging requirements
-  * setup a development environment 
+  * setup a development environment
  
  === Monitoring and Data Collection (starts: 24 May ends: 20 June ) ===
+ Activities:
  
- Activities:
   * deploy small scale (multinode) cluster for development (virtual machines)
   * identify important health signals add hooks (if needed) for realtime data collection
   * create scripts / plugins for cluster monitoring using Cacti, Ganglia, Nagios, SNMP
-  * document script install procedures 
+  * document script install procedures
   * collaborate with the Adobe Hadoop / Hbase team and deploy the monitoring scripts in production
  
  Expected results:
+ 
   * production ready scripts / plugins for monitoring
   * easy to understand and follow install guides
  
+ === Web Application (starts: 20 June ends: 9 august) ===
+ Activities:
  
- === Web Application (starts: 20 June ends: 9 august) ===
- 
- Activities:
   * package zkpython bindings (distutils, .deb, .rpm)
   * simple authentication and custom authentication backend based on zookeeper
   * interface and server side code for server list configuration
@@ -65, +59 @@

   * javascript based znode hierarchy browser
   * simple in-browser event monitor
   * deploy on production or development cluster at Adobe (if possible)
-  
+ 
  Expected results:
+ 
   * packages for zkpython
   * working web application
  
  === Cleanup and final fixes (starts: 9 august ends: 16 august) ===
+ Activities:
  
- Activities:
   * improve tests and documentation
  
  === Submit code to code.google.com : 30 August ===
- 
  == Related JIRA ==
   * [[https://issues.apache.org/jira/browse/ZOOKEEPER-701|ZOOKEEPER-701]]