Posted to common-commits@hadoop.apache.org by Apache Wiki <wi...@apache.org> on 2010/05/07 01:48:20 UTC

[Hadoop Wiki] Update of "ZooKeeper/GSoCMonitoringAndWebInterface" by AndreiSavu

The "ZooKeeper/GSoCMonitoringAndWebInterface" page has been changed by AndreiSavu.
The comment on this change is: added milestones and abstract.
http://wiki.apache.org/hadoop/ZooKeeper/GSoCMonitoringAndWebInterface

--------------------------------------------------

New page:
== GSoC 2010: ZooKeeper Monitoring Recipes and Web-based Administrative Interface ==

 * Student: Andrei Savu (savu.andrei at gmail dot com)
 * Assigned mentor: Patrick Hunt (phunt at apache dot org) 

== Abstract ==

ZooKeeper is a complex distributed system, and understanding how well it is running is essential to operating it. Patrick Hunt has created a [[http://github.com/phunt/zookeeper_dashboard|Django-based dashboard]] that gives some insight into how ZooKeeper is running. This is the foundation I'm going to build on. The project will capture much more information from ZooKeeper, adding hooks to retrieve it where necessary, and visualize it in an appealing and useful way. I'm also going to provide a set of monitoring recipes for systems such as Ganglia, Nagios and Cacti.

== Milestones ==

=== Community Bonding (starts: 26 April ends: 24 May) ===

Activities:
 * read the mailing list archives
 * read the source code
 * discuss monitoring and administration requirements and production stories with community members
 * discuss specific monitoring requirements with the Adobe Hadoop / HBase team

Expected results:
 * understand source code and the known bugs
 * understand how the software is used in production
 * understand monitoring requirements
 * understand debugging requirements 
 * set up a development environment

=== Monitoring and Data Collection (starts: 24 May ends: 20 June) ===

Activities:
 * deploy a small-scale (multi-node) cluster for development (virtual machines)
 * identify important health signals and add hooks (if needed) for real-time data collection
 * create scripts / plugins for cluster monitoring using Cacti, Ganglia, Nagios, SNMP
 * document the install procedures for the scripts
 * collaborate with the Adobe Hadoop / HBase team and deploy the monitoring scripts in production

Expected results:
 * production-ready scripts / plugins for monitoring
 * install guides that are easy to understand and follow
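
As a rough illustration of the kind of monitoring script this milestone describes, the sketch below reads a server's `stat` four-letter-word reply and turns one health signal into a Nagios-style status. It is not part of the project's code: the function names, the choice of "Outstanding" as the signal, and the warning/critical thresholds are all assumptions made for the example, and the sample reply is illustrative rather than captured from a real server.

```python
import socket

# Nagios plugin exit codes (standard plugin convention).
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# Illustrative `stat` reply; real output varies between ZooKeeper versions.
SAMPLE_STAT = """Zookeeper version: 3.3.0-925362, built on 03/19/2010 18:38 GMT
Clients:
 /127.0.0.1:54321[1](queued=0,recved=1,sent=0)

Latency min/avg/max: 0/0/0
Received: 10
Sent: 9
Outstanding: 0
Zxid: 0x100000002
Mode: follower
Node count: 4
"""


def four_letter_word(host, port, command=b"stat", timeout=5.0):
    """Send a four-letter command to a ZooKeeper server and return the reply."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(command)
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:  # the server closes the connection after replying
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")


def parse_stat(text):
    """Extract a few health signals from `stat` output into a dict."""
    stats = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if not sep or not value:
            continue  # skip blank lines and section headers such as "Clients:"
        if key == "Latency min/avg/max":
            parts = value.split("/")
            if len(parts) == 3:
                names = ("latency_min", "latency_avg", "latency_max")
                stats.update(zip(names, (float(p) for p in parts)))
        elif key in ("Received", "Sent", "Outstanding", "Node count"):
            stats[key.lower().replace(" ", "_")] = int(value)
        elif key == "Mode":
            stats["mode"] = value
    return stats


def check_outstanding(stats, warn=10, crit=100):
    """Map the outstanding-request count to (exit_code, message).

    The thresholds are hypothetical defaults, not recommended values.
    """
    if "outstanding" not in stats:
        return UNKNOWN, "UNKNOWN - no 'Outstanding' line in stat output"
    count = stats["outstanding"]
    if count >= crit:
        return CRITICAL, "CRITICAL - %d outstanding requests" % count
    if count >= warn:
        return WARNING, "WARNING - %d outstanding requests" % count
    return OK, "OK - %d outstanding requests" % count
```

A real plugin would call `four_letter_word("localhost", 2181)` (2181 being the default client port), pass the reply through `parse_stat`, print the message from `check_outstanding`, and exit with the returned code so that Nagios can pick up the status.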


=== Web Application (starts: 20 June ends: 9 August) ===

Activities:
 * package zkpython bindings (distutils, .deb, .rpm)
 * simple authentication and a custom authentication backend based on ZooKeeper
 * interface and server-side code for server list configuration
 * view server, environment and connection info (most of this code already works)
 * JavaScript-based znode hierarchy browser
 * simple in-browser event monitor
 * deploy on production or development cluster at Adobe (if possible)
 
Expected results:
 * packages for zkpython
 * working web application

=== Cleanup and final fixes (starts: 9 August ends: 16 August) ===

Activities:
 * improve tests and documentation

=== Submit code to code.google.com: 30 August ===

== Related JIRA ==
 * [[https://issues.apache.org/jira/browse/ZOOKEEPER-701|ZOOKEEPER-701]]