You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@cloudstack.apache.org by "sebastien goasguen (JIRA)" <ji...@apache.org> on 2014/07/14 15:56:04 UTC

[jira] [Resolved] (CLOUDSTACK-1919) Runbooks

     [ https://issues.apache.org/jira/browse/CLOUDSTACK-1919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

sebastien goasguen resolved CLOUDSTACK-1919.
--------------------------------------------

    Resolution: Fixed

Closing this as too old and too generic

> Runbooks
> --------
>
>                 Key: CLOUDSTACK-1919
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-1919
>             Project: CloudStack
>          Issue Type: Improvement
>      Security Level: Public(Anyone can view this level - this is the default.) 
>          Components: Doc
>            Reporter: Jessica Tomechak
>
> The RS [RightScale] runbooks are good stuff - we should seriously consider producing CS-specific content like that. (Kevin Kluge)
> On 03/02/2012 01:27 PM, Chiradeep Vittal wrote:
> > Those are useful /tools. /I am more interested in the /content. / 
> > Running a cloud is a combination of operating cloudstack (stop / start 
> > / add host / delete host/ devices / storage) + operating the storage + 
> > operating the network + operating the hypervisor + ancilliary items 
> > like the SQL server database. It is clear for instance that the 
> > requisite checks were not done at [customer] before adding hosts to the 
> > cluster (check CPU level, driver patch levels, firmware upgrades), nor 
> > were they monitoring cloudstack for warnings about filling up storage 
> > or monitoring the XS hotfix mailblast. The Run Book for the cloud will 
> > contain content like this and solutions for when they receive alerts 
> > about storage filling up or how to recover corrupt vhds, how to 
> > periodically back up primary storage, etc. How to transfer VMs between 
> > failure domains when a particular failure domain has failed. 
> > References to other runbooks such as How to backup and restore MySQL. 
> > Monitor CS server and the underlying hardware with Nagios etc. Host 
> > maintenance procedures, storage maintenance procedures.
> > 
> > In addition, there needs to be a reference architecture for deploying 
> > a cloud (define your failure domains, plan for capacity based on 
> > service offerings, calculate IOPs requirements, network bandwidth, 
> > switch capacity, core router capacity, ip address planning – public and private).
> > 
> > Finally, [before a user sets up a cloud, they should evaluate whether they are ready]. Do they have change 
> > management procedures? Do they have a CMDB? Do they have document 
> > problem management procedures? Let me just throw ITIL in there.
> > 
> I agree - honestly the complexity of what IaaS is incredible.
> I'd go a step further in [the user] evaluation and say:
> * Do they have config management
> * Do they have automated provisioning (esp for hypervisors) - can they get a new hypervisor up in EXACTLY the same configuration as the last one, down to network bonding without any manual intervention?
> * Do they have a monitoring system in place - and is it or can it be made capable of monitoring cloudstack. [Need to define the baseline of what the user should be monitoring]



--
This message was sent by Atlassian JIRA
(v6.2#6252)