Goal:
- Modernize IT Operations by adopting Agile Practices / SRE / DevOps
- Deliver superior client experience and minimize operational risk
Pillars:
- SLOs & Error Budgets
- Monitoring / Observability
- Reduce Toil
Goal:
Pillars:
Frontline support personnel are critical in managing firm risk. We can help if any of the below are priorities —
Superior and consistent client experience
Service Level Indicator (SLI) – A key aspect of the level of service
Service Level Objective (SLO) – Target level for reliability of service (usually expressed in nines)
Service Level Agreements (SLA) – Contracts with users along with consequences of not meeting SLOs
Error Budgets — Accepted level of unreliability
Defining Risk
Managing Risk
Big Picture
Summary:
Monitoring
Definition: Collecting, Processing, Aggregating, and Displaying real-time data about the system
Alerting on SLOs
Definition: Notification intended to be read by a human (what’s broken and why)