Get in Touch

Course Outline

SRE Anti-Patterns

  • Identifying counterproductive practices
  • Recognizing the impact of anti-patterns on system reliability
  • Best practices and corrective alternatives

Using SLOs as a Proxy for Customer Satisfaction

  • Defining Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
  • Managing error budgets and balancing innovation with reliability
  • Understanding the limits of distributed systems

Building Secure and Reliable Systems

  • Designing for fault tolerance and resilience
  • Integrating security into reliability engineering practices
  • Scalability and data protection strategies

Comprehensive Observability Across the Stack

  • Instrumentation and metrics collection
  • Distributed tracing and synthetic monitoring
  • Observability-driven development approaches

Platform Engineering and AIOps

  • Platform-centered engineering methodologies
  • Automation and orchestration in Site Reliability Engineering
  • Leveraging DataOps and operational intelligence

Incident Management in SRE

  • Roles and responsibilities in incident response
  • Applying frameworks such as OODA (Observe, Orient, Decide, Act)
  • Automated remediation and AI/ML-assisted resolution techniques

Chaos Engineering

  • Principles and strategies for resilience testing
  • Planning and executing 'game day' exercises
  • Gaining insights from controlled failure experiments

SRE as the Purest Form of DevOps

  • Integrating SRE principles into DevOps workflows
  • Cultural alignment and collaborative practices
  • Driving organizational transformation through SRE adoption

Post-Class Exercises

  • Large-scale system design case studies
  • Advanced instrumentation and monitoring scenarios
  • Real-world reliability problem-solving tasks

Review and Exam Preparation

  • Final review of the DevOps Institute SRE Practitioner syllabus
  • Sample questions and practice tests
  • Exam-taking strategies and recommendations

Summary and Next Steps

Requirements

  • Understanding of foundational Site Reliability Engineering principles
  • Experience with DevOps practices and associated tools
  • Familiarity with system monitoring, incident management, and automation

Target Audience

  • SRE professionals pursuing the DevOps Institute SRE Practitioner certification
  • DevOps engineers looking to transition into reliability-focused roles
  • Operations leaders responsible for defining and executing reliability strategies
 35 Hours

Number of participants


Price per participant

Testimonials (2)

Provisional Upcoming Courses (Require 5+ participants)

Related Categories