Sale!
, ,

Site Reliability Engineering (SRE) Practitioner Live Online Course with PeopleCert Exam & PDF Study Material – Course code: SRE-Prac-LO

Original price was: $ 3,990.00 USD.Current price is: $ 1,995.00 USD.

  • Official Site Reliability Engineering Practitioner exam included, delivered through a web-proctored format

  • Student edition of Site Reliability Engineering Practitioner training materials provided, created and accredited by PeopleCert

  • Course completion certificate issued upon finishing the training

  • Live online training sessions conducted via Zoom or Microsoft Teams

  • Official PDF sample exam papers included

  • Certification training designed to fully prepare students for the DevOps Institute – Site Reliability Engineering (SRE) Practitioner exam

  • Online practice exams simulating the official sample papers

  • Access to instructors for any questions or support

  • PMI members receive a preapproved 16 PDU code for certification maintenance

  • Small class sizes guaranteed to promote interactive learning with trainers

  • Additional 25% extra time on PeopleCert exams for candidates whose native language differs from the exam language

  • The DevOps Institute – Site Reliability Engineering (SRE) Practitioner exam administered through PeopleCert’s Online Proctored Exam service

Private corporate training available for groups of 7 or more employees, offered live online or in-person at your facility or other venues (e.g., hotels, event centers) Flexible scheduling for corporate training sessions to fit your preferred dates and times — request a quote for details

Today’s organizations deal with a higher volume of change in a more complex tech environment leading to a higher risk of outages and incidents. IT teams must improve service reliability and system resiliency. With automation and observability becoming key factors for more efficient and rapid deployments, the SRE profile has become one of the fastest-growing enterprise roles and set of operational practices for managing services at scale.

To maintain the highest quality learning for our community, DevOps Institute Certifications expire two years from the date of completion. Members can maintain their certification by participating in the Continuing Education Program and earning Continuing Education Units through participation in learning opportunities.

Course Objectives

At the end of the course, the following learning objectives are expected to be
achieved:

  • A practical view of how to successfully implement a flourishing SRE culture in your
    organization.
  • The underlying principles of SRE and an understanding of what it is not in terms of
    anti-patterns, and how you become aware of them to avoid them.
  • The organizational impact of introducing SRE.
  • Acing the art of SLIs and SLOs in a distributed ecosystem and extending the usage of
    Error Budgets beyond the normal to innovate and avoid risks
  • Building security and resilience by design in a distributed, zero-trust environment.
  • How do you implement full stack observability, and distributed tracing and bring about
    an Observability-driven development culture?
  • Curating data using AI to move from reactive to proactive and predictive incident
    management. Also, how do you use DataOps to build clean data lineage?
  • Why is Platform Engineering so important in building consistency and predictability of
    SRE culture?
  • Implementing practical Chaos Engineering.
  • Major incident response responsibilities for an SRE based on incident command
    framework, and examples of the anatomy of unmanaged incidents.
  • The perspective of why SRE can be considered the purest implementation of DevOps.
  • SRE Execution model
  • Understanding the SRE role and understanding why reliability is everyone’s problem.
  • SRE success story learnings

Benefits

  • Implementing SRE and DevOps in the right way leads to higher Business Value
  • Enhanced stability and reliability of services
  • Major improvement of the product in the development, deployment, and operations life-cycle
  • The increased balance between technical investment in reliability and customer experience
  • Homogenous culture and greater synchronization between product, development, and operational teams Improvements in staff morale and retention
  • Higher understanding of the practical implementation of SRE culture
  • Designing services for higher security and reliability
  • Building fault-tolerant distributed ecosystems that can be tested for risks of disaster
  • Building observability and intelligence in operations
  • Broader skills-based capabilities that leverage the latest in automation
  • Higher understanding of other roles and contributing towards creating a better workplace culture

Course Outline

SRE Anti-Patterns

  • Common reliability pitfalls and how to avoid them.
  • Case study: Monzo Bank’s reliability failures and lessons learned.
  • Conducting blameless postmortems and retrospectives.

Service Level Objectives (SLOs) – The Proxy for Customer Happiness

  • Establishing SLOs, SLIs, and error budgets.
  • Case studies: Kudos Engineering and Home Depot’s SLO implementation.
  • Practical exercise: Obtaining service credits.

Full-Stack Observability

  • Implementing end-to-end monitoring, logging, and alerting.
  • Reducing false positives and alert fatigue.

Using Platform Engineering & AIOps

  • Leveraging automation and AI-driven operations to enhance system reliability.

SRE & Incident Response Management

  • Best practices for incident response and on-call management.
  • The role of incident command systems.

Chaos Engineering

  • Designing fault injection experiments to improve system resilience.
  • Case study: How Netflix uses chaos engineering.

SRE as a Form of DevOps

  • Bridging software engineering and system operations.
  • Implementing SRE culture and best practices in organisations.

Prerequisites

It is highly recommended that learners attend the SRE Foundation course with an accredited DevOps Institute Education Partner and earn the SRE Foundation certification prior to attending the SRE Practitioner course and exam. An understanding and knowledge of common SRE terminology, concepts, principles, and related work experience is recommended.

Certification Exam

Successfully passing (65%) the 90-minute examination, consisting of 40 multiple-choice questions leads to the SRE Practitioner certificate. The certification is governed and maintained by DevOps Institute.

Shopping Cart
Scroll to Top