Site Reliability Engineering

Site Reliability Engineering (SRE) is a discipline that applies software engineering practices to infrastructure and operations problems in order to build scalable, reliable software systems. Coursera's SRE catalogue covers the core principles of SRE, including service level objectives, error budgets, and automation. You'll learn how to design, deploy, and maintain large-scale, efficient, and reliable software systems. Skills in incident management, disaster recovery, and monitoring help you improve system reliability and efficiency, which is valuable to any company that relies on robust software infrastructure.
10 credentials
28 courses

Results for "site reliability engineering"

  • Skills you'll gain: Site Reliability Engineering, DevOps, Data-Driven Decision-Making, Continuous Delivery, Safety Culture, Change Management, Culture Transformation, Organizational Change, Service Level, Continuous Integration, Continuous Improvement Process, CI/CD, Cross-Functional Collaboration, Automation

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Google Cloud Platform, Cloud Computing, Cost Management, Cloud Infrastructure, DevOps, Scalability, Operational Efficiency, Resource Allocation, Operational Excellence, Site Reliability Engineering, Identity and Access Management, Corporate Sustainability, Disaster Recovery, Customer Support

  • Status: Preview

    Skills you'll gain: Site Reliability Engineering, Service Level, DevOps, Continuous Delivery, Safety Culture, Data-Driven Decision-Making, Culture Transformation, Continuous Integration, Performance Measurement, Cross-Functional Collaboration, Organizational Change, Incident Management, Collaboration, Automation, Change Management, Communication

  • Skills you'll gain: Site Reliability Engineering, Safety Culture, Culture Transformation, Continuous Delivery, DevOps, Service Level, Continuous Integration, Performance Measurement, Performance Metric, Change Management, Design Thinking, Automation, Data-Driven Decision-Making, Prototyping

  • Status: Preview

    Skills you'll gain: Site Reliability Engineering, Incident Management, System Monitoring, Network Monitoring, Prometheus (Software), Google Cloud Platform, Application Performance Management, Continuous Monitoring, Kubernetes, Cloud Applications, Security Information and Event Management (SIEM), Service Level, Firewall, Cloud Computing, Debugging

  • Skills you'll gain: Cloud Security, Cloud Computing Architecture, Network Planning And Design, Cloud Infrastructure, Google Cloud Platform, Cloud Solutions, Cloud Standards, Solution Architecture, Cloud Computing, Process Analysis, IT Infrastructure, Infrastructure Architecture, Data Infrastructure, Process Optimization, Cloud Platforms, Continuous Deployment, Site Reliability Engineering, Key Performance Indicators (KPIs), Cost Reduction, Scalability

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Site Reliability Engineering, Google Cloud Platform, Cost Management, Cloud Computing, Cloud Infrastructure, Budget Management, Capacity Management, Operational Excellence, Corporate Sustainability, Resource Management, Resource Allocation, Sustainability Reporting, Identity and Access Management, Client Support, Disaster Recovery

  • Skills you'll gain: Cloud Computing Architecture, Amazon Web Services, Cloud Security, Operational Excellence, Reliability, Solution Architecture, Corporate Sustainability, Performance Tuning, Operational Efficiency, Cost Reduction, Security Strategy, Operational Analysis, System Requirements, Site Reliability Engineering, Scalability, Cost Management, Disaster Recovery, Interviewing Skills

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Google Cloud Platform, Cost Management, Site Reliability Engineering, Cloud Computing, DevOps, Cloud Infrastructure, Operational Excellence, Operational Efficiency, Scalability, Sustainability Reporting, Customer Support, System Monitoring

  • Status: Free

    Skills you'll gain: Load Balancing, Kubernetes, Site Reliability Engineering, Scalability, Application Deployment, Disaster Recovery, Containerization, YAML, Servers, System Monitoring

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Site Reliability Engineering, Cloud Infrastructure, Google Cloud Platform, Public Cloud, Cloud Computing, DevOps, Budget Management, Scalability, Cost Management, Operational Excellence, Operational Efficiency, Corporate Sustainability, Sustainable Business, Customer Support, Disaster Recovery

  • Status: Free Trial

    Skills you'll gain: Site Reliability Engineering, Google Cloud Platform, Google App Engine, Identity and Access Management, Cloud Infrastructure, Microservices, Service Level, Load Balancing, Software Design Patterns, Kubernetes, Platform As A Service (PaaS), Firewall, Infrastructure As A Service (IaaS), Public Cloud, CI/CD, Managed Services, Infrastructure as Code (IaC), Cloud Services, Cloud Computing Architecture, Virtual Machines

Leading partners

  • Google Cloud
  • Duke University
  • LearnKartS
  • Amazon Web Services
  • KodeKloud
  • Packt
  • Pearson
  • Starweaver