Site Reliability Engineering

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems to create scalable and reliable software systems. Coursera's SRE catalogue equips you with the principles of SRE, including service level objectives, error budgets, and automation. You'll learn about the design, deployment, and maintenance of large-scale, efficient, and reliable software systems. By understanding incident management, disaster recovery, and creating monitoring systems, you can enhance system reliability and efficiency, making you valuable to any company that relies on robust software infrastructure.
10credentials
28courses

Most popular

Trending now

New releases

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Results for "site reliability engineering"

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Site Reliability Engineering, Digital Transformation, Real Time Data, Cloud Infrastructure, Data Strategy, Google Cloud Platform, Cloud Services, Cost Management, Data Governance, Cloud Applications, Cloud Computing, Hybrid Cloud Computing, Data Transformation, Application Programming Interface (API), Business Transformation, Cloud Platforms, Budget Management, Public Cloud, Technology Strategies

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Site Reliability Engineering, Google Cloud Platform, Cost Management, Cloud Computing, Cloud Infrastructure, Budget Management, Capacity Management, Operational Excellence, Corporate Sustainability, Resource Management, Resource Allocation, Sustainability Reporting, Identity and Access Management, Client Support, Disaster Recovery

  • Status: Preview

    Skills you'll gain: Site Reliability Engineering, Incident Management, System Monitoring, Network Monitoring, Prometheus (Software), Google Cloud Platform, Application Performance Management, Continuous Monitoring, Kubernetes, Cloud Applications, Security Information and Event Management (SIEM), Service Level, Firewall, Cloud Computing, Debugging

  • Status: Free Trial

    Skills you'll gain: Application Deployment, Cloud Infrastructure, CI/CD, Cloud Computing Architecture, Cloud Security, Microservices, Service Level Agreement, Kubernetes, Site Reliability Engineering, Google Cloud Platform, Cloud Storage, Key Performance Indicators (KPIs), Network Architecture, Restful API, API Design, Systems Architecture, Scalability, Load Balancing, System Monitoring, Disaster Recovery

  • Status: Free Trial

    Skills you'll gain: Site Reliability Engineering, Google Cloud Platform, Google App Engine, Identity and Access Management, Cloud Infrastructure, Microservices, Service Level, Load Balancing, Software Design Patterns, Kubernetes, Platform As A Service (PaaS), Firewall, Infrastructure As A Service (IaaS), Public Cloud, CI/CD, Managed Services, Infrastructure as Code (IaC), Cloud Services, Cloud Computing Architecture, Virtual Machines

  • Status: Free Trial

    Skills you'll gain: Site Reliability Engineering, Docker (Software), Containerization, Kubernetes, Virtualization, Devops Tools, Microservices, Application Deployment, Virtual Machines, Cloud Development, Database Management, GitHub, Cloud-Based Integration, Scalability

  • Status: Free Trial

    Skills you'll gain: Application Deployment, Microservices, Kubernetes, Google Cloud Platform, Cloud Computing Architecture, CI/CD, Cloud Security, Site Reliability Engineering, Service Level Agreement, Restful API, Network Architecture, DevOps, Key Performance Indicators (KPIs), Cloud Storage, API Design, Systems Design, Performance Metric, Application Design, Load Balancing, Scalability

  • Skills you'll gain: Site Reliability Engineering, DevOps, Data-Driven Decision-Making, Continuous Delivery, Safety Culture, Change Management, Culture Transformation, Organizational Change, Service Level, Continuous Integration, Continuous Improvement Process, CI/CD, Cross-Functional Collaboration, Automation

  • Status: Preview

    Skills you'll gain: Amazon Web Services, Cloud Engineering, Site Reliability Engineering, Amazon Elastic Compute Cloud, Amazon CloudWatch, Cloud-Native Computing, Kubernetes, Systems Engineering, System Monitoring, Serverless Computing, Scalability, Application Performance Management, Performance Testing, Scenario Testing

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Cloud Solutions, Google Cloud Platform, Cloud Computing, DevOps, Operational Excellence, Cloud Infrastructure, Cost Management, Site Reliability Engineering, Operational Efficiency, Scalability, Corporate Sustainability, System Monitoring, Customer Support

  • Status: Free Trial

    Skills you'll gain: Cloud Security, Cloud Management, Site Reliability Engineering, Cost Management, Cloud Computing, Google Cloud Platform, DevOps, IT Security Architecture, Data Security, Multi-Tenant Cloud Environments, Financial Controls, System Monitoring, Cybersecurity, Identity and Access Management

  • Status: Free

    Skills you'll gain: Load Balancing, Kubernetes, Site Reliability Engineering, Scalability, Application Deployment, Disaster Recovery, Containerization, YAML, Servers, System Monitoring

What brings you to Coursera today?

Leading partners

  • Google Cloud
  • Duke University
  • LearnKartS
  • Amazon Web Services
  • KodeKloud
  • Packt
  • Pearson
  • Starweaver