Course description
SRE Infrastructure, Resiliency and Deployment Automation
Site Reliability Engineers must have the right tools and strategies to perform in a fast-paced technical environment. Nine competency areas guide the successful practice of IBM Cloud SREs.
- Applying Site Reliability Engineering principles
- Operations
- Monitoring and incident management
- Security and compliance
- Compute infrastructure
- Networking
- Storage and data management
- Reliability and resiliency
- Deployment automation
In this second course of the three-part Professional Certificate in Site Reliability Engineering (SRE), you will focus on the following five SRE competencies:
- Compute infrastructure
- Networking
- Storage and data management
- Reliability and resiliency
- Deployment automation
Upcoming start dates
1 start date available
Suitability - Who should attend?
Prerequisites
At least 1 year experience in SRE or technology.
Understanding of:
- DevOps practices
- Software engineering principles
- System administration
- Network and OSI model
- Incident management
- Root cause analysis
Training Course Content
Compute Infrastructure
You will cover the following topics:
- IBM Cloud service models: IaaS, PaaS, and FaaS
- Troubleshooting VMs on IBM Cloud
- Troubleshooting clusters on IBM Kubernetes Service
- Troubleshooting clusters on Red Hat OpenShift on IBM Cloud
- Troubleshooting serverless services
Networking
You will cover the following topics:
- Applying IBM Cloud networking features
- Implementing and managing virtual networks on IBM Cloud
- Configuring name resolution on IBM Cloud
- Managing performance on IBM Cloud
- Troubleshooting external connections on IBM Cloud
- Troubleshooting interservice connectivity on IBM Cloud
Storage and data management
You will cover the following topics:
- Managing storage and data attributes
- Managing storage accounts
- Managing data on IBM Cloud
- Managing data replication and retention
Reliability and resiliency
You will cover the following topics:
- Importance of reliability and resiliency for services
- Designing and improving Reliability for systems and services
- Designing for failure and recovering from failure
Module 5: Deployment automation
You will cover the following topics:
- Deployment automation
- Implement Infrastructure as Code
- SRE responsibilities to CI/CD pipeline
Course delivery details
This course is offered through IBM, a partner institute of EdX.
2-3 hours per week
Expenses
- Verified Track -$99
- Audit Track - Free
Ads