Final Assignment – Nikhil
What is SRE? – 3-4 Lines
- Site Reliability Engineer, takes care of the platform/site reliability and stability.
- Bridges the gap between development and operations.
- Manages risk.
- Leverages automation to reduce toil.
- Engineering approach to operations.
5 Action items to perform to transform from Ops to SRE.
- Work closely with product/development team.
- Break down the silos between product and ops.
- Automate as much as possible to reduce the toil.
- Incorporate engineering approach to operations.
- Being proactive rather than reactive.
- Set sli’s, slo’s, sla’s.
- Set Error budjets.
- Incident Management.
Top 10 SRE tools.
- Docker swarm
- Gitlab CI/CD