Site reliability engineer - devops
Hyderabad/AhmedabadLogicloop Ventures Limited
...- drive root cause analysis (RCA) and postmortems.- Create and maintain runbooks and standard operating procedures for high availability services.- Design and implement observability frameworks using ELK, Prometheus, and Grafana; drive telemetry adoption.- Coordinate cross-functional war-room sessions during major incidents and [...]
Category IT & Telecommunications