Principal site reliability engineer - observability services
Hyderabad/PuneIntraedge Technologies Ltd
...architectural decisions to ensure systems are resilient, observable, and fault-tolerant.Operational Excellence : - Champion operational excellence by driving improvements in monitoring, alerting, incident response, and capacity planning.- Establish and track SLIs, SLOs, and error budgets to balance reliability with feature [...]
Category IT & Telecommunications