Site reliability engineer - aws services
NoidaTRDFIN SUPPORT SERVICES PRIVATE LIMITED
...observability using Grafana and Prometheus- Ensure system reliability, performance, uptime, and scalability- Participate in incident response, root cause analysis (RCA), and post-incident reviews- Implement Infrastructure as Code (IaC) and automation best practices- Collaborate with development teams to [...]
Category IT & Telecommunications