Platform architect
MeerutMichael Page
...strategies that ensure high availability and fault tolerance for enterprise-grade workloads.4. Monitoring, Logging & Incident Management Observability: Implement and maintain the ELK stack, Prometheus, and Grafana to provide real-time visibility into system health. The Reliability Anchor: Lead incident response and [...]
Category Engineering & Architecture / Sector IT, Information Technology and Telecommunications
9 days ago in MichaelPage