Systems engineer, site reliability engineering
BangaloreGoogle
...distributed, and fault tolerant systems used by Google products. 4. Monitor live services by tracking availability, latency, capacity, and overall system health metrics. 5. Reduce operational toil by improving automation, reliability, and system efficiency. 6. Respond to incidents and ensure services meet defined Service Level [...]
Category Office & Administration