Staff ai runtime engineer
BangaloreScaling Theory Technologies Pvt Ltd
...model lifecycle : training, checkpointing, fault recovery, packaging, and deployment.- Implement observability hooks, diagnostics, and resilience mechanisms for deep learning workloads.- Champion best practices in CI/CD, testing, and software quality across the AI Runtime stack.- Work cross-functionally with Research, [...]
Category IT & Telecommunications