Ai runtime engineer
BangaloreScaling Theory
...model lifecycle : training, checkpointing, fault recovery, packaging, and deployment.- Implement observability hooks, diagnostics, and resilience mechanisms for deep learning workloads.- Champion best practices in CI/CD, testing, and software quality across the AI runtime stack.Collaborate and Mentor :- Work cross-functionally [...]
Category IT & Telecommunications