AI Runtime Lead - LLM/DevOps
Bangalore - Worksconsultancy
...runtime architecture supporting AI training and inference at scale.
- Design resilient and elastic runtime features (e.g. dynamic node scaling, job recovery) within our custom PyTorch stack.
- Optimize distributed training reliability, orchestration, and job-level [...]
Category: Fashion & Arts