Lead/Staff AI Runtime Engineer - LLM/PyTorch
Bangalore, Talent Pro
...and inference pipelines.
- Improve packaging, deployment, and integration of customer models in production environments.
- Ensure consistent throughput, latency, and reliability metrics across multi-node, multi-GPU setups.

Build Internal Tooling & Frameworks:
- Design and maintain libraries and services that support model [...]
Category: IT & Telecommunications