Ai runtime lead - llm/devops
BangaloreWorksconsultancy
...and inference pipelines.- Improve packaging, deployment, and integration of customer models in production environments.- Ensure consistent throughput, latency, and reliability metrics across multi-node, multi- GPU setups.Build Internal Tooling & Frameworks :- Design and maintain libraries and services that support model [...]
Category Fashion & Arts