Staff AI Runtime Engineer
Bangalore
Scaling Theory Technologies Pvt Ltd
...and inference pipelines.
- Improve packaging, deployment, and integration of customer models in production environments.
- Ensure consistent throughput, latency, and reliability metrics across multi-node, multi-GPU setups.
- Design and maintain libraries and services that support the model lifecycle: training, checkpointing, [...]
Category: IT & Telecommunications