Product engineer - ai infrastructure
GandhinagarKatonic AI
...Multi-tenant GPU allocation across Kubernetes clusters Auto-scaling: Handle traffic spikes without manual intervention Guardrails: Safety, compliance, and quality enforcement at inference time Your Responsibilities Learn to deploy and test LLM serving infrastructure (v LLM, SGLang, NIM) Test fine-tuning pipelines - Lo RA, QLo [...]
Category IT & Telecommunications