Senior ai platform engineer — llm, agentic systems & production mlops
New DelhiAIBound
...strategies (HPA, GPU-aware scaling, traffic-based scaling)Implement CI/CD pipelines for models and services with canary and rollback strategiesSet up monitoring, logging, and alerting for latency, errors, throughput, GPU utilization, and costDrive security and cost optimization: IAM, secrets management, network policies, [...]
Category IT & Telecommunications