Founding ai engineer
BangaloreRecro
...classification for agent routingBuild RL-based systems for multi-step action planningDevelop evaluation models for agent output qualityCreate meta-learning pipelines for continuous improvementHandle conflicting agent recommendations with trained arbitration modelsTech Stack: PyTorch, Ray for distributed training, custom RL [...]
Category IT & Telecommunications