Ai/ml engineer - generative ai
Hyderabad/Bangalore/MumbaiPinnacle Search Services
...LoRA, for cost-effective domain adaptation.- Optimize high-speed inference pipelines leveraging multi-GPU clusters (up to 8x H100s) to reduce latency and improve throughput.3. Multi-Agent Systems & Orchestration :- Create multi-agent systems & Implement orchestration patterns like supervisor-agent, hierarchical, and networked [...]
Category IT & Telecommunications