Aiml engineer
BangaloreTally Solutions Private Limited
...larger teacher models for cost-effective deployment. Fine-tuning: Experience with PEFT, LoRA, and QLoRA for domain adaptation. High-Performance Inference Serving Engines: Experience optimizing inference throughput using high-performance serving frameworks such as vLLM or SGLang. Latency Engineering: Ability to debug and [...]
Category IT & Telecommunications