Senior gpu optimisation engineer
BangaloreSilverpeople
What You'll Do :- Optimize model architectures (ASR, TTS, SLMs) for maximum performance on specific GPU hardware- Profile models end-to-end to identify GPU bottlenecks - memory bandwidth, kernel launch overhead, fusion opportunities, quantization constraints- Design and implement custom kernels (CUDA/Triton/Tinygrad) for performance-critical model [...]
Category IT & Telecommunications