Gpu optimization engineer - python
BangaloreHirelo
...models for real-world inference performance.Youll work across CUDA kernels, model graph optimizations, hardware-specific tuning, and porting models across GPU architectures.Your work directly impacts the latency, throughput, and reliability of smallests real-time speech models.What Youll Do : - Optimize model architectures [...]
Category IT & Telecommunications