Senior GPU Optimisation Engineer
Bangalore | Silverpeople
...hierarchy, occupancy tuning
- Hands-on experience with CUDA, kernel writing, and kernel-level debugging
- Experience with kernel fusion and model graph optimizations
- Familiarity with TensorRT, ONNX, Triton, tinygrad, or similar inference engines
- Strong proficiency in PyTorch and Python
- Deep understanding of [...]
Category: IT & Telecommunications