Senior C++ Developer - AI Inference Engine
Bangalore
K S A INC
...for the AI Model Library - GEMM kernels, operator fusion, cache-optimized inference pipelines
- Profile & optimize cache hierarchy, NUMA-aware memory allocation, and CPU-based inferencing for sub-ms latency
- Write production-grade Modern C++ (C++17/20) with OpenMP parallelization and HPC best practices
- Conduct rigorous code [...]
Category: Manufacturing & Production