Senior C++ Developer - AI Inference Engine
Bangalore
K S A INC
...processor-specific optimizations
- Build reusable components for the AI Model Library: GEMM kernels, operator fusion, cache-optimized inference pipelines
- Profile and optimize cache hierarchy, NUMA-aware memory allocation, and CPU-based inferencing for sub-ms latency
- Write production-grade Modern C++ (C++17/20) with OpenMP [...]
Category: Manufacturing & Production