Senior c++ developer - ai inference engine
BangaloreK S A INC
...implement high-performance AI model architectures using SIMD intrinsics (AVX2/AVX-512) and processor-specific optimizations- Build reusable components for the AI Model Library - GEMM kernels, operator fusion, cache-optimized inference pipelines- Profile & optimize cache hierarchy, NUMA-aware memory allocation, and CPU-based [...]
Category Manufacturing & Production