Fissionlabs - Senior AI/ML Developer
Pune/Hyderabad
FISSION COMPUTER LABS PRIVATE LIMITED
...Deep understanding of hardware architectures for AI workloads (NVIDIA, AMD, Intel Habana, TPU).
LLM Inference Optimization:
- Expert knowledge of inference optimization techniques including speculative decoding, KV cache optimization (MQA/GQA/PagedAttention), and dynamic batching.
- Deep understanding of prefill vs decode phases, [...]
Category: IT & Telecommunications