Fissionlabs - senior ai/ml developer
Pune/HyderabadFISSION COMPUTER LABS PRIVATE LIMITED
...:- Hands-on experience with production inference engines : vLLM, TensorRT-LLM, DeepSpeed-Inference, or TGI.- Proficiency with serving frameworks : Triton Inference Server, KServe, or Ray Serve.- Familiarity with kernel optimization libraries (FlashAttention, xFormers).Performance Engineering :- Proven ability to optimize [...]
Category IT & Telecommunications