Fissionlabs - Senior AI/ML Developer
Pune/Hyderabad
FISSION COMPUTER LABS PRIVATE LIMITED
...with kernel optimization libraries (FlashAttention, xFormers).

Performance Engineering:
- Proven ability to optimize inference metrics: TTFT (time to first token), ITL (inter-token latency), and throughput.
- Experience profiling and resolving GPU memory bottlenecks and OOM (out-of-memory) issues.
- Knowledge of hardware-specific optimizations [...]
Category: IT & Telecommunications