Fissionlabs - senior ai/ml developer
Pune/HyderabadFISSION COMPUTER LABS PRIVATE LIMITED
...optimization libraries (FlashAttention, xFormers).Performance Engineering :- Proven ability to optimize inference metrics : TTFT (first token latency), ITL (inter-token latency), and throughput.- Experience profiling and resolving GPU memory bottlenecks and OOM issues.- Knowledge of hardware-specific optimizations for modern [...]
Category IT & Telecommunications