Fissionlabs - senior ai/ml developer
Pune/HyderabadFISSION COMPUTER LABS PRIVATE LIMITED
...ability to optimize inference metrics : TTFT (first token latency), ITL (inter-token latency), and throughput.- Experience profiling and resolving GPU memory bottlenecks and OOM issues.- Knowledge of hardware-specific optimizations for modern GPU architectures (A100/H100).Fine tuning :- Drive end-to-end fine-tuning of LLMs, [...]
Category IT & Telecommunications