Software Development Engineer II, AGI Data Services
Hyderabad, ADCI HYD 13 SEZ
...a) Supervised Fine-Tuning (SFT): creating high-quality, task-specific examples (e.g., writing SQL queries, summarizing legal documents) to teach the model desired behaviors; b) Reinforcement Learning from Human Feedback (RLHF): aligning models using human feedback (ranking, rating, or correcting model outputs) based on [...]
Category: IT & Telecommunications