Software Development Engineer II, AGI Data Services
Hyderabad, ADCI HYD 13 SEZ
...SQL queries, summarizing legal documents) to teach the model desired behaviors; b) Reinforcement Learning from Human Feedback (RLHF): aligning models using human feedback (ranking, rating, or correcting model outputs) for safety, accuracy, and helpfulness; c) Evaluations: quality assessments to identify performance gaps [...]
Category: IT & Telecommunications