Software Development Engineer II, AGI Data Services
Hyderabad, ADCI HYD 13 SEZ
...writing SQL queries, summarizing legal documents) to teach the model desired behaviors; b) Reinforcement Learning from Human Feedback (RLHF): aligning models using human feedback (ranking, rating, or correcting model outputs) on safety, accuracy, and helpfulness; c) Evaluations: quality assessments to identify [...]
Category: IT & Telecommunications