Papers tagged “rlhf”
2 papers
Training Language Models to Follow Instructions with Human Feedback
2022
NeurIPS
Deep Reinforcement Learning from Human Preferences
2017
NeurIPS