MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

Year
2024
Type(s)
Author(s)
Souradip Chakraborty and Jiahao Qiu and Hui Yuan and Alec Koppel and Furong Huang and Dinesh Manocha and Amrit Bedi and Mengdi Wang
Source
In Oral, ICML 2024 Workshop on Models of Human Feedback for AI Alignment, ICML 2024, 2024
BibTeX
BibTeX