MaxMin-RLHF: Alignment with Diverse Human Preferences

Year
2024
Type(s)
Author(s)
Chakraborty, Souradip, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Bedi, Mengdi Wang.
Source
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
BibTeX
BibTeX