MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

Year

2024

Type(s)

Conference articles

Author(s)

Souradip Chakraborty and Jiahao Qiu and Hui Yuan and Alec Koppel and Furong Huang and Dinesh Manocha and Amrit Bedi and Mengdi Wang

Source

In Oral, ICML 2024 Workshop on Models of Human Feedback for AI Alignment, ICML 2024, 2024

BibTeX

Furong Huang