Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Xu, Yuancheng, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang.
Xu, Yuancheng, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang.
Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making
AdvML-Frontiers workshop, ICML 2023.
Yuancheng Xu, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang
Yuancheng Xu, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Zheng, Ruijie, Ching-An Cheng, Hal Daum ́e III, Furong Huang, Andrey Kolobov.
Zheng, Ruijie, Ching-An Cheng, Hal Daum ́e III, Furong Huang, Andrey Kolobov.
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Zheng, Ruijie, Yongyuan Liang, Xiyao Wang, Shuang Ma, Hal Daum ́e III, Huazhe Xu, John Langford, Praveen Palanisamy, Kalyan Shankar Basu, Furong Huang.
Zheng, Ruijie, Yongyuan Liang, Xiyao Wang, Shuang Ma, Hal Daum ́e III, Huazhe Xu, John Langford, Praveen Palanisamy, Kalyan Shankar Basu, Furong Huang.
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Ji, Tianying, Yongyuan Liang, Yan Zeng, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun, Huazhe Xu.
Ji, Tianying, Yongyuan Liang, Yan Zeng, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun, Huazhe Xu.
MaxMin-RLHF: Alignment with Diverse Human Preferences
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Chakraborty, Souradip, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Bedi, Mengdi Wang.
Chakraborty, Souradip, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Bedi, Mengdi Wang.
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination & Visual Illusion in Large Vision-Language Models
Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
Guan, Tianrui, Fuxiao Liu, Xiyang Wu, Ruiqi Xian, Zongxia Li, Xiaoyu Liu, Xijun Wang, Lichang Chen, Furong Huang, Yaser Yacoob, Dinesh Manocha, and Tianyi Zhou
Guan, Tianrui, Fuxiao Liu, Xiyang Wu, Ruiqi Xian, Zongxia Li, Xiaoyu Liu, Xijun Wang, Lichang Chen, Furong Huang, Yaser Yacoob, Dinesh Manocha, and Tianyi Zhou
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies
Spotlight. The Twelfth International Conference on Learning Representations (ICLR), 2024.
Liu, Xiangyu, Chenghao Deng, Yanchao Sun, Yongyuan Liang, and Furong Huang
Publisher's website
Liu, Xiangyu, Chenghao Deng, Yanchao Sun, Yongyuan Liang, and Furong Huang
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
The Twelfth International Conference on Learning Representations (ICLR), 2024.
Liu, Xiangyu, Souradip Chakraborty, Yanchao Sun, and Furong Huang
Publisher's website
Liu, Xiangyu, Souradip Chakraborty, Yanchao Sun, and Furong Huang
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning
The Twelfth International Conference on Learning Representations (ICLR), 2024.
Chakraborty, Souradip, Amrit Bedi, Alec Koppel, Huazheng Wang, Dinesh Manocha, Mengdi Wang, and Furong Huang
Publisher's website
Chakraborty, Souradip, Amrit Bedi, Alec Koppel, Huazheng Wang, Dinesh Manocha, Mengdi Wang, and Furong Huang
