Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024.
Ding, Mucong, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, Anima Anandkumar, and Furong Huang.
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Xu, Yuancheng, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, and Furong Huang.
Transfer Q-star: Principled Decoding for LLM Alignment
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Chakraborty, Souradip, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Bedi, and Furong Huang.
FACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Bornstein, Marco, Amrit Bedi, Abdirisak Mohamed, and Furong Huang.
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Liang, Yongyuan, Tingqiang Xu, Kaizhe Hu, Guangqi Jiang, Furong Huang, and Huazhe Xu.
Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
McClellan, Joshua, Naveed Haghani, John Winder, Furong Huang, and Pratap Tokekar.
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Zhou, Yuhang, Jing Zhu, Paiheng Xu, Xiaoyu Liu, Xiyao Wang, Danai Koutra, Wei Ai, and Furong Huang.
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Wu, Xiyang, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, and Dinesh Manocha.
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models
First Conference on Language Modeling (COLM), 2024.
Zhu, Sicheng, Ruiyi Zhang, Bang An, Gang Wu, Joe Barrow, Zichao Wang, Furong Huang, Ani Nenkova, and Tong Sun.
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models
First Conference on Language Modeling (COLM), 2024.
Zhu, Sicheng, Bang An, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, and Furong Huang.