Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024.
Ding, Mucong, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, Anima Anandkumar, and Furong Huang.
Publisher's website
BibTeX Ding, Mucong, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, Anima Anandkumar, and Furong Huang.
BibTeX
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Xu, Yuancheng, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, and Furong Huang.
Publisher's website
BibTeX Xu, Yuancheng, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, and Furong Huang.
BibTeX
Transfer Q-star: Principled Decoding for LLM Alignment
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS),
2024.
Chakraborty, Souradip, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Bedi, and Furong Huang.
Publisher's website
BibTeX Chakraborty, Souradip, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Bedi, and Furong Huang.
BibTeX
ACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Bornstein, Marco, Amrit Bedi, Abdirisak Mohamed, and Furong Huang.
Publisher's website
BibTeX Bornstein, Marco, Amrit Bedi, Abdirisak Mohamed, and Furong Huang.
BibTeX
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Liang, Yongyuan, Tingqiang Xu, Kaizhe Hu, Guangqi Jiang, Furong Huang, and Huazhe Xu.
Publisher's website
BibTeX Liang, Yongyuan, Tingqiang Xu, Kaizhe Hu, Guangqi Jiang, Furong Huang, and Huazhe Xu.
BibTeX
Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
McClellan, Joshua, Naveed Haghani, John Winder, Furong Huang, and Pratap Tokekar.
Publisher's website
BibTeX McClellan, Joshua, Naveed Haghani, John Winder, Furong Huang, and Pratap Tokekar.
BibTeX
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Zhou, Yuhang, Jing Zhu, Paiheng Xu, Xiaoyu Liu, Xiyao Wang, Danai Koutra, Wei Ai, and Furong Huang.
Publisher's website
BibTeX Zhou, Yuhang, Jing Zhu, Paiheng Xu, Xiaoyu Liu, Xiyao Wang, Danai Koutra, Wei Ai, and Furong Huang.
BibTeX
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Wu, Xiyang, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, and Dinesh Manocha.
Publisher's website
BibTeX Wu, Xiyang, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, and Dinesh Manocha.
BibTeX
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models
First Conference on Language Modeling (COLM), 2024.
Zhu, Sicheng, Ruiyi Zhang, Bang An, Gang Wu, Joe Barrow, Zichao Wang, Furong Huang, Ani Nenkova, and Tong Sun.
BibTeX Zhu, Sicheng, Ruiyi Zhang, Bang An, Gang Wu, Joe Barrow, Zichao Wang, Furong Huang, Ani Nenkova, and Tong Sun.
BibTeX
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models
First Conference on Language Modeling (COLM), 2024.
Zhu, Sicheng, Bang An, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, and Furong Huang.
BibTeX Zhu, Sicheng, Bang An, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, and Furong Huang.