Publication Type: Conference proceedings

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024.
Ding, Mucong, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, Anima Anandkumar, and Furong Huang.

Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Xu, Yuancheng, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, and Furong Huang.

Transfer Q-star: Principled Decoding for LLM Alignment

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Chakraborty, Souradip, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Bedi, and Furong Huang.

ACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Bornstein, Marco, Amrit Bedi, Abdirisak Mohamed, and Furong Huang.

Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Liang, Yongyuan, Tingqiang Xu, Kaizhe Hu, Guangqi Jiang, Furong Huang, and Huazhe Xu.

Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
McClellan, Joshua, Naveed Haghani, John Winder, Furong Huang, and Pratap Tokekar.

Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation

The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Zhou, Yuhang, Jing Zhu, Paiheng Xu, Xiaoyu Liu, Xiyao Wang, Danai Koutra, Wei Ai, and Furong Huang.

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Wu, Xiyang, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, and Dinesh Manocha.

AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models

First Conference on Language Modeling (COLM), 2024.
Zhu, Sicheng, Ruiyi Zhang, Bang An, Gang Wu, Joe Barrow, Zichao Wang, Furong Huang, Ani Nenkova, and Tong Sun.

Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models

First Conference on Language Modeling (COLM), 2024.
Zhu, Sicheng, Bang An, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, and Furong Huang.