Publication Type: Conference proceedings

Safety Guaranteed Robust Multi-Agent Reinforcement Learning with Hierarchical Control for Connected and Automated Vehicles

IEEE International Conference on Robotics and Automation (ICRA), 2025.
Zhang, Zhili, H M Sabbir Ahmad, Ehsan Sabouni, Yanchao Sun, Furong Huang, Wenchao Li, and Fei Miao.

Is poisoning a real threat to DPO? Maybe more so than you think.

AAAI 2025 AI Alignment Track (AAAI), 2025.
Pathmanathan, Pankayaraj, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, and Furong Huang.

Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?

The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025.
Panaitescu-Liess, Michael-Andrei, Zora Che, Bang An, Yuancheng Xu, Pankayaraj Path- manathan, Souradip Chakraborty, Sicheng Zhu, Tom Goldstein, and Furong Huang.

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024.
Ding, Mucong, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, Anima Anandkumar, and Furong Huang.
Publisher's website

Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Xu, Yuancheng, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, and Furong Huang.
Publisher's website

Transfer Q-star: Principled Decoding for LLM Alignment

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Chakraborty, Souradip, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Bedi, and Furong Huang.
Publisher's website

ACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Bornstein, Marco, Amrit Bedi, Abdirisak Mohamed, and Furong Huang.
Publisher's website

Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
Liang, Yongyuan, Tingqiang Xu, Kaizhe Hu, Guangqi Jiang, Furong Huang, and Huazhe Xu.
Publisher's website

Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
McClellan, Joshua, Naveed Haghani, John Winder, Furong Huang, and Pratap Tokekar.
Publisher's website

Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation

The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Zhou, Yuhang, Jing Zhu, Paiheng Xu, Xiaoyu Liu, Xiyao Wang, Danai Koutra, Wei Ai, and Furong Huang.
Publisher's website