AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Wu, Xiyang, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, and Dinesh Manocha.
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models
First Conference on Language Modeling (COLM), 2024.
Zhu, Sicheng, Ruiyi Zhang, Bang An, Gang Wu, Joe Barrow, Zichao Wang, Furong Huang, Ani Nenkova, and Tong Sun.
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models
First Conference on Language Modeling (COLM), 2024.
Zhu, Sicheng, Bang An, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, and Furong Huang.
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
The 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
Wang, Xiyao, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Fuxiao Liu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, and Furong Huang.
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification
The 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
Zhou, Yuhang, Paiheng Xu, Xiaoyu Liu, Bang An, Wei Ai, and Furong Huang.
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Zheng, Ruijie, Yongyuan Liang, Xiyao Wang, Shuang Ma, Hal Daumé III, Huazhe Xu, John Langford, Praveen Palanisamy, Kalyan Shankar Basu, Furong Huang.
WAVES: Benchmarking the Robustness of Image Watermarks
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
An, Bang, Mucong Ding, Tahseen Rabbani, Aakriti Agrawal, Yuancheng Xu, Chenghao Deng, Sicheng Zhu, Abdirisak Mohamed, Yuxin Wen, Tom Goldstein, Furong Huang.
Position Paper: On the Possibilities of AI-Generated Text Detection
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Chakraborty, Souradip, Amrit Bedi, Sicheng Zhu, Bang An, Dinesh Manocha, and Furong Huang.
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Xu, Yuancheng, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang.
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
Zheng, Ruijie, Ching-An Cheng, Hal Daumé III, Furong Huang, Andrey Kolobov.