Furong Huang
Associate Professor @ University of Maryland
Home
Publications
Research
Project Page Highlights
Students
Teaching
Blog
Contact
Students on the Job Market
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models
Publications
Year
2024
Type(s)
Conference proceedings
Author(s)
Zhu, Sicheng, Ruiyi Zhang, Bang An, Gang Wu, Joe Barrow, Zichao Wang, Furong Huang, Ani Nenkova, and Tong Sun.
Source
First Conference on Language Modeling (COLM), 2024.
BibTeX
BibTeX
BibTeX
@inproceedings{zhu2023autodan, title={AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models}, author={Zhu, Sicheng and Zhang, Ruiyi and An, Bang and Wu, Gang and Barrow, Joe and Wang, Zichao and Huang, Furong and Nenkova, Ani and Sun, Tong}, booktitle={First Conference on Language Modeling}, year={2023} }