Furong Huang
Associate Professor @ University of Maryland
Home
Publications
Research
Project Page Highlights
Students
Teaching
Blog
Contact
Students on the Job Market
AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models
Publications
Year
2023
Type(s)
Conference articles
Author(s)
Sicheng Zhu and Ruiyi Zhang and Bang An and Gang Wu and Joe Barrow and Zichao Wang and Furong Huang and Ani Nenkova and Tong Sun
Source
In Workshop on Socially Responsible Language Modelling Research (SoLaR), NeurIPS 2023, 2023
BibTeX
BibTeX
BibTeX
@inproceedings{zhu2023autodanautomaticinterpretable, title = {{AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models}}, author = {Sicheng Zhu and Ruiyi Zhang and Bang An and Gang Wu and Joe Barrow and Zichao Wang and Furong Huang and Ani Nenkova and Tong Sun}, booktitle = {Workshop on Socially Responsible Language Modelling Research (SoLaR), NeurIPS 2023}, year = {2023},