Furong Huang
Associate Professor @ University of Maryland
Home
Publications
Research
Project Page Highlights
Students
Teaching
Blog
Contact
Students on the Job Market
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models
Publications
Year
2024
Type(s)
Conference proceedings
Author(s)
Zhu, Sicheng, Bang An, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, and Furong Huang.
Source
First Conference on Language Modeling (COLM), 2024.
BibTeX
BibTeX
BibTeX
@inproceedings{an2023automatic, title={Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models}, author={An, Bang and Zhu, Sicheng and Zhang, Ruiyi and Panaitescu-Liess, Michael-Andrei and Xu, Yuancheng and Huang, Furong}, booktitle={First Conference on Language Modeling}, year={2023} }