Furong Huang
Associate Professor @ University of Maryland
Home
Publications
Research
Project Page Highlights
Students
Teaching
Blog
Contact
Students on the Job Market
Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs
Publications
Year
2026
Type(s)
Conference articles
Author(s)
James Beetham and Souradip Chakraborty and Mengdi Wang and Furong Huang and Amrit Singh Bedi and Mubarak Shah
Source
In 19th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2026, 2026
Url
https://aclanthology.org/2026.eacl-long.360.pdf
BibTeX
BibTeX
BibTeX
@inproceedings{beetham2026jailbreaksinferencetime, title = {{Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs}}, author = {James Beetham and Souradip Chakraborty and Mengdi Wang and Furong Huang and Amrit Singh Bedi and Mubarak Shah}, booktitle = {19th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2026}, year = {2026}, url = {https://aclanthology.org/2026.eacl-long.360.pdf}, note = {
Paper
},