Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Year

2025

Type(s)

Conference articles

Author(s)

Souradip Chakraborty Soumya Suvra Ghosal and Vaibhav Singh and Tianrui Guan and Mengdi Wang and Ahmad Beirami and Furong Huang and Alvaro Velasquez and Dinesh Manocha and Amrit Singh Bedi

Source

In Conference on Computer Vision and Pattern Recognition (CVPR), 2025, 2025

Url

https://arxiv.org/abs/2411.18688

BibTeX

Furong Huang

Associate Professor @ University of Maryland

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

BibTeX

Where Has Furong Been? Behind the Scenes of Our NeurIPS Competition

Past News

NeurIPS ’22 Main Conference Papers from Huang Lab @UMD