AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment

Year
2025
Type(s)
Author(s)
Pankayaraj Pathmanathan, Udari Madhushani Sehwag, Michael-Andrei Panaitescu-Liess, and Furong Huang
Source
In the AAAI-25 Workshop on Artificial Intelligence for Cyber Security (AICS 2025), 2025
Url
https://arxiv.org/abs/2410.11283
BibTeX