Conference articles – Page 3

AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment

In The AAAI-25 Workshop on Artificial Intelligence for Cyber Security, AICS 2025, 2025
Pankayaraj Pathmanathan and Udari Madhushani Sehwag and Michael-Andrei Panaitescu-Liess and Furong Huang

Publisher's website

Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs

In 19th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2026, 2026
James Beetham and Souradip Chakraborty and Mengdi Wang and Furong Huang and Amrit Singh Bedi and Mubarak Shah

Publisher's website

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

In The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), Spotlight, 2025, 2025
Xiyao Wang and Zhengyuan Yang and Chao Feng and Hongjin Lu and Linjie Li and Chung-Ching Lin and Kevin Lin and Furong Huang and Lijuan Wang

Publisher's website

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

In The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), 2025, 2025
Xiyao Wang and Zhengyuan Yang and Chao Feng and Yuhang Zhou and Xiaoyu Liu and Yongyuan Liang and Ming Li and Ziyi Zang and Linjie Li and Chung-Ching Lin and Kevin Lin and Furong Huang and Lijuan Wang

Publisher's website

Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models

In The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), 2025, 2025
Soumya Suvra Ghosal and Souradip Chakraborty and Avinash Reddy and Yifu Lu and Mengdi Wang and Dinesh Manocha and Furong Huang and Mohammad Ghavamzadeh and Amrit Singh Bedi

Publisher's website

A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks

In The Thirty-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS), 2025, 2025
Mucong Ding and Bang An and Tahseen Rabbani and Chenghao Deng and Anirudh Satheesh and Souradip Chakraborty and Mehrdad Saberi and Yuxin Wen and Kyle Rui Sang and Aakriti Agrawal and Xuandong Zhao and Mo Zhou and Mary-Anne Hartley and Lei Li and Yu-Xiang Wang and Vishal M. Patel and Soheil Feizi and Tom Goldstein and Furong Huang

Publisher's website

Practical Memorization Tests for Detecting Copyrighted Data in Large Language Models

In Seventh Workshop on Privacy in Natural Language Processing (PrivateNLP), ACL 2026, 2026
Michael-Andrei Panaitescu-Liess and Aadi Palnitkar and Archit Kambhamettu and Yigitcan Kaya and Daniel Brown and Sungbin Oh and Sean Michael McLeish and Marco Bornstein and Furong Huang and Tom Goldstein

Furong Huang

Associate Professor @ University of Maryland

Publication Type: Conference articles

AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment

Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models

A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks

Practical Memorization Tests for Detecting Copyrighted Data in Large Language Models

Uncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM Systems

DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data

Imagine, Verify, Execute: Agentic Exploration with Vision-Language Models

Where Has Furong Been? Behind the Scenes of Our NeurIPS Competition

Past News

NeurIPS ’22 Main Conference Papers from Huang Lab @UMD