Furong Huang
Associate Professor @ University of Maryland
Home
Publications
Research
Project Page Highlights
Students
Teaching
Blog
Contact
Students on the Job Market
Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
Publications
Year
2024
Type(s)
Conference proceedings
Author(s)
Ding, Mucong, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, Anima Anandkumar, and Furong Huang.
Source
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024.
Url
https://arxiv.org/abs/2409.18433
BibTeX
BibTeX
BibTeX
@inproceedings{ding2024easy2hard, title={Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization}, author={Ding, Mucong and Deng, Chenghao and Choo, Jocelyn and Wu, Zichu and Agrawal, Aakriti and Schwarzschild, Avi and Zhou, Tianyi and Goldstein, Tom and Langford, John and Anandkumar, Anima and others}, booktitle={The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track}, year={2024}, url={https://openreview.net/forum?id=iNB4uoFQJb} }