Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

Year

2024

Type(s)

Conference proceedings

Author(s)

Ding, Mucong, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, Anima Anandkumar, and Furong Huang.

Source

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024.

URL

https://arxiv.org/abs/2409.18433

Furong Huang

Associate Professor @ University of Maryland