Kavli Affiliate: Li Xin Li | First 5 Authors: Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao | Summary: Traditional benchmarks struggle to evaluate increasingly sophisticated language models in multilingual and culturally diverse contexts. To address this gap, we introduce MMLU-ProX, a comprehensive multilingual benchmark covering 13 typologically diverse languages with approximately 11,829 […]
Continue.. MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation