Kavli Affiliate: Ke Wang | First 5 Authors: Ke Wang, Junting Pan, Weikang Shi, Zimu Lu, Mingjie Zhan | Summary: Recent advancements in Large Multimodal Models (LMMs) have shown promising results in mathematical reasoning within visual contexts, with models approaching human-level performance on existing benchmarks such as MathVista. However, we observe significant limitations in the […]
Continue reading: Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset