Kavli Affiliate: Ke Wang | First 5 Authors: Ke Wang, Lei He, Kun Liu, Yan Deng, Wenning Wei | Summary: Large Multimodal Models (LMMs) have demonstrated exceptional performance across a wide range of domains. This paper explores their potential in pronunciation assessment tasks, with a particular focus on evaluating the capabilities of the Generative Pre-trained […]
Continue.. Exploring the Potential of Large Multimodal Models as Effective Alternatives for Pronunciation Assessment