Kavli Affiliate: Ke Wang | First 5 Authors: Ke Wang, Ke Wang, , , | Summary: The growing capabilities of large language models and multimodal systems have spurred interest in voice-first AI assistants, yet existing benchmarks are inadequate for evaluating the full range of these systems’ capabilities. We introduce VoiceAssistant-Eval, a comprehensive benchmark designed to […]
Continue.. VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing