Kavli Affiliate: Xiang Zhang | First 5 Authors: Zhawnen Chen, Tianchun Wang, Yizhou Wang, Michal Kosinski, Xiang Zhang | Summary: Can large multimodal models have a human-like ability for emotional and social reasoning, and if so, how does it work? Recent research has discovered emergent theory-of-mind (ToM) reasoning capabilities in large language models (LLMs). LLMs […]
Continue.. Through the Theory of Mind’s Eye: Reading Minds with Multimodal Video Large Language Models