M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation

Kavli Affiliate: Zhuo Li | First 5 Authors: Mingshuang Luo, Ruibing Hou, Zhuo Li, Hong Chang, Zimo Liu | Summary: This paper presents M$^3$GPT, an advanced $\textbf{M}$ultimodal, $\textbf{M}$ultitask framework for $\textbf{M}$otion comprehension and generation. M$^3$GPT operates on three fundamental principles. The first focuses on creating a unified representation space for various motion-relevant modalities. We employ […]

