Kavli Affiliate: Zheng Zhu | First 5 Authors: Tianbao Zhang, Jian Zhao, Yuer Li, Zheng Zhu, Ping Hu | Summary: Whole-body audio-driven avatar pose and expression generation is a critical task for creating lifelike digital humans and enhancing the capabilities of interactive virtual agents, with wide-ranging applications in virtual reality, digital entertainment, and remote communication. […]
Continue.. AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars