Kavli Affiliate: Dan Luo | First 5 Authors: Zhiqi Huang, Dan Luo, Jun Wang, Huan Liao, Zhiheng Li | Summary: Our research introduces a framework for video-to-audio synthesis that addresses two problems: audio-video desynchronization and semantic loss in the audio. By incorporating a semantic alignment adapter and a temporal synchronization adapter, our method […]
Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis