Kavli Affiliate: Yi Zhou | First 5 Authors: Bingzhi Liu, Yin Cao, Haohe Liu, Yi Zhou, | Summary: Diffusion models have demonstrated promising results in text-to-audio generation tasks. However, their practical usability is hindered by slow sampling speeds, limiting their applicability in high-throughput scenarios. To address this challenge, progressive distillation methods have been effective in […]
Continue.. Balanced SNR-Aware Distillation for Guided Text-to-Audio Generation