Kavli Affiliate: Zeeshan Ahmed | First 5 Authors: Jinzheng Zhao, Niko Moritz, Egor Lakomkin, Ruiming Xie, Zhiping Xiu | Summary: Cascaded speech-to-speech translation systems often suffer from error accumulation and from high latency, the latter because the inference delays of the cascaded modules add up. In this paper, we propose a transducer-based speech translation model […]
Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens