Kavli Affiliate: Zeeshan Ahmed | First 5 Authors: Zeeshan Ahmed | Summary: This paper tackles several challenges that arise when integrating Automatic Speech Recognition (ASR) and Machine Translation (MT) for real-time, on-device streaming speech translation. Although state-of-the-art ASR systems based on Recurrent Neural Network Transducers (RNN-T) can perform real-time transcription, achieving […]
Overcoming Latency Bottlenecks in On-Device Speech Translation: A Cascaded Approach with Alignment-Based Streaming MT