Kavli Affiliate: Ting Xu | First 5 Authors: Ting Xu, Zhichao Huang, Jiankai Sun, Shanbo Cheng, Wai Lam | Summary: We present Sequential Policy Optimization for Simultaneous Machine Translation (SeqPO-SiMT), a new policy optimization framework that defines the simultaneous machine translation (SiMT) task as a sequential decision-making problem, incorporating a tailored reward to enhance […]
SeqPO-SiMT: Sequential Policy Optimization for Simultaneous Machine Translation