Kavli Affiliate: Jiansheng Chen | First 5 Authors: Yiqing Huang, Jiansheng Chen, , , | Summary: Existing image captioning models are usually trained by cross-entropy (XE) loss and reinforcement learning (RL), which set ground-truth words as hard targets and force the captioning model to learn from them. However, the widely adopted training strategies suffer from […]
Continue.. Teacher-Critical Training Strategies for Image Captioning