Kavli Affiliate: Xiang Zhang | First 5 Authors: Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee | Summary: Vision-language contrastive learning suggests a new learning paradigm by leveraging a large amount of image-caption-pair data. The caption supervision excels at providing wide coverage in vocabulary that enables strong zero-shot image recognition performance. On the […]
Continue.. Prefix Conditioning Unifies Language and Label Supervision