Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Huiyuan Yang, Taoyue Wang, Xiaotian Li, Lijun Yin | Summary: Recent studies have focused on utilizing multi-modal data to develop robust models for facial Action Unit (AU) detection. However, the heterogeneity of multi-modal data poses challenges in learning effective representations. One such challenge is extracting […]
Multimodal Channel-Mixing: Channel and Spatial Masked AutoEncoder on Facial Action Unit Detection