Kavli Affiliate: Feng Wang | First 5 Authors: Feng Wang, Timing Yang, Yaodong Yu, Sucheng Ren, Guoyizhe Wei | Summary: In this work, we introduce the Adventurer series of models, in which we treat images as sequences of patch tokens and employ uni-directional language models to learn visual representations. This modeling paradigm allows us to process images […]
Adventurer: Optimizing Vision Mamba Architecture Designs for Efficiency