Kavli Affiliate: Zheng Zhu | First 5 Authors: Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu | Summary: Vision-centric autonomous driving has recently raised wide attention due to its lower cost. Pre-training is essential for extracting a universal representation. However, current vision-centric pre-training typically relies on either 2D or 3D pre-text tasks, overlooking […]
Continue.. DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving