MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer

Kavli Affiliate: Zheng Zhu | First 5 Authors: Chaoqiang Zhao, Youmin Zhang, Matteo Poggi, Fabio Tosi, Xianda Guo | Summary: Self-supervised monocular depth estimation is an attractive solution that does not require hard-to-source depth labels for training. Convolutional neural networks (CNNs) have recently achieved great success in this task. However, their limited receptive field constrains […]

