Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. MUSt3R: Multi-view Network for Stereo 3D Reconstruction
 
conference paper

MUSt3R: Multi-view Network for Stereo 3D Reconstruction

Cabon, Yohann
•
Stoffl, Lucas  
•
Antsfeld, Leonid
Show more
June 10, 2025
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025). Proceedings
The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025

DUSt3R introduced a novel paradigm in geometric computer vision by proposing a model that can provide dense and unconstrained Stereo 3D Reconstruction of arbitrary image collections with no prior information about camera calibration nor viewpoint poses. Under the hood, however, DUSt3R processes image pairs, regressing local 3D reconstructions that need to be aligned in a global coordinate system. The number of pairs, growing quadratically, is an inherent limitation that becomes especially concerning for robust and fast optimization in the case of large image collections. In this paper, we propose an extension of DUSt3R from pairs to multiple views, that addresses all aforementioned concerns. Indeed, we propose a Multi-view Network for Stereo 3D Reconstruction, or MUSt3R, that modifies the DUSt3R architecture by making it symmetric and extending it to directly predict 3D structure for all views in a common coordinate frame. Second, we entail the model with a multi-layer memory mechanism which allows to reduce the computational complexity and to scale the reconstruction to large collections, inferring thousands of 3D pointmaps at high frame-rates with limited added complexity. The framework is designed to perform 3D reconstruction both offline and online, and hence can be seamlessly applied to SfM and visual SLAM scenarios showing state-of-the-art performance on various 3D downstream tasks, including uncalibrated Visual Odometry, relative camera pose, scale and focal estimation, 3D reconstruction and multi-view depth estimation.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/cvpr52734.2025.00106
Author(s)
Cabon, Yohann
Stoffl, Lucas  

École Polytechnique Fédérale de Lausanne

Antsfeld, Leonid
Csurka, Gabriela
Chidlovskii, Boris
Revaud, Jerome
Leroy, Vincent
Date Issued

2025-06-10

Publisher

IEEE

Published in
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025). Proceedings
DOI of the book
https://doi.org/10.1109/CVPR52734.2025
ISBN of the book

979-8-3315-4364-8

Start page

1050

End page

1060

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
UPAMATHIS  
Event nameEvent acronymEvent placeEvent date
The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025

Nashville, Tennessee, US

2025-06-11 - 2025-06-15

Available on Infoscience
August 20, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/253258
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés