CVPR 2026
Yiwen Zhang¹ Joseph Tung² Ruojin Cai³ David Fouhey² Hadar Averbuch-Elor¹
¹Cornell University ²New York University ³Kempner Institute, Harvard University
@misc{zhang2025emergentextremeviewgeometry3d,
title={Emergent Extreme-View Geometry in 3D Foundation Models},
author={Yiwen Zhang and Joseph Tung and Ruojin Cai and David Fouhey and Hadar Averbuch-Elor},
year={2025},
eprint={2511.22686},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2511.22686},
}

| Component | Evaluation | Training |
|---|---|---|
| VGGT | ✅ Released | Coming soon |
| π³ | ✅ Released | Coming soon |
| World-Mirror (WM) | ✅ Released | Coming soon |
- Clone the repository (with submodules): `git clone --recurse-submodules https://github.com/theREALevan/extreme-view-3dfm.git`, then `cd extreme-view-3dfm`.
- Install dependencies: go into the corresponding model folder under `models/` and follow that repo's setup.
  - VGGT: `models/vggt` (see VGGT Quick Start).
  - π³ (Pi3): `models/pi3` (see Pi3 Quick Start).
  - World-Mirror (WM): `models/worldmirror` (see Dependencies and Installation).
- In `scripts/eval.sh`, set `BASE_DIR` to your MegaUnScene data root.
- Set `MODEL` to `vggt`, `pi3`, or `wm`. Set `NO_CKPT=1` to use the pre-trained model only; otherwise the fine-tuned checkpoint is used.
- Run: `./scripts/eval.sh`
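
The settings described above can be sanity-checked before launching a run. The following is a minimal sketch in POSIX shell, not part of the repository: the helper names `model_dir`, `checkpoint_mode`, and `preflight` are hypothetical, and treating any `NO_CKPT` value other than `1` as "fine-tuned" is an assumption based on the wording of the steps. It does not read or modify `scripts/eval.sh`.

```shell
# Hypothetical pre-flight helpers (illustrative only, not shipped with the repo).

# Map a MODEL value to its folder under models/ (paths as listed in the steps).
model_dir() {
  case "$1" in
    vggt) echo "models/vggt" ;;
    pi3)  echo "models/pi3" ;;
    wm)   echo "models/worldmirror" ;;
    *)    echo "unknown MODEL: $1 (expected vggt, pi3, or wm)" >&2
          return 1 ;;
  esac
}

# Report which weights a run would use.
# Assumption: only NO_CKPT=1 selects the pre-trained model; anything else
# (including unset) means the fine-tuned checkpoint.
checkpoint_mode() {
  if [ "${1:-0}" = "1" ]; then
    echo "pre-trained"
  else
    echo "fine-tuned"
  fi
}

# Validate the settings and print a one-line summary.
# Arguments: MODEL, BASE_DIR, NO_CKPT (optional).
preflight() {
  model_dir "$1" >/dev/null || return 1
  if [ ! -d "$2" ]; then
    echo "BASE_DIR is not a directory: $2" >&2
    return 1
  fi
  echo "model=$1 dir=$(model_dir "$1") weights=$(checkpoint_mode "${3:-0}") data=$2"
}
```

One possible use is to call `preflight` with the same values you entered in `scripts/eval.sh` and start the evaluation only if it succeeds, for example `preflight wm /path/to/MegaUnScene 1 && ./scripts/eval.sh`, where `/path/to/MegaUnScene` is a placeholder for your data root.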
For evaluations on monocular depth, multiview pose estimation, and dense reconstruction (including on UnSceneRecon), please refer to the generalization evaluation GitHub repo.
For MegaUnScene information and download instructions, please refer to the HuggingFace dataset page.
For more details, visit the project page.