AI RESEARCH
Multi-view Pyramid Transformer: Look Coarser to See Broader
arXiv CS.CV
•
ArXi:2512.07806v2 Announce Type: replace We propose Multi-view Pyramid Transformer (MVP), a scalable multi-view transformer architecture that directly reconstructs large 3D scenes from tens to hundreds of images in a single forward pass.