AI RESEARCH

Multi-view Pyramid Transformer: Look Coarser to See Broader

arXiv CS.CV

ArXi:2512.07806v2 Announce Type: replace We propose Multi-view Pyramid Transformer (MVP), a scalable multi-view transformer architecture that directly reconstructs large 3D scenes from tens to hundreds of images in a single forward pass.