Recursive Block-Diagonal Coupling for Resource-Efficient Training of Vision Models

arXiv cs.CV

Training high-capacity vision models from scratch requires substantial computational resources. To improve the training efficiency of a wide target model, existing growth methods often assume that narrower models are already available, which obscures the true computational cost of the entire pipeline. This allows a flexible allocat