AI RESEARCH
Recursive Block-Diagonal Coupling for Resource-Efficient Training of Vision Models
arXiv cs.CV
Training high-capacity vision models from scratch requires substantial computational resources. To improve the training efficiency of a wide target model, existing growth methods often assume that narrower models are already available, which obscures the true computational cost of the entire pipeline. This allows a flexible allocat