AI RESEARCH
Architecture-driven Shift: towards a lightweight selector for capturing the trends of logit shift
arXiv CS.AI
•
ArXi:2605.27469v1 Announce Type: cross Continual Learning (CL) is a practical paradigm to utilize power of deep pre-trained neural networks, but which pre-trained model has a better ability to balance ``Plasticity-Stability", deserving to be chosen? The logit shift serves as a natural proxy because it represents the logit shift in CL scenarios. However, obtaining the logit shift requires huge computational cost, which hinders large-scale model selection.