AI RESEARCH

Norm$\times$Direction: Restoring the Missing Query Norm in Vision Linear Attention

arXiv CS.LG

ArXi:2506.21137v3 Announce Type: replace Linear attention mitigates the quadratic complexity of softmax attention but suffers from a critical loss of expressiveness. We identify two primary causes: (1) The normalization operation cancels the query norm, which breaks the correlation between a query's norm and the spikiness (entropy) of the attention distribution as in softmax attention. (2) Standard techniques for enforcing non-negativity cause destructive information loss by nullifying valid inner-product interactions. To address these challenges, we.