AI RESEARCH
Winner-Take-All bottlenecks enforce disentangled symbolic representations in multi-task learning
arXiv CS.LG
arXiv:2605.22472v1 Announce Type: new

Winner-take-all (WTA) networks constitute a central circuit motif in cortical networks of the brain. In addition, WTA-like activations are abundant in modern deep learning models in the form of the softmax activation, for example in the attention layers of transformers. While their role in the extraction of latent factors has been studied for relatively simple generative models, their role when latent factors are entangled in a highly non-linear way has remained elusive.
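To make the link between softmax and winner-take-all concrete, here is a minimal sketch in plain Python. It is an illustration only, not the model studied in the paper: the `temperature` parameter, the function names, and the example scores are all assumptions introduced here. It shows that a softmax with a small temperature approaches a hard WTA (one-hot) output.

```python
import math

def softmax(scores, temperature=1.0):
    """Numerically stable softmax over a list of scores.

    `temperature` is an illustrative knob (not taken from the source):
    smaller values sharpen the output toward a one-hot vector.
    """
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the max to avoid overflow in exp
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def hard_wta(scores):
    """Hard winner-take-all: 1.0 at the largest score, 0.0 elsewhere."""
    winner = max(range(len(scores)), key=lambda i: scores[i])
    return [1.0 if i == winner else 0.0 for i in range(len(scores))]

scores = [1.0, 2.0, 3.5]  # hypothetical pre-activations
for t in (1.0, 0.1, 0.01):
    print(t, [round(p, 3) for p in softmax(scores, t)])
print("hard WTA:", hard_wta(scores))
```

At temperature 1.0 the output is a soft competition in which every unit keeps some activation; as the temperature shrinks, nearly all of the probability mass moves to the unit with the largest score.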