AI RESEARCH
ChannelTok: Efficient Flexible-Length Vision Tokenization
arXiv CS.CV
•
ArXi:2606.04461v1 Announce Type: new Leading flexible vision tokenizers achieve SOTA quality at an extreme cost, relying on parameter-heavy backbones and slow, multi-step generative decoders. We depart from this complex, spatial-token paradigm and