AI RESEARCH

ChannelTok: Efficient Flexible-Length Vision Tokenization

arXiv CS.CV

ArXi:2606.04461v1 Announce Type: new Leading flexible vision tokenizers achieve SOTA quality at an extreme cost, relying on parameter-heavy backbones and slow, multi-step generative decoders. We depart from this complex, spatial-token paradigm and