AI RESEARCH
LEMUR: Learned Multi-Vector Retrieval
arXiv CS.LG
arXiv:2601.21853v2 Announce Type: replace-cross Multi-vector representations generated by late interaction models, such as ColBERT, achieve higher retrieval quality than single-vector representations in information retrieval applications. In multi-vector retrieval systems, queries and documents are each encoded with one embedding per token, and the similarity between a query and a document is measured with MaxSim. However, the improved quality of multi-vector retrieval comes at the expense of significantly increased search latency. In this work, we
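
For readers unfamiliar with MaxSim, the following is a minimal sketch of the scoring rule as it is commonly defined for ColBERT-style models: each query token embedding is matched to its most similar document token embedding, and those maxima are summed. The abstract does not spell out the formula, so the inner-product similarity, the function names `maxsim` and `rank_documents`, and the toy vectors are all assumptions made for illustration. This is not the method proposed in the paper, only the brute-force baseline whose cost motivates it.

```python
import numpy as np


def maxsim(query_emb: np.ndarray, doc_emb: np.ndarray) -> float:
    """Score one document against one query with MaxSim.

    query_emb: (num_query_tokens, dim) array, one embedding per query token.
    doc_emb:   (num_doc_tokens, dim) array, one embedding per document token.
    Assumes inner-product similarity (an illustrative choice).
    """
    # Pairwise similarities: entry (i, j) compares query token i to doc token j.
    sims = query_emb @ doc_emb.T
    # For each query token keep its best-matching doc token, then sum.
    return float(sims.max(axis=1).sum())


def rank_documents(query_emb: np.ndarray, docs: list) -> list:
    """Return document indices sorted by descending MaxSim score (brute force)."""
    scores = [maxsim(query_emb, d) for d in docs]
    return sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)


# Toy example with hand-made 2-dimensional "token embeddings".
query = np.array([[1.0, 0.0], [0.0, 1.0]])
doc_a = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])
doc_b = np.array([[1.0, 0.0]])

score_a = maxsim(query, doc_a)
score_b = maxsim(query, doc_b)
ranking = rank_documents(query, [doc_b, doc_a])
```

Scoring a single document this way compares every query token with every document token, so exhaustive search over a large collection is far more expensive than a single-vector dot product per document. That is the latency cost the abstract refers to.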