AI RESEARCH
Vector Retrieval with Similarity and Diversity: How Hard Is It?
arXiv CS.CL
•
ArXi:2407.04573v4 Announce Type: replace-cross Dense vector retrieval is an important building block of modern machine learning systems, underlying applications ranging from semantic search to retrieval-augmented generation and knowledge-intensive reasoning. Beyond retrieving items that are individually similar to a query, many applications require a set of results that is also diverse, complementary, and collectively informative. Balancing similarity and diversity is therefore central to effective retrieval, but remains challenging to optimize in a stable and theoretically grounded way.