GrepSeek: Training Search Agents for Direct Corpus Interaction
arXiv CS.AI
arXiv:2605.29307v1 Announce Type: cross
Large Language Model (LLM) search agents have shown strong promise on knowledge-intensive language tasks by interleaving multiple rounds of reasoning and information retrieval. Most existing systems access information through a retriever that takes a keyword or natural-language query and returns a ranked list of documents from an index of precomputed document representations. In this work, we explore a complementary perspective in which the search agent treats the corpus itself as the search environment and finds evidence by issuing executable shell commands.
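To make the idea of direct corpus interaction concrete, here is a minimal sketch of the kind of tool an agent could call to run a shell command against a corpus directory. It is an illustration only, not the paper's implementation: the function name `run_search_command`, the allowlist, the output cap, and the timeout are all assumptions, since the abstract does not specify which commands the agent may run or how their output is returned to the model.

```python
import shlex
import subprocess
from pathlib import Path

# Hypothetical allowlist of read-only commands; the abstract does not name any.
ALLOWED_COMMANDS = {"grep", "ls", "cat", "head", "wc"}
# Assumed cap so a single tool result stays small enough for a model context.
MAX_OUTPUT_CHARS = 2000


def run_search_command(command: str, corpus_dir: str, timeout_s: float = 5.0) -> str:
    """Run one allowlisted command with corpus_dir as the working directory.

    Returns the command's stdout (or stderr if stdout is empty), truncated to
    MAX_OUTPUT_CHARS, or a string starting with "error:" if the command is
    rejected or fails to run.
    """
    try:
        argv = shlex.split(command)  # tokenize like a POSIX shell, without executing
    except ValueError as exc:  # e.g. unbalanced quotes
        return f"error: {exc}"
    if not argv or argv[0] not in ALLOWED_COMMANDS:
        return f"error: command not allowed: {command!r}"
    try:
        # No shell=True: the argument list is passed directly to the program,
        # so pipes, redirects, and command substitution are not interpreted.
        result = subprocess.run(
            argv,
            cwd=Path(corpus_dir),
            capture_output=True,
            text=True,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        return "error: command timed out"
    except OSError as exc:  # e.g. the binary is not installed
        return f"error: {exc}"
    output = result.stdout if result.stdout else result.stderr
    return output[:MAX_OUTPUT_CHARS]
```

An agent loop could pass a model-generated string such as `grep -rn "capital of France" .` to this function and append the returned text to the conversation as the observation for the next reasoning step. Note that the allowlist is not a sandbox: arguments may still point outside the corpus directory, so a real deployment would need stronger isolation.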