AI RESEARCH

"Skill issues'': data-centric optimization of lakehouse agents

arXiv CS.AI

ArXi:2606.01185v1 Announce Type: new Coding agents are becoming users of data infrastructure, but their success depends not only on model quality: it also depends on the skills and environment files that teach agents how to use a system. We study how to optimize these artifacts for agents operating on a branching lakehouse, Bauplan. In our setting, headless APIs and Git-like data primitives expose data workflows through code, branches, commits, and merges.