AI RESEARCH
unix-ctf: Procedural Environments for Unix-Competence Reinforcement Learning
arXiv CS.AI
arXiv:2605.29115v1 Announce Type: cross Unix competence is the ability to use shell and operating-system primitives as first-class tools, not merely to write programs through a terminal. Current terminal benchmarks tend to blur this distinction: a solver fluent in Python but weak in Unix can pass a substantial fraction of Terminal-Bench 2.0, while the reverse skill profile is rarely exercised. We make the distinction operational and build a