Sponsio: Deterministic Contract Layer for LLM Agents [P]

r/MachineLearning

We've been trying to put LangGraph agents into production for a while. The thing that kept biting us was tool-call boundary enforcement: rules like "must call X before Y", "max N retries", or "approval gate before destructive action". Worked fine in s, broke at the moments that mattered.

What we tried first:

Prompt engineering. Told the model "always call check_policy before issue_refund". Worked ~95% of the time. The 5% where it didn't were exactly the cases an auditor would ask about. Not a great answer when someone wants to know why a refund went through.

Post-hoc audit (OTEL + log.