Show HN: HermesBench – workflow reliability evals for personal AI agents
Hacker News Show AI
•
Generative AI
Article URL: Comments URL: Points: 2 # Comments: 0