Gemini 3.5 Flash ranks #1 on the Frontier-Agent-VN benchmark, outperforming much larger models a whole size above it.

r/singularity
Generative AI AI Research

Submitted by /u/SuggestionMission516 [link] [comments]