Gemini 3.5 Flash ranks #1 on the APEX-Agents-AA benchmark, outperforming much larger models a whole size above it.

r/singularity
Generative AI AI Research

Submitted by /u/Independent-Wind4462 [link] [comments]