Gemini 3.5 Flash ranks #1 on the APEX-Agents-AA benchmark, outperforming models a full size tier above it.
r/singularity
Generative AI
AI Research
Submitted by /u/Independent-Wind4462