Top of the leaderboard:
🥇 Gemini 3.5 Flash: 80.2%
🥈 Gemini 3 Flash Preview: 79.2%
🥉 Gemini 2.5 Pro: 78.9%
Models must process raw frames as native tokens to locate seconds-long events hidden in an hour of video; accuracy scales logarithmically with frame density.
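
To make "scales logarithmically" concrete, here is a minimal sketch that assumes the relationship has the form accuracy = a + b * ln(frame density). The function name `fit_log_curve`, the frames-per-second unit, and every number in it are hypothetical placeholders for illustration, not benchmark measurements.

```python
import math


def fit_log_curve(densities, accuracies):
    """Least-squares fit of accuracy = a + b * ln(density); returns (a, b)."""
    xs = [math.log(d) for d in densities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(accuracies) / n
    # Ordinary least squares for a straight line in (ln(density), accuracy) space.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, accuracies))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


# Synthetic points generated from a = 50, b = 5 (assumed values, NOT leaderboard data).
densities = [0.25, 0.5, 1.0, 2.0, 4.0]  # frames per second, hypothetical
accuracies = [50 + 5 * math.log(d) for d in densities]

a, b = fit_log_curve(densities, accuracies)

# Under a log law, every doubling of frame density adds the same b * ln(2) points.
gain_per_doubling = b * math.log(2)
```

If the assumed form holds, going from 1 to 2 frames per second buys the same accuracy gain as going from 2 to 4, so returns on denser sampling diminish steadily.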
