Button‑pushing explorers: How to grasp that AI agents can do amazing things while knowing nothing

On May 1, 2026, the nonprofit ARC Prize Foundation released the results of a new benchmark: a test of an AI system's ability to solve a game. The results were striking. Humans scored 100%, while the most advanced AI systems scored under 1%.