October 23, 2025 report
The GIST AI teaches itself and outperforms human-designed algorithms
Paul Arnold
contributing writer
Gaby Clark
scientific editor
Robert Egan
associate editor
Editors' notes
This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:
fact-checked
peer-reviewed publication
trusted source
proofread

Like humans, artificial intelligence learns by trial and error, but traditionally, it requires humans to set the ball rolling by designing the algorithms and rules that govern the learning process. However, as AI technology advances, machines are increasingly doing things themselves. An example is a new AI system developed by researchers that invented its own way to learn, resulting in an algorithm that outperformed human-designed algorithms on a series of complex tasks.
For decades, human engineers have designed the algorithms that agents use to learn, especially reinforcement learning (RL), where an AI learns by receiving rewards for successful actions. While learning comes naturally to humans and animals, thanks to millions of years of evolution, it has to be explicitly taught to AI. This process is often slow and laborious and is ultimately limited by human intuition.
Taking their cue from evolution, which is a random trial and error process, the researchers created a large digital population of AI agents. These agents tried to solve numerous tasks in many different, complex environments using a particular learning rule.
Overseeing them was a "meta-network," a parent AI that analyzed how well the agents performed and then changed the learning rule so the next generation of agents could learn faster and perform better. This allowed the system to discover a new learning rule, DiscoRL, which the researchers called Disco57 (evaluated on 57 Atari games), that was superior to any previously designed by humans.
The team then used Disco57 to train a new AI agent and compared its performance against some of the best human-designed algorithms, such as PPO and MuZero. First, it was trained on well-known Atari games, and then on unseen challenges, including games like ProcGen, Crafter and NetHack.
The results were outstanding. On the Atari Benchmark (a set of classic Atari video games used to evaluate AI performance), the DiscoRL-trained achieved better results than all human-designed algorithms. When confronted with unseen challenges, it performed at a state-of-the-art level, proving the system had discovered its own learning rule.
"Our findings suggest that the RL algorithms required for advanced artificial intelligence may soon be automatically discovered from the experiences of agents, rather than manually designed," wrote the researchers in their paper published in the journal Nature. "This work has taken a step towards machine-designed reinforcement learning algorithms that can compete with and even outperform some of the best manually-designed algorithms in challenging environments."
Written for you by our author Paul Arnold, edited by Gaby Clark, and fact-checked and reviewed by Robert Egan—this article is the result of careful human work. We rely on readers like you to keep independent science journalism alive. If this reporting matters to you, please consider a donation (especially monthly). You'll get an ad-free account as a thank-you.
More information: Junhyuk Oh et al, Discovering state-of-the-art reinforcement learning algorithms, Nature (2025). DOI: 10.1038/s41586-025-09761-x
Journal information: Nature
© 2025 Science X Network
Citation: AI teaches itself and outperforms human-designed algorithms (2025, October 23) retrieved 23 October 2025 from https://techxplore.com/news/2025-10-ai-outperforms-human-algorithms.html This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
Explore further
Researchers exploit weaknesses of master game bots
Feedback to editors