homehome Home chatchat Notifications


An AI Called Dreamer Learned to Mine Diamonds in Minecraft — Without Being Taught

A self-improving algorithm masters a complex game task, hinting at a new era in AI.

Tudor Tarita
April 10, 2025 @ 2:42 pm

share Share

In the blocky landscapes of Minecraft, millions of players dig, build, and battle each day. But one day, a newcomer broke the game with his diamond mining skills — the game’s most prized resource.

The newcomer wasn’t a player, but an artificial intelligence system called Dreamer, developed by Google DeepMind. And its journey from noob to grand master may offer a glimpse of the future of intelligent machines.

What Dreamer sees
What Dreamer sees. Credit: Mojang

Learning to Learn

The task was simple in name only: collect a diamond. For humans, this is already an involved process. You need to chop trees for wood, craft a table, build a pickaxe, gather stone and iron, then descend into the depths of a randomly generated world — dodging lava and hazards along the way — before you even stand a chance of finding a glimmering gem.

Now imagine doing all that with zero guidance.

Dreamer did just that. It wasn’t taught to play Minecraft, nor was it shown examples of how humans do it. Instead, it began with nothing but the game’s rules and a goal: get that ice (diamond).

Using a technique called reinforcement learning, the AI experimented its way forward. It tried different actions, received small rewards when it made progress — and learned from its mistakes.

“Dreamer marks a significant step towards general AI systems,” Danijar Hafner, a computer scientist at Google DeepMind told Nature. “It allows AI to understand its physical environment and also to self-improve over time, without a human having to tell it exactly what to do.”

The feat, described in a paper published this month in Nature, was far from trivial. Unlike chess or Go — games with fixed boards and perfect information — Minecraft is messy, open-ended, and different every time. Each new play session generates a unique world with forests, deserts, oceans, and hidden underground caves. To succeed, the AI had to develop flexible skills and learn how to generalize.

Training process of Dreamer
Training process of Dreamer. Credit: Nature

A Machine That Can Imagine the Future

At the heart of Dreamer’s success is what scientists call a world model — a kind of internal simulation that lets the AI imagine different scenarios before acting.

Instead of blindly trying every possibility in the real game environment, Dreamer could project the likely outcomes of different actions inside its mind, much like a person mentally rehearsing the steps to solve a puzzle.

“The world model really equips the AI system with the ability to imagine the future,” said Hafner.

That capacity to imagine is no small thing. It allowed Dreamer to speed up learning dramatically. Within just nine days of continuous play, Dreamer reached expert performance. By then, it could mine a diamond in about 30 minutes — a speed comparable to skilled human players.

To prevent Dreamer from simply memorizing one solution, the developers added a twist: every 30 minutes, the game world would reset, replaced with an entirely new one. That forced the AI to adapt, over and over again, learning general rules rather than specific tricks.

Indeed, previous attempts at teaching AI to find diamonds leaned heavily on watching videos of humans play or guiding the algorithms step by step. Dreamer, by contrast, figured it all out from scratch.

Credit: Danijar Hafner et al., https://danijar.com/project/dreamerv3/

Ok, Dreamer Got the Diamonds, Now What?

So why does this matter? Surely Google isn’t spending millions just to make better Minecraft players.

In truth, Minecraft was never the real goal. The game is simply a rich and unpredictable playground, perfect for training algorithms that might one day operate in our world — where trial-and-error learning comes with serious costs.

Teaching a robot to pick up a glass or navigate a warehouse using brute-force trial and error would be slow and risky. But a robot that can imagine the consequences of its actions, the way Dreamer does, could learn much faster and more safely.

It’s this blend of flexibility, foresight, and autonomy that scientists call “general intelligence” — and it’s a long-standing holy grail in AI research.

While Dreamer is still far from matching human reasoning or understanding, its success in Minecraft is a promising sign. It learned not by being programmed, but by exploring, failing, imagining — and, eventually, mastering a task that even humans find tricky.

Next up? The Ender Dragon. For now, though, Dreamer is content with its first prize: a diamond. It ain’t much, but it’s honest work.

share Share

This Rare Viking Burial of a Woman and Her Dog Shows That Grief and Love Haven’t Changed in a Thousand Years

The power of loyalty, in this life and the next.

This EV Battery Charges in 18 Seconds and It’s Already Street Legal

RML’s VarEVolt battery is blazing a trail for ultra-fast EV charging and hypercar performance.

DARPA Just Beamed Power Over 5 Miles Using Lasers and Used It To Make Popcorn

A record-breaking laser beam could redefine how we send power to the world's hardest places.

Why Do Some Birds Sing More at Dawn? It's More About Social Behavior Than The Environment

Study suggests birdsong patterns are driven more by social needs than acoustics.

Nonproducing Oil Wells May Be Emitting 7 Times More Methane Than We Thought

A study measured methane flow from more than 450 nonproducing wells across Canada, but thousands more remain unevaluated.

CAR T Breakthrough Therapy Doubles Survival Time for Deadly Stomach Cancer

Scientists finally figured out a way to take CAR-T cell therapy beyond blood.

The Sun Will Annihilate Earth in 5 Billion Years But Life Could Move to Jupiter's Icy Moon Europa

When the Sun turns into a Red Giant, Europa could be life's final hope in the solar system.

Ancient Roman ‘Fast Food’ Joint Served Fried Wild Songbirds to the Masses

Archaeologists uncover thrush bones in a Roman taberna, challenging elite-only food myths

A Man Lost His Voice to ALS. A Brain Implant Helped Him Sing Again

It's a stunning breakthrough for neuroprosthetics

This Plastic Dissolves in Seawater and Leaves Behind Zero Microplastics

Japanese scientists unveil a material that dissolves in hours in contact with salt, leaving no trace behind.