homehome Home chatchat Notifications


DeepMind AI cracks the structure of over 200 million proteins. That's virtually all proteins known to science

We're past a tipping point in science that could prove groundbreaking.

Tibi Puiu
July 28, 2022 @ 8:17 pm

share Share

If someone ever asks you what artificial intelligence has ever done for science, just show them AlphaFold. The program developed by Google’s AI group, known as DeepMind, has decoded the structure of almost all proteins in scientists’ catalogs, over 200 million of them. As the basic building blocks of life, proteins do most of the work in cells, from transmitting signals that regulate organs to protecting the body from bacteria and viruses. The ability to accurately predict the 3D structures of proteins from their amino-acid sequences is thus a huge boon to life sciences and medicine, and nothing short of revolutionary. This is a big deal because before AI scientists could only unravel the structure of a tiny fraction of these proteins.

Solving the protein folding problem

Proteins serve a wide range of purposes. Some are structural, others transport molecules, others still are receptors, and so on. Each of these functions is closely related to its specific shape, which is achieved through folding.

All proteins start off as a linear chain of basic units called amino acids. This primary 1D structure of amino acids contains the “recipe” that a protein uses to fold itself up. A protein will go through repeating stages of folding, adopting a wide range of configurations before reaching its final shape, which happens to be the most energetically favorable one.

However, predicting the 3D structure of a protein from its flat 1D sequence of amino acids is extremely challenging because the number of possible configurations can be staggering. Traditionally, structural biologists have determined protein structures through experimental means, using very expensive and time-consuming methods, such as X-ray crystallography or electron microscopy. Although accurate, this kind of research is very slow, hence we only knew about a few protein structures. But sifting through unfathomable amounts of possibilities for the human mind is exactly the kind of job an AI is best suited for.

DeepMind first revealed AlphaFold in 2020, and the scientific community was immediately blown away. Last year, in collaboration with the European Molecular Biology Laboratory (EMBL), DeepMind released a public database that included 98% of all human proteins, along with the protein structures for 20 other molecules.

Credit: DeepMind.

Now, the database has been expanded to cover all the proteins in almost every organism on Earth that has had its genome sequenced. That’s over 200 million structures.

“You can think of it as covering the entire protein universe,” Demis Hassabis, CEO of DeepMind, said during a press briefing. “We’re at the beginning of a new era now in digital biology.”

Less pipetting, more thinking

As genomic data is expected to swell like a tsunami each year, molecular biologists will have a field day with AlphaFold’s databases, empowering them to ask more advanced questions. For instance, armed with their 3D structures, scientists can now figure out the function of thousands of currently unsolved proteins in the human genome that may be linked to disease-causing gene variants that differ from person to person. They can also produce new drugs faster and respond to global threats like pandemics with greater zeal.

For instance, in early 2020, AlphaFold determined the structures of a handful of SARS-CoV-2 proteins that were determined experimentally. Imagine if a new dangerous pathogen is discovered tomorrow — AlphaFold would be able to quickly decipher its protein structure and rapidly arrive at possible avenues of attack in order to neutralize it.

Elsewhere, a research team led by Professor Matthew Higgins at the University of Oxford used AlphaFold’s predictions to unlock the structure of a key protein from a malaria parasite, allowing them to find the matching antibodies that can block the transmission of the parasite.

An example of a protein structure prediction by AlphaFold that is remarkably accurate compared to experimental data. Credit: DeepMind.

All of AlphaFold’s discovered protein structures, and even its source code, have been published for free. According to DeepMind, over 500,000 researchers from 190 countries have accessed the database so far, viewing two million structures.

However, all of this doesn’t mean the dawn of the experimental search for protein structures. AlphaFold is trained on datasets of protein structures that have been validated experimentally, and more such work is required to make the algorithm even more accurate. In fact, when dealing with highly challenging work, a hybrid approach combining technology and experimentation seems to work marvelously. Earlier this year, three research groups used AlphaFold to help them piece together one of the biggest jigsaw puzzles in biology, the human nuclear pore complex, which regulates the transport of macromolecules between the eukaryotic cell’s nucleus and cytoplasm and is composed of over 1,000 protein subunits.

“Its delicate structure was finally revealed by using existing experimental methods to reveal its outline and AlphaFold predictions to complete and interpret any areas that were unclear. This powerful combination is now becoming routine in labs, unlocking new science and showing how experimental and computational techniques can work together,” the DeepMind team wrote in a blog post.

share Share

Biggest Modern Excavation in Tower of London Unearths the Stories of the Forgotten Inhabitants

As the dig deeper under the Tower of London they are unearthing as much history as stone.

Millions Of Users Are Turning To AI Jesus For Guidance And Experts Warn It Could Be Dangerous

AI chatbots posing as Jesus raise questions about profit, theology, and manipulation.

Can Giant Airbags Make Plane Crashes Survivable? Two Engineers Think So

Two young inventors designed an AI-powered system to cocoon planes before impact.

First Food to Boost Immunity: Why Blueberries Could Be Your Baby’s Best First Bite

Blueberries have the potential to give a sweet head start to your baby’s gut and immunity.

Ice Age People Used 32 Repeating Symbols in Caves Across the World. They May Reveal the First Steps Toward Writing

These simple dots and zigzags from 40,000 years ago may have been the world’s first symbols.

NASA Found Signs That Dwarf Planet Ceres May Have Once Supported Life

In its youth, the dwarf planet Ceres may have brewed a chemical banquet beneath its icy crust.

Nudists Are Furious Over Elon Musk's Plan to Expand SpaceX Launches in Florida -- And They're Fighting Back

A legal nude beach in Florida may become the latest casualty of the space race

A Pig Kidney Transplant Saved This Man's Life — And Now the FDA Is Betting It Could Save Thousands More

A New Hampshire man no longer needs dialysis thanks to a gene-edited pig kidney.

The Earliest Titanium Dental Implants From the 1980s Are Still Working Nearly 40 Years Later

Longest implant study shows titanium roots still going strong decades later.

Common Painkillers Are Also Fueling Antibiotic Resistance

The antibiotic is only one factor creating resistance. Common painkillers seem to supercharge the process.