homehome Home chatchat Notifications


More than 24,000 AI-readable coronavirus scientific articles go online

The sum of human knowledge on the new coronavirus is now online, in a format readable by artificial intelligence.

Tibi Puiu
March 19, 2020 @ 2:05 am

share Share

Credit: Pixabay.

Scientists all over the world are racing around the clock on candidate vaccines, antiviral treatments, and just about anything they can throw at the novel coronavirus. In order to aid their efforts and accelerate unprecedented scientific action, a database that pools more than 24,000 research papers related to SARS-CoV-2 (the scientific name for the virus that causes the COVID-19 pandemic) and other coronaviruses is now online in a single place.

The most comprehensive coronavirus scientific database

The Covid-19 Open Research Dataset (CORD-19) is the work of several philanthropic and research organizations, including The National Library of Medicine (NLM) at the National Institutes of Health, the Allen Institute for AI, Georgetown University, the Chan Zuckerberg Initiative, Kaggle, Microsoft, and the White House Office of Science and Technology Policy (OSTP).

Each organization contributed with resources and know-how to the best of their ability. For instance, the NLM provided access to scientific literature while Microsoft used its engineering abilities to index and map all these thousands of articles that were scattered across the web. The Allen Institute for Artificial Intelligence (AI2), a non-profit, converted all the articles into a common structured format that can be parsed by algorithms.

Additionally, the entire dataset is machine-readable, allowing artificial intelligence (AI) systems to access and interpret the huge body of knowledge. This way, scientists might find existing safe drugs and therapies designed to treat other conditions that could prove useful in the current war on the coronavirus. Or perhaps they might find a chink in the coronavirus’ armor that has so far escaped scientists.

Previously, Microsoft researchers had employed machine learning and natural language analysis to interpret the content of thousands of biomedical papers. This initiative led to a representation of cellular regulatory networks that was exploited to make recommendations for cancer therapies.

According to MIT Technology Review, the dataset is part of AI2’s Semantic Scholar service, which employs natural language models like ELMo and BERT to plot relationships between papers.

For a long time, there has been a fierce debate among scholars regarding access to scientific papers, many of which are behind paywalls controlled by a handful of publishers.

Proponents of open access — free, unrestricted access to scientific papers — will be at least happy to learn that in this situation great efforts have been made to ensure the global research community has unhindered access to the coronavirus-related papers.

“It’s my hope that the machine-readable content will stimulate advances in computing methods that can help investigators to develop deeper understandings and approaches to addressing the COVID-19 pandemic. Developing tools to help scientists to do research and synthesize new understandings has been a long-term aspiration in AI. Work has been underway over years on methods that can answer questions, analyze and summarize the content of numerous scientific papers, assess the credibility of clinical trials, generate and test hypotheses, and guide experimentation,” Eric Horvitz, Technical Fellow and Chief Scientific Officer at Microsoft, wrote in a recent blog post.

The dataset also includes pre-publication research posted on servers like medRxiv and bioRxiv, which are open access archives for pre-print health sciences and biology research.

“Sharing vital information across scientific and medical communities is key to accelerating our ability to respond to the coronavirus pandemic,” Chan Zuckerberg Initiative Head of Science Cori Bargmann said refering to the CORD-19 project.

share Share

Oldest Firearm in the US, A 500-Year-Old Cannon Unearthed in Arizona, Reveals Native Victory Over Conquistadores

In Arizona’s desert, a 500-year-old cannon sheds light on conquest, resistance, and survival.

No, RFK Jr, the MMR vaccine doesn’t contain ‘aborted fetus debris’

Jesus Christ.

“How Fat Is Kim Jong Un?” Is Now a Cybersecurity Test

North Korean IT operatives are gaming the global job market. This simple question has them beat.

This New Atomic Clock Is So Precise It Won’t Lose a Second for 140 Million Years

The new clock doesn't just keep time — it defines it.

A Soviet shuttle from the Space Race is about to fall uncontrollably from the sky

A ghost from time past is about to return to Earth. But it won't be smooth.

The world’s largest wildlife crossing is under construction in LA, and it’s no less than a miracle

But we need more of these massive wildlife crossings.

Your gold could come from some of the most violent stars in the universe

That gold in your phone could have originated from a magnetar.

Ronan the Sea Lion Can Keep a Beat Better Than You Can — and She Might Just Change What We Know About Music and the Brain

A rescued sea lion is shaking up what scientists thought they knew about rhythm and the brain

Did the Ancient Egyptians Paint the Milky Way on Their Coffins?

Tomb art suggests the sky goddess Nut from ancient Egypt might reveal the oldest depiction of our galaxy.

Dinosaurs Were Doing Just Fine Before the Asteroid Hit

New research overturns the idea that dinosaurs were already dying out before the asteroid hit.