homehome Home chatchat Notifications


Text AI can produce images -- and it's very good at it

AI is already nearing sci-fi territory.

Mihai Andrei
July 31, 2020 @ 1:26 pm

share Share

This AI was designed to work with text. Now, researchers have tweaked it to work with images, predicting pixels and filling out incomplete images.

GPT-2 is a text-generating algorithm. Trained on billions and billions of pages of words, it’s capable of absorbing the structure of the text and then writing texts of its own, starting from simple prompts. The algorithm also uses unsupervised learning, which makes it much easier for researchers to train it without taking a lot of their time. The AI system was presented in February and proved capable of writing convincing passages of English.

Now, researchers have put GPT-2 up to a different task: working with images.

The algorithm itself is not well-suited to working with images, at least not in a conventional sense. It was designed to work with one-dimensional data (strings of letters), not 2D images.

To bypass this shortcoming, researchers unfurled images into a single string of pixels, essentially treating pixels as if they were letters. After the algorithm was trained thusly, the new version of the algorithm was called iGPT.

They then fed halves of images and asked the AI to complete the picture. Here are some examples:

Image credits: OpenAI.

The results are already impressive. If you look at the lower half of the photos above, they’re all generated by the AI, pixel by pixel, and they look eerily realistic. The three birds, for instance, are shown standing on different surfaces, all of them believable. The droplets of water too show different veridic possibilities, and all in all, it’s an amazing accomplishment from iGPT.

This also hints at one of the holy grails of machine learning: generalizable algorithms. Nowadays, AIs can be very good at a single task (whether it’s chess, text, or images), but it’s still only one task. Using one algorithm for multiple tasks is an encouraging sign for generalizable approaches.

The results are even more exciting when you consider that GPT-2 is already last year’s AI. Recently, the next generation, GPT-3, was presented by researchers and it’s already putting its predecessor to shame, by generating some stunningly realistic texts.

There’s no telling what GPT-3 will be capable of, both in terms of text generation and image generation. It’s exciting — and a little bit scary — to imagine the results.

The original paper can be read here.

share Share

Hidden for over a century, a preserved Tasmanian Tiger head "found in a bucket" may bring the lost species back from extinction

Researchers recover vital RNA from Tasmanian tiger, pushing de-extinction closer to reality.

Island Nation Tuvalu Set to Become the First Country Lost to Climate Change. More Than 80% of the Population Apply to Relocate to Australia Under World's First 'Climate Visa'

Tuvalu will likely become the first nation to vanish because of climate change.

Archaeologists Discover 6,000 Year Old "Victory Pits" That Featured Mass Graves, Severed Limbs, and Torture

Ancient times weren't peaceful by any means.

Space Solar Panels Could Cut Europe’s Reliance on Land-Based Renewables by 80 Percent

A new study shows space solar panels could slash Europe’s energy costs by 2050.

A 5,000-Year-Old Cow Tooth Just Changed What We Know About Stonehenge

An ancient tooth reshapes what we know about the monument’s beginnings.

Astronomers See Inside The Core of a Dying Star For the First Time, Confirm How Heavy Atoms Are Made

An ‘extremely stripped supernova’ confirms the existence of a key feature of physicists’ models of how stars produce the elements that make up the Universe.

Rejoice! Walmart's Radioactive Shrimp Are Only a Little Radioactive

You could have a little radioactive shrimp as a treat. (Don't eat any more!)

Newly Found Stick Bug is Heavier Than Any Insect Ever Recorded in Australia

Bigger than a cockroach and lighter than a golf ball, a giant twig emerges from the misty mountains.

Chevy’s New Electric Truck Just Went 1,059 Miles on a Single Charge and Shattered the EV Range Record

No battery swaps, no software tweaks—yet the Silverado EV more than doubled its 493-mile range. How’s this possible?

Dolphins and Whales Can Be Friends and Sometimes Hang Out Together

They have a club and you're not invited.