homehome Home chatchat Notifications


AI scored on par with a four-year old

Despite decades worth of research, unbelievable computing power and sophisticated algorithms, one of today's best artificial intelligence can't score better than a four year old on a standard IQ test.

Tibi Puiu
October 5, 2015 @ 1:33 pm

share Share

Despite decades worth of research, unbelievable computing power and sophisticated algorithms, one of today’s best artificial intelligence can’t score better than a four year old on a standard IQ test.

futurama bender

Image: hwalls.com

There’s a double purpose to artificial intelligence, according to Herbert Simon one of the field’s pioneers. One is to use the power of computers to augment human thinking, just as we use motors to augment human or horse power. Robotics and expert systems are major branches of that. The other is to use a computer’s artificial intelligence to understand how humans think. Essentially, by building artificial replicas of the human brain we might understand some of the fundamental tenants that make us human, like consciousness. Maybe some day will answer the long lasting question of whether or not we have a soul.

It all sounds extremely exciting, but progress is slow even though it might not look like it. In 1997, IBM’s Deep Blue computer defeated world chess champion Gary Gasparov. Then, in 2011 Watson defeated the best Jeopardy! human player ever. Both made headlines, but it’s important not to lose sight of the fact that these machines, when taken out of their natural setting (i.e. made to do something they weren’t program to do) are plain stupid.

Even the best ones – the kind made to ‘think’ like a human – have problems reasoning like we do, as  Stellan Ohlsson demonstrated. Ohlsson, a computer scientist at University of Illinois, reprogrammed ConceptNet, one of the most famous AI under constant development at MIT since 1990, so it could answer questions on an IQ test destined to children. The test, called the Wechsler Preschool and Primary Scale of Intelligence test, assess performance in five categories:  information, vocabulary, word reasoning, comprehension, and similarities.

For “information” related questions, ConceptNet had to answer questions like “Where can you find penguins?”, while in “vocabulary” the computer had to know “what is a house?”, for instance. In these categories, as well as in word reasoning or similarities where the computer had to know that “pen an pencil are both___”, ConceptNet fared alright. On the comprehension test, however, the computer failed miserably. When asked “why do people shake hands?”, the AI hilariously answered because of “epileptic fits”. There were other instances where ConceptNet failed the mark. During the word reasoning part, the AI was given the following clues “This animal has a mane if it is male,” “this is an animal that lives in Africa,” and “this a big yellowish-brown cat.” Instead of lion, the AI came up with the following answers in order of the value assigned to each one:  dog, farm, creature, home, and cat.

“Common sense should at the very least confine the answer to animals, and should also make the simple inference that, “if the clues say it is a cat, then types of cats are the only alternatives to be considered,” say Ohlsson and co.

“The ConceptNet system scored a WPPSI-III VIQ that is average for a four-year-old child, but below average for five- to seven-year-olds,” they say.

So, don’t fret yet. Our android overlords are still many years away.

share Share

Mexico Will Give U.S. More Water to Avert More Tariffs

Droughts due to climate change are making Mexico increasingly water indebted to the USA.

Chinese Student Got Rescued from Mount Fuji—Then Went Back for His Phone and Needed Saving Again

A student was saved two times in four days after ignoring warnings to stay off Mount Fuji.

The perfect pub crawl: mathematicians solve most efficient way to visit all 81,998 bars in South Korea

This is the longest pub crawl ever solved by scientists.

This Film Shaped Like Shark Skin Makes Planes More Aerodynamic and Saves Billions in Fuel

Mimicking shark skin may help aviation shed fuel—and carbon

China Just Made the World's Fastest Transistor and It Is Not Made of Silicon

The new transistor runs 40% faster and uses less power.

Ice Age Humans in Ukraine Were Masterful Fire Benders, New Study Shows

Ice Age humans mastered fire with astonishing precision.

The "Bone Collector" Caterpillar Disguises Itself With the Bodies of Its Victims and Lives in Spider Webs

This insect doesn't play with its food. It just wears it.

University of Zurich Researchers Secretly Deployed AI Bots on Reddit in Unauthorized Study

The revelation has sparked outrage across the internet.

Giant Brain Study Took Seven Years to Test the Two Biggest Theories of Consciousness. Here's What Scientists Found

Both came up short but the search for human consciousness continues.

The Cybertruck is all tricks and no truck, a musky Tesla fail

Tesla’s baking sheet on wheels rides fast in the recall lane toward a dead end where dysfunctional men gather.