Microsoft, Alibaba AI programs beat humans in a Stanford reading test

First, it was chess. Then it was Go. Now it’s basic reading comprehension.

The robots are coming.

Two artificial intelligence programs created by Chinese e-commerce giant Alibaba and Microsoft beat humans on a Stanford University reading comprehension test, Alibaba said Monday. Alibaba took the honor as the creator of the first program to ever beat a human in a reading comprehension test, scoring 82.44 percent out of a perfect 100 and narrowly edging past the human’s 82.304 percent.

A different program built by Microsoft scored higher than Alibaba’s, at 82.605 percent. Microsoft’s program took the same test as Alibaba’s, but its result was finalized a day later, according to Bloomberg.


The test, known as the Stanford Question Answering Dataset, or SQuAD for short, asks contestants, both human and robot, to provide exact answers to more than 100,000 questions drawn from more than 500 Wikipedia articles. It is designed to see whether artificial intelligence can process large amounts of information, comprehend it and offer precise answers.

The Wikipedia articles from which the questions were drawn covered a wide range of topics, from Super Bowl 50 (“Where did Super Bowl 50 take place?” Answer: Santa Clara, California.) to Doctor Who (“What planet is Doctor Who from?” Answer: Gallifrey.).

“These kinds of tests are certainly useful benchmarks for how far along the AI journey we may be,” Microsoft spokesperson Andrew Pickup told CNN. “However, the real benefit of AI is when it is used in harmony with humans.”

Major technology companies in the United States and China have invested billions of dollars in artificial intelligence to gain a foothold in what may be the next technological frontier. The Chinese government has outlined a plan to create a $150 billion AI industry by 2030 in close partnership with private companies such as Alibaba and Tencent.

Microsoft in December announced an expansion of its “AI for Earth” project, which aims to help the planet become more environmentally sustainable using the company’s in-house AI infrastructure. The company will invest $50 million over the next five years, according to Microsoft President Brad Smith.

“At Microsoft, we believe artificial intelligence is a game changer,” said Smith. “As we enter the world’s Fourth Industrial Revolution, a technology-fueled transformation, we must not only move technology forward, but also use this era’s technology to clean up the past and create a better future.”

With its program’s comprehension skills now arguably better than a human’s, Alibaba’s chief scientist for natural language processing said the breakthrough will be applied to helping human customers.

“The technology underneath can be gradually applied to numerous applications such as customer service, museum tutorials and online responses to medical inquiries from patients, decreasing the need for human input in an unprecedented way,” Luo Si, chief scientist for natural language processing at Alibaba’s Institute of Data Science and Technologies, told Bloomberg.

Photo: Jack Ma, CEO of the Alibaba Group, speaks during a news conference on June 18, 2015 in Chiba, Japan, alongside the company’s humanoid robot, Pepper. (Koki Nagahama/Getty Images)



  • ai curious

    Is a difference of .1% – .3% on a single test statistically significant? Also, how many humans were involved in this study? Lastly, was this a double-blind controlled study? Hmmm…