Baidu's new AI better than Google's

cosmicalstorm · Post by **cosmicalstorm** » 2015-05-15 06:19am

I'm currently reading Superintelligence by Nick Bostrom. That, along with some AI discussions on various forums plus the occasional news snippet like the one below, gives me a weird feeling of quiet dread about what might come in the decades ahead.

http://www.technologyreview.com/news/53 ... cognition/

Baidu’s Artificial-Intelligence Supercomputer Beats Google at Image Recognition

A supercomputer specialized for the machine-learning technique known as deep learning could help software understand us better.

Chinese search giant Baidu says it has invented a powerful supercomputer that brings new muscle to an artificial-intelligence technique giving software more power to understand speech, images, and written language.

The new computer, called Minwa and located in Beijing, has 72 powerful processors and 144 graphics processors, known as GPUs. Late Monday, Baidu released a paper claiming that the computer had been used to train machine-learning software that set a new record for recognizing images, beating a previous mark set by Google.

“Our company is now leading the race in computer intelligence,” said Ren Wu, a Baidu scientist working on the project, speaking at the Embedded Vision Summit on Tuesday. Minwa’s computational power would probably put it among the 300 most powerful computers in the world if it weren’t specialized for deep learning, said Wu. “I think this is the fastest supercomputer dedicated to deep learning,” he said. “We have great power in our hands—much greater than our competitors.”

Computing power matters in the world of deep learning, which has produced breakthroughs in speech, image, and face recognition and improved the image-search and speech-recognition services offered by Google and Baidu.

The technique is a souped-up version of an approach first established decades ago, in which data is processed by a network of artificial neurons that manage information in ways loosely inspired by biological brains. Deep learning involves using larger neural networks than before, arranged in hierarchical layers, and training them with significantly larger collections of data, such as photos, text documents, or recorded speech.

So far, bigger data sets and networks appear to always be better for this technology, said Wu. That’s one way it differs from previous machine-learning techniques, which had begun to produce diminishing returns with larger data sets. “Once you scaled your data beyond a certain point, you couldn’t see any improvement,” said Wu. “With deep learning, it just keeps going up.” Baidu says that Minwa makes it practical to create an artificial neural network with hundreds of billions of connections—hundreds of times more than any network built before.

A paper released Monday is intended to provide a taste of what Minwa’s extra oomph can do. It describes how the supercomputer was used to train a neural network that set a new record on a standard benchmark for image-recognition software. The ImageNet Classification Challenge, as it is called, involves training software on a collection of 1.5 million labeled images in 1,000 different categories, and then asking that software to use what it learned to label 100,000 images it has not seen before.

Software is compared on the basis of how often its top five guesses for a given image miss the correct answer. The system trained on Baidu’s new computer was wrong only 4.58 percent of the time. The previous best was 4.82 percent, reported by Google in March. One month before that, Microsoft had reported achieving 4.94 percent, becoming the first to better average human performance of 5.1 percent.
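A minimal sketch of how a top-five error rate like the ones quoted above can be computed. The function name `top5_error` and the toy scores are made up for illustration; this is not the benchmark's official scoring code.

```python
def top5_error(scores, labels, k=5):
    """Fraction of examples whose true label is NOT among the k highest-scoring classes."""
    misses = 0
    for row, label in zip(scores, labels):
        # Indices of the k classes with the largest scores for this image.
        top_k = sorted(range(len(row)), key=lambda c: row[c], reverse=True)[:k]
        if label not in top_k:
            misses += 1
    return misses / len(labels)


# Toy data: 4 images, 6 classes, identical made-up scores for every image.
row = [0.30, 0.25, 0.20, 0.15, 0.07, 0.03]
scores = [row, row, row, row]
labels = [0, 5, 4, 2]  # class 5 has the lowest score, so image 2 is a top-5 miss

print(top5_error(scores, labels))       # top-5 error
print(top5_error(scores, labels, k=1))  # stricter top-1 error on the same guesses
```

With 1,000 categories, a guess counts as correct as long as the true label appears anywhere in the five highest-scoring classes, which is why top-five error is much lower than top-one error for the same system.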

Wu said that Minwa had made it possible to train the system on higher-resolution images. It also permitted use of a technique that turned the original 1.2 million training images into two billion by distorting them, flipping them, and altering their colors. Using that larger training set improved accuracy by preventing the system from becoming too fixated on the exact details of the training images, said Wu. The resulting system should be better at handling real-world photos, he said.
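The kind of augmentation described above (cropping or distorting, flipping, and altering colors) can be sketched roughly as follows. This is an illustrative toy on nested lists of RGB tuples, not Baidu's pipeline; the function names, crop size, and shift range are all assumptions.

```python
import random


def flip_horizontal(image):
    """Mirror each row of pixels left-to-right."""
    return [list(reversed(row)) for row in image]


def shift_colors(image, offsets):
    """Add a per-channel offset to every pixel, clamping each channel to 0..255."""
    return [
        [tuple(max(0, min(255, c + o)) for c, o in zip(pixel, offsets)) for pixel in row]
        for row in image
    ]


def random_crop(image, height, width, rng):
    """Cut out a height x width window at a random position."""
    top = rng.randint(0, len(image) - height)
    left = rng.randint(0, len(image[0]) - width)
    return [row[left:left + width] for row in image[top:top + height]]


def augment(image, rng, crop_size=(3, 3), max_shift=20):
    """Produce one randomly perturbed copy of `image` (hypothetical parameters)."""
    out = random_crop(image, crop_size[0], crop_size[1], rng)
    if rng.random() < 0.5:
        out = flip_horizontal(out)
    offsets = tuple(rng.randint(-max_shift, max_shift) for _ in range(3))
    return shift_colors(out, offsets)


# Toy 4x4 RGB "image"; each call to augment() yields a slightly different variant.
image = [[(10 * r, 10 * c, 128) for c in range(4)] for r in range(4)]
rng = random.Random(0)
variants = [augment(image, rng) for _ in range(8)]
print(len(variants), "variants generated from one original")
```

Because every variant keeps the original label, the network sees many slightly different versions of each picture and has less opportunity to memorize pixel-level details.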

As those slim margins of victory on the ImageNet challenge might suggest, deep learning is now ready for tougher challenges than image recognition, such as interpreting video or describing images in sentences (see “Google’s Brain-Inspired Software Describes What It Sees in Complex Images”). Wu said that as well as thinking about how to make Minwa even larger and use it on video and text, Baidu’s researchers are working on ways to shrink their trained neural networks so they can operate on mobile devices.

He showed a video of a prototype smartphone app that can recognize different breeds of dog, using a condensed version of a deep-learning network trained on a predecessor to Minwa. “If you know how to tap the computational power of a phone’s GPUs, you can actually recognize on the fly directly from the image sensor,” he said.

madd0ct0r · Post by **madd0ct0r** » 2015-05-15 07:40am

The revolution is here, and has been so for the last decade. Nothing will change, then everything.

Elaro · Post by **Elaro** » 2015-05-19 08:27pm

Deep learning? You mean this?

Ziggy Stardust · Post by **Ziggy Stardust** » 2015-05-19 08:37pm

Elaro wrote: Deep learning? You mean this?

Indeed, this is the new big issue in statistical science in general. A lot of these modern, powerful techniques in machine learning have been shown to be incredibly sensitive to random noise. I can't find the link at the moment, but I once read a similar paper to the one you linked to that showed how these algorithms (in the context of genomics research) fit to randomly generated data still produced "significant" findings. That's why so much of statistics is now focused on model validation methods to try and get around this problem.
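A minimal sketch of that effect, assuming a deliberately simple setup: 1,000 purely random "features", a purely random outcome, and plain Pearson correlation as the "model". None of this comes from the paper mentioned; the sample sizes and seed are arbitrary.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


rng = random.Random(42)
n_samples, n_features, n_train = 40, 1000, 20

# Everything is noise: no feature has any real relationship to the outcome.
outcome = [rng.gauss(0, 1) for _ in range(n_samples)]
features = [[rng.gauss(0, 1) for _ in range(n_samples)] for _ in range(n_features)]

# Pick the feature that looks best on the first half of the data...
train_r = [pearson(f[:n_train], outcome[:n_train]) for f in features]
best = max(range(n_features), key=lambda i: abs(train_r[i]))

# ...then check the same feature on the half it was not selected on.
holdout_r = pearson(features[best][n_train:], outcome[n_train:])

print(f"best feature on training half: r = {train_r[best]:+.2f}")
print(f"same feature on held-out half: r = {holdout_r:+.2f}")
```

Searching enough random candidates almost always turns up one that looks strongly related to the outcome on the data used for the search. Checking it on held-out data is the simplest form of the validation step described above.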

cosmicalstorm · Post by **cosmicalstorm** » 2015-05-20 01:11am

I've finished Superintelligence now. It reminds me a lot of Starglider's AI-FAQ on this board, but the book is a lot more fleshed out. The value-loading problem is really nasty. The chances we will get it right seem infinitely small, especially when most serious AI research gets done by militaries and corporations. I guess I have known this in some sense ever since watching Terminator.

It is ironic that humans will likely get the same treatment that we have given the animal world.
Worst case scenario: I Have No Mouth, and I Must Scream
http://en.wikipedia.org/wiki/I_Have_No_ ... ust_Scream

The Grim Squeaker · Post by **The Grim Squeaker** » 2015-05-20 03:34am

A. I work in the field.
B. This is nice, but it's a relatively minor technical advance (more GPUs/oomph, not anything interesting in terms of network architecture or techniques, unlike, say, Microsoft's network and use of PReLUs, or batch normalization).
C. Let's see how it does in ImageNet 2015; beating last year's data is always easier when you have access to the validation/test set.

The basis of the approach, as pushed by Ng, is training set augmentation. That's been used a LOT for a while now; the problem is scaling it up. (Aka, bye-bye RAM.)
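A rough back-of-the-envelope for why scaling augmentation up hurts, using the two figures from the article (1.2 million originals, two billion augmented images). The 256x256 RGB resolution at one byte per channel is purely an assumption for illustration; the article does not give the resolution used.

```python
# Figures quoted in the article.
original_images = 1_200_000
augmented_images = 2_000_000_000

# ASSUMPTION: 256x256 pixels, 3 color channels, 1 byte per channel, uncompressed.
bytes_per_image = 256 * 256 * 3

variants_per_original = augmented_images / original_images
total_bytes = augmented_images * bytes_per_image

print(f"about {variants_per_original:.0f} variants per original image")
print(f"about {total_bytes / 1e12:.0f} TB to hold every variant uncompressed")
```

Under that assumption the full augmented set is far too large to keep in memory, so variants would have to be generated on the fly during training rather than stored.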

StarDestroyer.Net BBS
