Enero 24, 2018

DeepMind's AlphaGo Zero removes humans from AI equation

19 Octubre 2017, 07:55 | Bibiana Flor

Google's AlphaGo can now teach itself from scratch to beat humans

DeepMind's latest AI breakthrough is its most significant yet

Well, here's something to chew on: Google's AI research arm DeepMind, the same benevolent creator that spawned AlphaGo, has already rendered that gluteus maximus-spanking version obsolete.

There are other technical elements that define the new AI, which you can dig into courtesy of DeepMind's paper, published in the scientific journal Nature.

When building the first iterations of AlphaGo, the team explored working on a system like AlphaGo Zero, but then the technology didn't work.

Perhaps we should have seen this coming.

Though extremely impressive, AlphaGo Zero won't replace humans anytime soon.

If, in order to function, AlphaGo learned by basing itself on millions of examples of parts played by humans, AlphaGo Zero - The name of new version - does not need any example.

So, this is why it's taken so long for computers to surpass humans at the game.

Responding to the announcement in a separate editorial for Nature, Satinder Singh, the director of the University of Michigan's AI lab, said Zero "massively outperforms the already superhuman AlphaGo" and could be one of the biggest AI advances so far.

Although DeepMind gained prominence by defeating human Go players, the company has also turned its attention to StarCraft II.

All it needed was a basic set of rules for the game. DeepMind's first paper in Nature past year showed that the algorithm learned for a while from how humans played the game, and then started to play itself to refine those skills. In 21 days, it had beaten the previous version that defeated Ke Jei in all three games.

AlphaGo Zero shows great improvements with respect to all its predecessors.

Approaches using purely reinforcement learning have struggled in AI because ability does not always progress consistently, said David Silver, a scientist at DeepMind who has been leading the development of AlphaGo, at the briefing. "Instead, it is able to learn tabula rasa from the strongest player in the world: AlphaGo itself". AlphaGo Zero, along with AlphaGo Master, each only require a single machine with four TPUs.

The fact that human-guided AlphaGo that defeated Sedol couldn't muster a single win against self-taught AlphaGo Zero had researchers arriving at some rather mind-blowing, and perhaps spine-chilling conclusions.

The latest iteration, however, differs from its predecessors: AlphaGo Zero abandons all hand-engineered features, runs only one neural network (versus the two found in earlier models), and relies exclusively on its own knowledge to evaluate positions. By combining tree search with policy and value networks, AlphaGo has finally reached a professional level in Go, providing hope that human-level performance can now be achieved in other seemingly intractable artificial intelligence domains. Furthermore, the AI will be subject to human limits, since its learning is bounded to pre-existent human knowledge. The game has a rich history, and there's a reason it still captures the imagination of people today. Zero performed so well that it won all 100 matches played. All in less than two months. That's because machines will need to figure out solutions to hard problems even when there isn't a large amount of training data to learn from. Go has fixed rules while humans employ general knowledge and add layers of creativity to it.

They provided no information about how the algorithm has fared in solving other problems.

As for Go, the effects of AlphaGo Zero are likely to be seismic. After three hours, the system's strategy was "greedy stone-capturing", indicative of the human novice. Sure, the behaviors that emerged here are novel, and perhaps unprecedented.

AlphaGo Zero could beat the version of AlphaGo that faced Lee Sedol after training for just 36 hours and earned its 100-o score after 72 hours.

Otras noticias

Tendencias Ahora

Furious Iran hits back at Trump over nuclear deal row
Rouhani assured Macron that Iran in turn "will continue to carry out its commitments" in the nuclear accord, the Elysee said. He said that would have allowed the be tougher on Iran when it comes to its "misbehavior" and support of terrorism.

Dozens of Victims and 900 Missing after the Fires in California (Updated)
It's really going to be hard to tell and we'll be talking about the fires from this week from years to come", Kaiser said. Fire officials were investigating whether downed power lines or other utility failures could have sparked the fires.

Antonio Conte rues Chelsea's 'thin' squad and ponders Álvaro Morata recall
It's those ones in between - too quick to take the praise and fast to shirk responsibility - that you need to worry about. The return of Morata will give surely give some relief to the manager Antonio Conte and the Chelsea fans.

Image processing and machine learning on Pixel 2 — Pixel Visual Core
Before the Google Pixel 2 , all updates that were directed to the Google Camera NX were based on the original Pixel phones. Here we'll compare the two phones to explain how their features differ and which is best suited for your needs.

Real Madrid vs Tottenham — Preview
Jan Vertonghen proved his versatility against Bournemouth and he can do so again. This is a very different team, and fans will expect a very different result.

President Trump Tells Democrats to 'Call Me' to Fix Obamacare
Association health plans allow small-business owners, trade groups and others to purchase health insurance packages collectively. The company bought a smaller competitor, Universal American, to focus even more on the growing Medicare Advantage market.

Leicester boss Shakespeare delighted to see Mahrez back on the scoresheet
Allardyce is the former manager of Sunderland, West Ham, Blackburn, Newcastle and Bolton and is famed for never being relegated. Leicester's billionaire Thai owners are ready to offer Dyche a £2million-a-year bumper deal with the promise of cash to spend.

All Wi-Fi devices exposed by "devastating" WPA2 exploit
A staggering number of devices across the globe are likely to be exposed to attack due to WPA2 breach, which occurred at 7 a.m. Furthermore, this is primarily an attack against clients; devices connected to a network, not routers.

Manchester City 2-1 Napoli: Heroic Performance From Ederson Seals Citizens' Win
Mertens has been directly involved in nine of Napoli's past 14 Champions League goals, scoring six and providing three assists. The key to this game could be Dele Alli, who is missing, which could see Harry Kane marooned alone up the pitch.

Nokia unveiled the smartphone in the case of glass
The Nokia 7 model with 4 GB of RAM carries a price tag of 2,499 Yuan (~$377) and its 6 GB RAM variant costs 2,699 Yuan (~$407). On the photography side, the all-new Nokia 7 is equipped with a 16MP rear camera with F1.8 aperture and a dual-tone flash.