Welcome to the Future: Google DeepMind’s Artificial Intelligence Algorithm

21

September

2016

5/5 (16)

In March 2016, something historical happened, something scientists did not expect to happen for at least another decade. An Artificial Intelligence (A.I.) algorithm named AlphaGO beat the current world champion Lee Sedol at the GO board game.  Now, in order to clarify why this indeed has historical consequences, some background information is required. First, some information about the ancient Chinese board game GO: GO is known as the world’s hardest game with close to an infinite number of optional plays. The game is comparable to chess, only with more moves than there are atoms in the entire universe.

Given the amount of options the game GO has, it was not possible for the algorithm to win by ‘brute forcing’. Previously, when computer systems played other games (e.g. chess) brute forcing was used in order to make in-game decisions. Simply said, brute forcing means that an algorithm would calculate all moves it could possibly make and select the most ‘successful’ move. Because brute forcing is not possible with GO, this ancient Chinese game is seen as the Holy Grail (i.e. test) of Artificial Intelligence. As brute forcing was not an option, Google DeepMind’s AlphaGO used a different technique in order to win. And this is where things get spectacular.

For the first time in history, we are seeing a successful demonstration of general purpose Artificial Intelligence. What does general purpose mean in this case? Because the machine learns from experience and data, it can perform well through a wide variety of tasks. Not only within one specific area.
By using reinforced learning and neural networks AlphaGO could mimic the learning process of a human brain. Meaning: it shows that machines have the potential to learn on their own.
AlphaGO’s win over a human with the game GO shows that machines can really learn and think in a human way. According to the DeepMind founders, the algorithm can learn many more things besides GO. Without alteration or guidance by human hand. In other words, AlphaGO can indeed be defined as general purpose A.I.

Google DeepMind, the company responsible for the creation of AlphaGO, is a British Artificial Intelligence company that was founded in September 2010 as DeepMind technologies. It was renamed into Google DeepMind after it was acquired by Google for a shy $500 million. Fun fact: Next to Google’s interest in the company, other well established names are connected with DeepMind. For example, Elon Musk is one of the investors behind the DeepMind company.
The company goal of DeepMind is ‘to solve intelligence’. The team is trying to achieve this by combining the best techniques from machine learning and systems neuroscience, in order to build powerful general purpose algorithms.  Summarized, the goal of Google DeepMind is to formalize intelligence, find out what it is, and how it works. The end game is not just to implement this into machines, but also to understand the human brain.

Back to the AlphaGO algorithm. So far, we have established that it can be defined as a general purpose A.I. with the potential to learn on its own.
But how does the program learn? Through Deep Reinforcement Learning.  Which again makes it very different from other A.I.’s. Other current A.I.’s were only developed for a predefined purpose and only function within their scope (e.g. Siri on your new iPhone 7). DeepMind claims that their algorithm is not pre-programmed and learns from experience. Using only raw pixels as data input.
Reinforcement learning can be seen as the foundation for the program’s win over the current world champion. AlphaGO was trained by showing the programme over 100.000 pictures of amateurs playing GO. Its first task was to mimic these pictures. After it learned to mimic human amateurs their plays, the algorithm was allowed to play against itself. 13 million times to be exact.
Using deep reinforcement learning, the system learns to improve itself incrementally by increasing its win-rate against older versions of itself. The reinforcement system DeepMind uses, is model free. Meaning that it doesn’t need a structure or a set of rules to learn.

Where can DeepMind’s algorithm be used for in the future? In an interview with one of the founders, Mustafa Suleyman, he explains their vision of applying A.I. in healthcare and science. He mentions it will help speed up the process of major breakthroughs by helping human experts discover patterns.

Concluding, general purpose artificial intelligence that uses reinforcement learning could lay the groundwork for unimaginable things in the near future.

Please rate this

The meaning behind ‘artificial’ and ‘intelligence’

17

September

2016

5/5 (1)

 


Artificial intelligence was officially born in 1956 when John McCarthy organized a workshop at the Dartmouth Summer Research Project[1]. The goal was to create an artificial intelligence being. Decades of unsuccessful trials in the attempt to create an artificial being that had an independent intuition have passed. After waiting for 60 years, this March of 2016, the artificial intelligence program AlphaGo beat Lee Sedol, the grand master in Go (the ancient Chinese board game).

But is Go such a difficult game?

Although the concept is simple, the number of moves are almost endless. Go is a game of two and the goal is to surround a total area that is larger than that of your opponent in a 19 by 19 grid. To give you a comparison, in chess 35 is the number of possible moves a player can choose from in a given turn. In Go, a player can choose from 200 different possibilities in one turn. Now, if one were to account to the total number of possible moves of a Go player in a given game, it could amount to 10 to the power of 170. So how much is 10^170? A lot, given that the number of atoms in the universe is estimated to be around 10^81. Therefore, AlphaGo has achieved a big milestone for beating the best Go human master in such a complex game with almost endless possibilities.

How did AlphaGo manage to beat Lee Sedol? Let’s start from the general idea. AlphaGo uses a deep learning technique that is programed to mimic a human neuron. Deep learning belongs to a branch of algorithms and techniques called machine learning. And Artificial intelligence can be defined as a set of algorithms and techniques that mimic the human intelligence. In short, AlphaGo is specific deep learning system of machine learning that built upon algorithms and techniques.

My guess is that deep learning is going to help us understand the human brain better and consequently develop improved techniques for learning. Why? Because deep learning operates and processes information like a human neuron. By feeding the machine a lot of data, the computer can learn to recognize and classify objects by itself[2]. This leads to the creation of the so-called artificial intuition. As a result, I can imagine a world where artificial intuition will provide out-of-the-box perspectives and guide human beings to solve grand challenges.

AlphaGo vs. Lee Sedol enlightened our species once again. Let me explain. After watching the match between AlphaGo and Lee Sedol, Go players grasp an understanding of the possibilities of deep learning. Go players were fascinated with how AlphaGo was playing because it played unlike any human has played the game before. It made moves that did not seem to make sense at first sight but were very fruitful at last. By playing with or watching AlphaGo play, players are influenced by the way the machine plays the game. In turn players are able to learn from AlphaGo by observing these new strategies, which creates a feedback loop. The master himself, Lee Sedol expressed his positive thoughts after the match “I have improved already. It has given me new ideas” referring to AlphaGo[3].

This has been the first glimpse of what AI can bring in the field of learning. Artificial intuition challenges us to think differently, which will reinforce and disrupt the way the human brain thinks time and time again.  Because of this constant thinking enhancements, I believe that Al will challenge the “Homo Sapiens Sapiens” status that modern humans currently hold.


Sources:

[1] http://www-formal.stanford.edu/jmc/history/dartmouth/dartmouth.html

[2] http://www0.cs.ucl.ac.uk/staff/d.silver/web/Publications.html

[3] https://www.wired.com/2016/05/google-alpha-go-ai/

Please rate this