Course Plan: Game Playing State-of-the-Art
Post on 28-Mar-2022
Course plan
State-based models: search problems, Markov decision processes (MDPs), adversarial games
Variable-based models: constraint satisfaction problems, Bayesian networks
Logic
[These slides adapted from ai.stanford.edu and ai.berkeley.edu]
Game Playing State-of-the-Art

Checkers: 1950: First computer player. 1994: First computer champion: Chinook ended the 40-year reign of human champion Marion Tinsley using a complete 8-piece endgame database. 2007: Checkers solved!

Chess: 1997: Deep Blue defeats human champion Garry Kasparov in a six-game match. Deep Blue examined 200M positions per second and used very sophisticated evaluation plus undisclosed methods for extending some lines of search up to 40 ply (20 moves). Current programs are even better, if less historic.

Go: In Go, b > 300! Classic programs use pattern knowledge bases, but big recent advances came from Monte Carlo (randomized) expansion methods. 2016: AlphaGo defeats the human champion using Monte Carlo Tree Search and a learned evaluation function.
Pacman
Many different kinds of games!
Mainly: Deterministic or stochastic?
One, two, or more players?
Zero sum?
Perfect information (can you see the state)? Chess, for example, is deterministic, two-player, zero-sum, and perfect-information.
We want algorithms for calculating a strategy (policy) that recommends a move from each state.
Types of Games
Zero-Sum Games
Agents have opposite utilities (values on outcomes).
Lets us think of a single value that one agent maximizes and the other minimizes.
Adversarial, pure competition.

General Games
Agents have independent utilities (values on outcomes).
Cooperation, indifference, competition, and more are all possible.
More later on non-zero-sum games.
A simple game
Which bin?
Roadmap
Modeling: games
Algorithms: Expectimax
Minimax
Expectiminimax
Evaluation functions
Alpha-Beta pruning
Game: modeling
Two-player zero-sum games
Halving game
States = {(player ∈ {+1, −1}, number)}
Start = (player, number = N)
IsEnd: number = 0
Utility = player × ∞
Actions = {decrement (−), halve (/)}
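The halving game above can be solved directly with the minimax recursion. A minimal sketch, assuming the slide's conventions: a state is (player, number), the game ends at number = 0, and the end-state utility is player × ∞.

```python
import math

def succ(number, action):
    """Apply an action: '-' decrements the number, '/' halves it (rounding down)."""
    return number - 1 if action == '-' else number // 2

def minimax(player, number):
    """Minimax value of state (player, number); player +1 maximizes, -1 minimizes."""
    if number == 0:
        return player * math.inf          # utility of the end state, per the slide
    values = [minimax(-player, succ(number, a)) for a in ('-', '/')]
    return max(values) if player == +1 else min(values)

print(minimax(+1, 2))   # value of the game starting at N = 2 → inf
```

With this convention, the value of every state is ±∞: with optimal play, one side is guaranteed to win.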
Policies
Let’s play, with humanPolicy.

Game evaluation example
Game evaluation recurrence
Assume the agent and opponent policies are given.
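The game evaluation recurrence named above can be reconstructed as follows (in the course's usual notation; a reconstruction, not the slide's verbatim formula). It assumes fixed, possibly stochastic policies π_agent and π_opp and simply computes the expected utility of following them:

```latex
V_{\text{eval}}(s) =
\begin{cases}
\mathrm{Utility}(s) & \text{if } \mathrm{IsEnd}(s) \\[2pt]
\sum_{a \in \mathrm{Actions}(s)} \pi_{\text{agent}}(s, a)\, V_{\text{eval}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{agent} \\[2pt]
\sum_{a \in \mathrm{Actions}(s)} \pi_{\text{opp}}(s, a)\, V_{\text{eval}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{opp}
\end{cases}
```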
Minimax Example
Tic-tac-toe
Backward induction
Minimax recurrence
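The minimax recurrence replaces the fixed policies of game evaluation with optimization: the agent picks the best action, the opponent the worst (for the agent). Reconstructed in the same notation:

```latex
V_{\text{minmax}}(s) =
\begin{cases}
\mathrm{Utility}(s) & \text{if } \mathrm{IsEnd}(s) \\[2pt]
\max_{a \in \mathrm{Actions}(s)} V_{\text{minmax}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{agent} \\[2pt]
\min_{a \in \mathrm{Actions}(s)} V_{\text{minmax}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{opp}
\end{cases}
```

The minimax policies are extracted by taking the argmax (for the agent) and argmin (for the opponent) of the same expressions.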
Extracting minimax policies

Roadmap
Modeling: games
Algorithms: Expectimax
Minimax
Expectiminimax
Evaluation functions
Alpha-Beta pruning
Adversarial Search (Minimax)
Deterministic, zero-sum games:
Tic-tac-toe, chess, checkers
One player maximizes result
The other minimizes result
Minimax search:
A state-space search tree
Players alternate turns
Compute each node’s minimax value: the best achievable utility against a rational (optimal) adversary
[Figure: minimax search tree. Terminal values (part of the game): 8, 2, 5, 6. Minimax values (computed recursively): the two min nodes take values 2 and 5; the max root takes value 5.]
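The recursive computation on the slide's tree takes only a few lines. A minimal sketch, assuming a game tree represented as nested lists whose leaves are terminal utilities (an illustrative representation, not the course's):

```python
def minimax(node, maximizing):
    """Minimax value of a game tree given as nested lists with numeric leaves."""
    if isinstance(node, (int, float)):            # terminal state: utility is given
        return node
    values = [minimax(child, not maximizing) for child in node]
    return max(values) if maximizing else min(values)

# The slide's tree: a max root over two min nodes with leaves 8, 2 and 5, 6.
tree = [[8, 2], [5, 6]]
print(minimax(tree, maximizing=True))   # → 5
```

Players alternate turns, so the `maximizing` flag flips at every level of the recursion.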
Example
[Figure: expectimax example tree with chance-node branch probabilities 80%, 20%, 100%, 0%, 0%, 100%.]
Expectimax Recurrence
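The expectimax recurrence can be reconstructed as follows (same notation as before; a reconstruction, not the slide's verbatim formula). The agent still maximizes, but the opponent is modeled as playing a known stochastic policy π_opp, so its nodes take expectations rather than minima:

```latex
V_{\text{exptmax}}(s) =
\begin{cases}
\mathrm{Utility}(s) & \text{if } \mathrm{IsEnd}(s) \\[2pt]
\max_{a \in \mathrm{Actions}(s)} V_{\text{exptmax}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{agent} \\[2pt]
\sum_{a \in \mathrm{Actions}(s)} \pi_{\text{opp}}(s, a)\, V_{\text{exptmax}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{opp}
\end{cases}
```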
Face off
blue opponent, red agent
Minimax property 1
Minimax property 2
Minimax property 3
Relationship between game values
A modified game example
Expectiminimax recurrence
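The expectiminimax recurrence can be reconstructed as follows (a reconstruction in the same notation). It handles games with randomness by adding a third player, "coin", whose moves follow fixed known probabilities π_coin:

```latex
V_{\text{exptminmax}}(s) =
\begin{cases}
\mathrm{Utility}(s) & \text{if } \mathrm{IsEnd}(s) \\[2pt]
\max_{a \in \mathrm{Actions}(s)} V_{\text{exptminmax}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{agent} \\[2pt]
\min_{a \in \mathrm{Actions}(s)} V_{\text{exptminmax}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{opp} \\[2pt]
\sum_{a \in \mathrm{Actions}(s)} \pi_{\text{coin}}(s, a)\, V_{\text{exptminmax}}(\mathrm{Succ}(s, a)) & \text{if } \mathrm{Player}(s) = \text{coin}
\end{cases}
```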
Summary so far
Computation
Speeding up minimax
Depth-limited search
Evaluation functions
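Depth-limited search with an evaluation function can be sketched concretely. A minimal illustration, assuming game trees represented as nested lists whose leaves are terminal utilities; the averaging `avg_eval` below is a toy stand-in for a real evaluation function (an assumption, not from the slides):

```python
def depth_limited_minimax(node, depth, maximizing, eval_fn):
    """Minimax that stops at a depth limit and falls back on an
    evaluation function to estimate the value of non-terminal nodes."""
    if isinstance(node, (int, float)):            # terminal state: true utility
        return node
    if depth == 0:                                # cutoff: estimate with Eval
        return eval_fn(node)
    values = [depth_limited_minimax(child, depth - 1, not maximizing, eval_fn)
              for child in node]
    return max(values) if maximizing else min(values)

def leaves(node):
    """All terminal utilities under a node (helper for the toy Eval)."""
    if isinstance(node, (int, float)):
        return [node]
    return [u for child in node for u in leaves(child)]

def avg_eval(node):
    """Toy evaluation function: average of the terminal utilities below."""
    vals = leaves(node)
    return sum(vals) / len(vals)

tree = [[8, 2], [5, 6]]
print(depth_limited_minimax(tree, 2, True, avg_eval))   # full search → 5
print(depth_limited_minimax(tree, 1, True, avg_eval))   # cutoff at depth 1 → 5.5
```

Note that the cutoff changes the answer: the quality of a depth-limited search depends entirely on how well Eval approximates the true minimax value.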
Summary: evaluation functions

Roadmap
Modeling: games
Algorithms: Expectimax
Minimax
Expectiminimax
Evaluation functions
Alpha-Beta pruning
Game Tree Pruning
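As a preview of alpha-beta pruning, here is a minimal sketch on game trees represented as nested lists with numeric leaves (an illustrative representation). Alpha tracks the best value the maximizer can already guarantee, beta the best for the minimizer; whole branches that cannot change the root value are skipped:

```python
import math

def alphabeta(node, maximizing, alpha=-math.inf, beta=math.inf):
    """Minimax value with alpha-beta pruning; same result as plain minimax."""
    if isinstance(node, (int, float)):            # terminal state
        return node
    if maximizing:
        value = -math.inf
        for child in node:
            value = max(value, alphabeta(child, False, alpha, beta))
            alpha = max(alpha, value)
            if alpha >= beta:      # the minimizer will never allow this branch
                break
        return value
    else:
        value = math.inf
        for child in node:
            value = min(value, alphabeta(child, True, alpha, beta))
            beta = min(beta, value)
            if alpha >= beta:      # the maximizer already has something better
                break
        return value

print(alphabeta([[3, 12, 8], [2, 4, 6], [14, 5, 2]], True))   # → 3
```

Pruning never changes the computed value, only the amount of work; with good move ordering it roughly doubles the searchable depth.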