Game Playing AI
(based on earlier lecture from Stephen Gould)
Structured Programming COMP1110/COMP1140/COMP6710
Origins
   • Schachtürke (Chess Turk), 1770
   • El Ajedrecista (The Chess Player), 1912
   • Plankalkül (Plan Calculus), 1941
Early History of AI
  John von Neumann    John McCarthy   Arthur Samuel
Games
A game consists of a set of two or more players, a set of moves for the
players, and a specification of payoffs (outcomes) for each combination
of strategies.
Many different types of games:
   •   two-person zero-sum
   •   multi-player
   •   perfect information games
   •   imperfect information games
   •   games of chance
Game Trees
A strategy defines a complete plan of action for a given player.
Given enough processing time, an optimal strategy can be found for games of
perfect information by enumerating the paths of a game tree. However, in
practice this can only be done for small games.
Minimax
Consider two players, MAX and MIN. Player MAX is trying to maximize
the score and player MIN is trying to minimize the score. We assume
that the players are rational.
Minimax
The minimax algorithm allows each player to compute their optimal
move on a game tree of alternating MAX and MIN nodes.
  max-value(s)                     min-value(s)
    if terminal(s) then              if terminal(s) then
      return v(s)                      return v(s)
    end if                           end if
    v = −∞                           v = ∞
    for each successor s’ do         for each successor s’ do
      v = max(v, min-value(s’))        v = min(v, max-value(s’))
    end for                          end for
    return v                         return v
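A minimal Java sketch of the minimax pseudocode above, written against a
hypothetical Node record (not from any library) whose leaves carry the
terminal payoff v(s); it is one possible structure, not the only one.

import java.util.List;

// Hypothetical game-tree node: an empty child list marks a terminal state.
record Node(double value, List<Node> children) {
    boolean isTerminal() { return children.isEmpty(); }
}

class Minimax {
    // MAX picks the child with the largest minimax value.
    static double maxValue(Node s) {
        if (s.isTerminal()) return s.value();           // v(s)
        double v = Double.NEGATIVE_INFINITY;
        for (Node succ : s.children()) {
            v = Math.max(v, minValue(succ));
        }
        return v;
    }

    // MIN picks the child with the smallest minimax value.
    static double minValue(Node s) {
        if (s.isTerminal()) return s.value();           // v(s)
        double v = Double.POSITIVE_INFINITY;
        for (Node succ : s.children()) {
            v = Math.min(v, maxValue(succ));
        }
        return v;
    }
}

Called on the example tree on the next slide (leaf values 3, 5, 7, 2, 2, 1, 4, 8),
maxValue at the root returns 5.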
Minimax Example
Game tree, level by level from the root (each internal node takes the max or
min of its children's values):
MAX (root):  5
MIN:         5   2
MAX:         5   7   2   8
Leaf values: 3   5   7   2   2   1   4   8
Alpha-beta Pruning
Minimax suffers from the problem that the number of game states it
has to examine is exponential in the number of moves.
Alpha-beta pruning is a method for reducing the number of nodes that
need to be evaluated by skipping subtrees that cannot affect the final
minimax decision.
Alpha-beta pruning places bounds on the values appearing anywhere
along a path:
   • 𝛼: the best (highest) value found so far for MAX
   • 𝛽: the best (lowest) value found so far for MIN
𝛼 and 𝛽 propagate down the game tree. The value v propagates up the
game tree.
Alpha-beta Pruning (2)
Initialize 𝛼 = −∞ and 𝛽 = ∞
 max-value(𝐬, 𝜶, 𝜷)                  min-value(𝐬, 𝜶, 𝜷)
   if terminal(s) then                 if terminal(s) then
     return v(s)                         return v(s)
   end if                              end if
   v = −∞                              v = ∞
   for each successor s’ do            for each successor s’ do
     v = max(v, min-value(s’,𝛼,𝛽))       v = min(v, max-value(s’,𝛼,𝛽))
     if v ≥ 𝛽 then                       if v ≤ 𝛼 then
       return v                            return v
     end if                              end if
     𝛼 = max(𝛼, v)                       β = min(𝛽, v)
   end for                             end for
   return v                            return v
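A minimal Java sketch of the same procedure, reusing the hypothetical Node
record from the minimax sketch above; the only change is that 𝛼 and 𝛽 are
threaded through the recursion and used to cut off branches early.

class AlphaBeta {
    // Entry point for a MAX node: start with 𝛼 = −∞ and 𝛽 = +∞.
    static double search(Node root) {
        return maxValue(root, Double.NEGATIVE_INFINITY, Double.POSITIVE_INFINITY);
    }

    static double maxValue(Node s, double alpha, double beta) {
        if (s.isTerminal()) return s.value();
        double v = Double.NEGATIVE_INFINITY;
        for (Node succ : s.children()) {
            v = Math.max(v, minValue(succ, alpha, beta));
            if (v >= beta) return v;              // MIN will never let play reach here
            alpha = Math.max(alpha, v);
        }
        return v;
    }

    static double minValue(Node s, double alpha, double beta) {
        if (s.isTerminal()) return s.value();
        double v = Double.POSITIVE_INFINITY;
        for (Node succ : s.children()) {
            v = Math.min(v, maxValue(succ, alpha, beta));
            if (v <= alpha) return v;             // MAX will never let play reach here
            beta = Math.min(beta, v);
        }
        return v;
    }
}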
Alpha-beta Pruning Example
Same game tree as the minimax example, with subtrees that cannot affect the
result pruned:
MAX (root):       5
MIN:              5   2
MAX:              5   7   2
Leaves evaluated: 3   5   7   2   1   (remaining leaves pruned)
Multi-player Games
When we have more than two players we need to adapt the minimax
approach. The most conservative strategy is to assume that all of your
opponents are conspiring to minimize your score.
   • Treat your opponents as one big powerful player.
Big Games (e.g., Patchwork)
In the opening position (33 pieces available) there are >100 possible moves*,
and each subsequent position offers >400 possible moves*, so the game tree
explodes:
   • 1 ply: more than 100 moves
   • 2 plies: more than 100 × 400 = 40,000 moves
   • 3 plies: more than 100 × 400 × 400 = 16,000,000 moves
   • 4 plies: more than 100 × 400 × 400 × 400 = 6,400,000,000 moves
   • … and so on, down to the endgame with ~10 pieces remaining.
* Depending on available pieces.
Static Evaluation Function
For real-world games, even with alpha-beta pruning, we still can't
search the entire game tree. In these situations, instead of a terminal
test, we introduce a cut-off test that applies a heuristic value at some
intermediate game state.
The heuristic is called a static evaluation function and it returns an
estimate of the expected payoff from a given position.
Machine learning techniques are often used to find a good static
evaluation function based on a linear combination of features:
                     v̂(s) = w₁f₁(s) + ⋯ + wₙfₙ(s)
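As an illustration, here is a small Java sketch of such a linear evaluation
function; the weights and feature functions are placeholders to be supplied by
the game (and, typically, tuned by machine learning).

import java.util.List;
import java.util.function.ToDoubleFunction;

class LinearEvaluator<S> {
    private final double[] weights;                    // w1 … wn
    private final List<ToDoubleFunction<S>> features;  // f1 … fn

    LinearEvaluator(double[] weights, List<ToDoubleFunction<S>> features) {
        this.weights = weights;
        this.features = features;
    }

    // v̂(s) = w1·f1(s) + … + wn·fn(s)
    double evaluate(S state) {
        double v = 0.0;
        for (int i = 0; i < weights.length; i++) {
            v += weights[i] * features.get(i).applyAsDouble(state);
        }
        return v;
    }
}

In chess, for example, f1(s) might measure material balance and f2(s) mobility.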
Exploration versus Exploitation
Learning the static evaluation function is a classic reinforcement
learning problem.
   • Repeatedly play against yourself.
   • Reward board positions that lead to wins.
   • Punish board positions that lead to losses.
A crucial trade-off is choosing between exploration (trying unfamiliar moves to
learn more about them) and exploitation (playing the moves that currently look
best).
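A common way to manage this trade-off is ε-greedy action selection (a standard
reinforcement-learning device, not named on the slide): with small probability ε
play a random move to explore, otherwise exploit the current value estimates.
A minimal Java sketch, where the action type and value estimate are placeholders:

import java.util.List;
import java.util.Random;
import java.util.function.ToDoubleFunction;

class EpsilonGreedy<A> {
    private final Random rng = new Random();
    private final double epsilon;   // probability of exploring

    EpsilonGreedy(double epsilon) { this.epsilon = epsilon; }

    // Assumes the list of legal actions is non-empty.
    A choose(List<A> actions, ToDoubleFunction<A> estimatedValue) {
        if (rng.nextDouble() < epsilon) {
            return actions.get(rng.nextInt(actions.size()));   // explore
        }
        A best = actions.get(0);                                // exploit
        for (A a : actions) {
            if (estimatedValue.applyAsDouble(a) > estimatedValue.applyAsDouble(best)) {
                best = a;
            }
        }
        return best;
    }
}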
Q-Learning
• Many games can be modelled as a Markov Decision Process:
   • At each discrete timestep, the game is in a particular state.
   • In each state, the player can choose from a set of actions.
   • Given a choice of action, the system transitions to a new state chosen
     at random according to some probability distribution.
• Q-Learning learns a ‘Quality’ value for each state-action combination,
  based on the expected reward of taking that action in that state.
• Parameters: learning rate, discount factor, initial Q-values.
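A minimal tabular Q-learning sketch in Java. The String state and action keys
are purely illustrative, and the constructor arguments correspond to the
parameters listed above (learning rate, discount factor, initial Q-value).

import java.util.HashMap;
import java.util.Map;

class QLearner {
    private final Map<String, Double> q = new HashMap<>(); // key: state + "|" + action
    private final double alpha;    // learning rate
    private final double gamma;    // discount factor
    private final double initialQ; // value assumed for unseen state-action pairs

    QLearner(double alpha, double gamma, double initialQ) {
        this.alpha = alpha;
        this.gamma = gamma;
        this.initialQ = initialQ;
    }

    double getQ(String state, String action) {
        return q.getOrDefault(state + "|" + action, initialQ);
    }

    // Q(s,a) <- Q(s,a) + alpha * (reward + gamma * max_a' Q(s',a') - Q(s,a))
    void update(String state, String action, double reward,
                String nextState, Iterable<String> nextActions) {
        double best = Double.NEGATIVE_INFINITY;
        for (String a : nextActions) best = Math.max(best, getQ(nextState, a));
        if (best == Double.NEGATIVE_INFINITY) best = 0.0;   // terminal next state
        double old = getQ(state, action);
        q.put(state + "|" + action, old + alpha * (reward + gamma * best - old));
    }
}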
Cut-off Test
A cut-off test determines when to apply static evaluation. Searching to
a fixed depth is a simple cut-off policy, but this suffers from the horizon
problem: an unavoidable damaging move that can be pushed beyond
the depth of the search.
Another problem is stopping in the middle of a sequence of moves
(e.g., piece exchange in chess).
Some techniques exist to avoid these issues:
   • only apply static evaluation at quiescent positions, i.e., positions where
     the heuristic value is stable (see the sketch below).
   • killer heuristic – always consider the opponent’s most damaging (‘killer’)
     moves.
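A tiny sketch of what a depth-plus-quiescence cut-off test might look like in
Java; the isQuiescent flag stands in for a hypothetical, game-specific stability
check (e.g., no capture pending in chess).

class CutoffTest {
    private final int maxDepth;

    CutoffTest(int maxDepth) { this.maxDepth = maxDepth; }

    // Cut off (and apply static evaluation) only once the depth limit has been
    // reached AND the position is quiescent, so the heuristic value is stable.
    boolean shouldCutOff(int depth, boolean isQuiescent) {
        return depth >= maxDepth && isQuiescent;
    }
}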
Games that include an element of chance require that we calculate the
expected value of a position rather than the exact value.
Games with Chance
[Game tree with chance nodes: levels of RAND (chance) nodes are interleaved
with the usual MAX and MIN levels.]
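At a chance (RAND) node the value is the probability-weighted average of the
children's values rather than a max or a min. A minimal Java sketch, assuming
the child values have already been computed recursively:

class ChanceNodeValue {
    // values[i] is the value of outcome i, probs[i] its probability (probs sum to 1).
    static double expectedValue(double[] values, double[] probs) {
        double v = 0.0;
        for (int i = 0; i < values.length; i++) {
            v += probs[i] * values[i];
        }
        return v;
    }
}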
Games with Chance (e.g., Stratopolis)
   • green’s move
   • red’s (opponent) move
   • random player shuffles the remaining pieces
   • green’s next move
   • red’s (opponent) next move
   • …
Monte Carlo Simulation
Monte Carlo simulation is a randomized algorithm that can be used to
approximate the value of an intermediate game state.
   • Develop the game tree to some fixed depth or some fixed width
   • Run simulations from each leaf node
   • Use results of simulation to assign a value to the node
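A minimal Java sketch of one such simulation: run random playouts from a state
and average the payoffs. The GameState interface here is a hypothetical
stand-in for whatever representation the game actually uses.

import java.util.List;
import java.util.Random;

interface GameState {
    boolean isTerminal();
    double payoff();               // payoff for the player being evaluated
    List<GameState> successors();  // legal moves (non-empty if not terminal)
}

class MonteCarloEval {
    private static final Random RNG = new Random();

    // Average payoff over a number of random playouts from the given state.
    static double estimate(GameState state, int playouts) {
        double total = 0.0;
        for (int i = 0; i < playouts; i++) {
            GameState s = state;
            while (!s.isTerminal()) {
                List<GameState> next = s.successors();
                s = next.get(RNG.nextInt(next.size()));   // play a random move
            }
            total += s.payoff();
        }
        return total / playouts;
    }
}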
Opening Book and Endgame Databases
• Opening books can save computation at the beginning of a
  game by storing a good sequence of starting moves.
   • For variety, a player can randomly choose between the moves.
   • As soon as an opponent plays a move that is not encoded in the
     book, the player must resort to search or simulated game play.
• For some games, the state space shrinks near the end of the game. In such
  cases, an endgame database can be pre-computed by working backwards from
  different endings.
   • If an agent ever finds a game state that matches one in the endgame
     database, it can immediately determine whether it will win or lose.
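Both ideas amount to a precomputed table keyed on positions. Here is a minimal
Java sketch of an opening-book lookup (an endgame database would work the same
way, mapping late-game positions to known outcomes); the String position keys
and moves are placeholders for the game's real representation.

import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.Random;

class OpeningBook {
    private final Map<String, List<String>> book;   // position key -> known good moves
    private final Random rng = new Random();

    OpeningBook(Map<String, List<String>> book) { this.book = book; }

    // Return a random book move (for variety), or empty when the position is not
    // in the book and the player must fall back to search or simulated play.
    Optional<String> lookup(String positionKey) {
        List<String> moves = book.get(positionKey);
        if (moves == null || moves.isEmpty()) return Optional.empty();
        return Optional.of(moves.get(rng.nextInt(moves.size())));
    }
}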
Milestones in AI Game Playing
        1959 Arthur Samuel develops Checkers playing program
         1997 IBM’s Deep Blue chess machine beats Garry Kasparov
        2007 Checkers solved by University of Alberta
        2011 IBM’s Watson wins Jeopardy! requiring natural
             language understanding
        2015 Deep reinforcement learning algorithms learn to play
             Atari arcade games from scratch
        2016 Google DeepMind’s AlphaGo beats Lee Sedol, Korea