# Available games

🟢:  thoroughly-tested. In many cases, we verified against known values and/or reproduced results from papers.

🔶: implemented but lightly tested.

❌: known issues (see notes below and code for details).

| Status | Game                                            |
| ------ | ----------------------------------------------- |
| 🔶      | [2048](#2048)                                   |
| 🔶      | [Amazons](#amazons)                             |
| 🔶      | [Atari](#atari)                                 |
| 🟢      | [Backgammon](#backgammon)                       |
| 🔶      | [Bargaining](#bargaining)                       |
| 🔶      | [Battleship](#battleship)                       |
| 🔶      | [Blackjack](#blackjack)                         |
| 🔶      | [Block Dominoes](#block-dominoes)               |
| 🟢      | [Breakthrough](#breakthrough)                   |
| 🟢      | [Bridge](#bridge)                               |
| 🟢      | [(Uncontested) Bridge                           |
:        : bidding](#uncontested-bridge-bidding)           :
| 🔶      | [Catch](#catch)                                 |
| 🔶      | [Checkers](#checkers)                           |
| 🔶      | [Cliff Walking](#cliff-walking)                 |
| 🔶      | [Clobber](#clobber)                             |
| 🔶      | [Coin Game](#coin-game)                         |
| 🔶      | [Colored Trails](#colored-trails)               |
| 🟢      | [Connect Four](#connect-four)                   |
| 🔶      | [Cooperative                                    |
:        : Box-Pushing](#cooperative-box-pushing)          :
| 🟢      | [Chess](#chess)                                 |
| 🔶      | [Crazy Eights](#crazy-eights)                   |
| 🔶      | [Dark Hex](#dark-hex)                           |
| 🔶      | [Deep Sea](#deep-sea)                           |
| 🟢      | [Dots and Boxes](#dots-and-boxes)               |
| 🔶      | [Dou Dizhu](#dou-dizhu)                         |
| 🔶      | [Euchre](#euchre)                               |
| 🟢      | [First-price Sealed-Bid                         |
:        : Auction](#first-price-sealed-bid-auction)       :
| 🟢      | [Gin Rummy](#gin-rummy)                         |
| 🟢      | [Go](#go)                                       |
| 🟢      | [Goofspiel](#goofspiel)                         |
| 🟢      | [Hanabi](#hanabi)                               |
| 🟢      | [Havannah](#havannah)                           |
| 🟢      | [Hearts](#hearts)                               |
| 🔶      | [Hex](#hex)                                     |
| 🔶      | [Kriegspiel](#Kriegspiel)                       |
| 🟢      | [Kuhn poker](#kuhn-poker)                       |
| 🔶      | [Laser Tag](#laser-tag)                         |
| 🟢      | [Leduc poker](#leduc-poker)                     |
| 🔶      | [Lewis Signaling](#lewis-signaling)             |
| 🟢      | [Liar's Dice](#liars-dice)                      |
| 🔶      | [Liar's Poker](#liars-poker)                    |
| 🔶      | [Mensch ärgere Dich                             |
:        : nicht](#mensch-aergere-dich-nicht)              :
| 🔶      | [Mancala](#mancala)                             |
| 🔶      | [Markov Soccer](#markov-soccer)                 |
| 🟢      | [Matching Pennies                               |
:        : (Three-player)](#matching-pennies-three-player) :
| 🟢      | [Mean Field Game :                              |
:        : garnet](#mean_field_game_garnet)                :
| 🟢      | [Mean Field Game : crowd                        |
:        : modelling](#mean_field_game_crowd_modelling)    :
| 🟢      | [Mean Field Game : crowd modelling              |
:        : 2d](#mean_field_game_crowd_modelling_2d)        :
| 🟢      | [Mean Field Game : linear                       |
:        : quadratic](#mean-field-game--linear-quadratic)  :
| 🟢      | [Mean Field Game : predator                     |
:        : prey](#mean_field_game_predator_prey)           :
| 🟢      | [Mean Field Game :                              |
:        : routing](#mean-field-game--routing)             :
| 🔶      | [Morpion Solitaire (4D)](#morpion-solitaire-4d) |
| 🟢      | [Negotiation](#negotiation)                     |
| 🔶      | [Nim](#nim)                                     |
| 🔶      | [Nine men's morris](#nine_mens_morris)          |
| 🔶      | [Oh Hell](#oh-hell)                             |
| 🟢      | [Oshi-Zumo](#oshi-zumo)                         |
| 🟢      | [Oware](#oware)                                 |
| 🔶      | [Pathfinding](#pathfinding)                     |
| 🟢      | [Pentago](#pentago)                             |
| 🔶      | [Phantom Go](#phantom-go)                       |
| 🔶      | [Phantom Tic-Tac-Toe](#phantom-tic-tac-toe)     |
| 🟢      | [Pig](#pig)                                     |
| 🟢      | [Prisoner's Dilemma](#prisoner's-dilemma)       |
| ❌      | [Poker (Hold 'em)](#poker-hold-em)              |
| ❌      | [Quoridor](#quoridor)                           |
| ❌      | [Reconnaissance Blind                           |
:        : Chess](#reconnaissance-blind-chess)             :
| 🟢      | [Routing game](#routing-game)                   |
| 🔶      | [Sheriff](#sheriff)                             |
| 🔶      | [Slovenian Tarok](#slovenian-tarok)             |
| 🔶      | [Skat (simplified                               |
:        : bidding)](#skat-simplified-bidding)             :
| 🔶      | [Solitaire (K+)](#solitaire-k)                  |
| 🟢      | [Tic-Tac-Toe](#tic-tac-toe)                     |
| 🟢      | [Tiny Bridge](#tiny-bridge)                     |
| 🟢      | [Tiny Hanabi](#tiny-hanabi)                     |
| 🟢      | [Trade Comm](#trade-comm)                       |
| 🔶      | [TwixT](#twixt)                                 |
| 🔶      | [Ultimate Tic-Tac-Toe](#ultimate-tic-tac-toe)   |
| 🔶      | [Weighted Voting Games](#weighted-voting-games) |
| 🟢      | [Y](#y)                                         |

## Details

### 2048

*   A single player game where player aims to create a 2048 tile by merging
    other tiles.
*   Numbers on a grid.
*   Modern game.
*   Non-deterministic.
*   Perfect information.
*   1 player.
*   [Github](https://github.com/gabrielecirulli/2048)

### Amazons

*   Move pieces on a board trying to block opponents from moving.
*   Pieces on a grid.
*   Modern game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Game_of_the_Amazons)

### Atari

*   Agent plays classic games from
    [Gym's Atari Environments](https://www.gymlibrary.dev/environments/atari/),
    such as Breakout.
*   Single player.
*   Most games are non-deterministic.
*   Perfect information.

### Backgammon

*   Players move their pieces through the board based on the rolls of dice.
*   Idiosyncratic format.
*   Traditional game.
*   Non-deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Backgammon)

### Bargaining

*   Agents negotiate for items in a pool with different (hidden) valuations.
*   Research game.
*   Non-deterministic (randomized pool and valuations).
*   Imperfect information.
*   2 players.
*   [Lewis et al. '17](https://arxiv.org/abs/1706.05125),
    [DeVault et al. '15](https://www.aaai.org/ocs/index.php/SSS/SSS15/paper/viewFile/10335/10100)

### Battleship

*   Players place ships and shoot at each other in turns.
*   Pieces on a board.
*   Traditional game.
*   Deterministic.
*   Imperfect information.
*   2 players.
*   Good for correlated equilibria.
*   [Farina et al. '19, Correlation in Extensive-Form Games: Saddle-Point
    Formulation and
    Benchmarks](https://papers.nips.cc/paper/9122-correlation-in-extensive-form-games-saddle-point-formulation-and-benchmarks.pdf).
    Based on the original game
    [(wikipedia)](https://en.wikipedia.org/wiki/Battleship_\(game\))

### Blackjack

*   Simplified version of blackjack, with only HIT/STAND moves.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   1 player.
*   [Wikipedia](https://en.wikipedia.org/wiki/Blackjack)

### Block Dominoes

*   Most simple version of dominoes.
*   Consists of 28 tiles, featuring all combinations of spot counts (also called
    pips or dots) between zero and six.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Wikipedia]([https://en.wikipedia.org/wiki/Blackjack]\(https://en.wikipedia.org/wiki/Dominoes#Blocking_game\))

### Breakthrough

*   Simplified chess using only pawns.
*   Pieces on a grid.
*   Modern game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Breakthrough_\(board_game\))

### Bridge

*   A card game where players compete in pairs.
*   Card game.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   4 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Contract_bridge)

### (Uncontested) Bridge bidding

*   Players score points by forming specific sets with the cards in their hands.
*   Card game.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Contract_bridge)

### Catch

*   Agent must move horizontally to 'catch' a descending ball. Designed to test
    basic learning.
*   Agent on a grid.
*   Research game.
*   Non-deterministic.
*   Perfect information.
*   1 players.
*   [Mnih et al. 2014, Recurrent Models of Visual Attention](https://papers.nips.cc/paper/5542-recurrent-models-of-visual-attention.pdf),<br>[Osband et al '19, Behaviour Suite for Reinforcement Learning, Appendix A](https://arxiv.org/abs/1908.03568)

### Checkers

*   Players move pieces around the board with the goal of eliminating the
    opposing pieces.
*   Pieces on a grid.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Checkers)

### Cliff Walking

*   Agent must find goal without falling off a cliff. Designed to demonstrate
    exploration-with-danger.
*   Agent on a grid.
*   Research game.
*   Deterministic.
*   Perfect information.
*   1 players.
*   [Sutton et al. '18, page 132](http://www.incompleteideas.net/book/bookdraft2018mar21.pdf)

### Clobber

*   Simplified checkers, where tokens can capture neighbouring tokens. Designed
    to be amenable to combinatorial analysis.
*   Pieces on a grid.
*   Research game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Clobber)

### Coin Game

*   Agents must collect their and their collaborator's tokens while avoiding a
    third kind of token. Designed to test divining of collaborator's intentions
*   Agents on a grid.
*   Research game.
*   Non-deterministic.
*   Imperfect information (all players see the grid and their own preferences,
    but not the preferences of other players).
*   2 players.
*   [Raileanu et al. '18, Modeling Others using Oneself in Multi-Agent
    Reinforcement Learning](https://arxiv.org/abs/1802.09640)

### Colored Trails

*   Agents negotiations for chips that they they play on a colored grid to move
    closer to the goal.
*   Agents on a grid.
*   Research game.
*   Non-deterministic (randomized board & chip configuration).
*   Imperfect information.
*   3 players.
*   [Ya'akov et al. '10](https://dash.harvard.edu/handle/1/4726287),
    [Fecici & Pfeffer '08](https://dl.acm.org/doi/10.5555/1402383.1402431),
    [de Jong et al. '11](https://www.ifaamas.org/Proceedings/aamas2011/papers/C4_R57.pdf)

### Connect Four

*   Players drop tokens into columns to try and form a pattern.
*   Tokens on a grid.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Connect_Four)

### Cooperative Box-Pushing

*   Agents must collaborate to push a box into the goal. Designed to test
    collaboration.
*   Agents on a grid.
*   Research game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Seuken & Zilberstein '12, Improved Memory-Bounded Dynamic Programming for
    Decentralized POMDPs](https://arxiv.org/abs/1206.5295)

### Chess

*   Players move pieces around the board with the goal of eliminating the
    opposing pieces.
*   Pieces on a grid.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Chess)

### Dots and Boxes

*   Players put lines between dots to form boxes to get points.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Dots_and_Boxes)

### Crazy Eights

*   A precursor of UNO (see [here](https://www.unorules.org/crazy-eights/)).
*   Players try to match the rank or suit of the previous played card.
*   Eights are viewed as wild cards.
*   In an alternative version, special cards such as skip, reverse, draw-two are
    permitted.
*   Nondeterministic.
*   Imperfect information.
*   >=2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Crazy_Eights)

### Dark Hex

*   Hex, except the opponent's tokens are hidden. (Imperfect-information
    version)
*   Uses tokens on a hex grid.
*   Research game.
*   Deterministic.
*   Imperfect information.
*   2 players.

### Deep Sea

*   Agent must explore to find reward (first version) or penalty (second
    version). Designed to test exploration.
*   Agent on a grid.
*   Research game.
*   Deterministic.
*   Perfect information.
*   1 players.
*   [Osband et al. '17, Deep Exploration via Randomized Value Functions](https://arxiv.org/abs/1703.07608)

### Dou Dizhu

*   A three-player games where one player (dizhu) plays against a team of two
    (peasants).
*   Uses a 54-card deck.
*   Non-deterministic.
*   Imperfect information.
*   Three players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Dou_dizhu)

### Euchre

*   Trick-taking card game where players compete in pairs.
*   Card game.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   4 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Euchre)

### First-price Sealed-Bid Auction

*   Agents submit bids simultaneously; highest bid wins, and that's the price
    paid.
*   Idiosyncratic format.
*   Research game.
*   Non-deterministic.
*   Imperfect, incomplete information.
*   2-10 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/First-price_sealed-bid_auction)

### Gin Rummy

*   Players score points by forming specific sets with the cards in their hands.
*   Card game.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Gin_rummy)

### Go

*   Players place tokens on the board with the goal of encircling territory.
*   Tokens on a grid.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Go_\(game\))

### Goofspiel

*   Players bid with their cards to win other cards.
*   Card game.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   2-10 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Goofspiel)

### Hanabi

*   Players can see only other player's pieces, and everyone must cooperate to
    win.
*   Idiosyncratic format.
*   Modern game.
*   Non-deterministic.
*   Imperfect information.
*   2-5 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Hanabi_\(card_game\)) and
    [Bard et al. '19, The Hanabi Challenge: A New Frontier for AI Research](https://arxiv.org/abs/1902.00506)
*   Implemented via
    [Hanabi Learning Environment](https://github.com/deepmind/hanabi-learning-environment)

### Havannah

*   Players add tokens to a hex grid to try and form a winning structure.
*   Tokens on a hex grid.
*   Modern game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Havannah)

### Hearts

*   A card game where players try to avoid playing the highest card in each
    round.
*   Card game.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   3-6 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Hearts_\(card_game\))

### Hex

*   Players add tokens to a hex grid to try and link opposite sides of the
    board.
*   Uses tokens on a hex grid.
*   Modern game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Hex_\(board_game\))
*   [Hex, the full story by Ryan Hayward and Bjarne Toft](https://webdocs.cs.ualberta.ca/~hayward/hexbook/hex.html)

### Kriegspiel

*   Chess with opponent's pieces unknown. Illegal moves have no effect - it
    remains the same player's turn until they make a legal move.
*   Traditional chess variant, invented by Henry Michael Temple in 1899.
*   Deterministic.
*   Imperfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Kriegspiel_\(chess\))
*   [Monte Carlo tree search in Kriegspiel](https://www.ics.uci.edu/~dechter/courses/ics-295/fall-2019/papers/2010-mtc-aij.pdf)
*   [Game-Tree Search with Combinatorially Large Belief States, Parker 2005](https://www.cs.umd.edu/~nau/papers/parker2005game-tree.pdf)

### Kuhn poker

*   Simplified poker amenable to game-theoretic analysis.
*   Cards with bidding.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Kuhn_poker)

### Laser Tag

*   Agents see a local part of the grid, and attempt to tag each other with
    beams.
*   Agents on a grid.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Leibo et al. '17](https://arxiv.org/abs/1702.03037),
    [Lanctot et al. '17](https://arxiv.org/abs/1711.00832)

### Leduc poker

*   Simplified poker amenable to game-theoretic analysis.
*   Cards with bidding.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Southey et al. '05, Bayes’ bluff: Opponent modelling in poker](https://arxiv.org/abs/1207.1411)

### Lewis Signaling

*   Receiver must choose an action dependent on the sender's hidden state.
    Designed to demonstrate the use of conventions.
*   Idiosyncratic format.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Lewis_signaling_game)

### Liar's Dice

*   Players bid and bluff on the state of all the dice together, given only the
    state of their dice.
*   Dice with bidding.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Liar%27s_dice)

### Liar's Poker

*   Players bid and bluff on the state of all hands, given only the state of
    their hand.
*   Cards with bidding.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information
*   2 or more players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Liar%27s_poker)

### Mensch Aergere Dich Nicht 

*   Players roll dice to move their pegs toward their home row while throwing
    other players' pegs to the out area.
*   Traditional game.
*   Non-deterministic.
*   Perfect information.
*   2-4 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Mensch_%C3%A4rgere_Dich_nicht)

### Mancala

*   Players take turns sowing beans on the board and try to capture more beans
    than the opponent.
*   Idiosyncratic format.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Kalah)

### Markov Soccer

*   Agents must take the ball to their goal, and can 'tackle' the opponent by
    predicting their next move.
*   Agents on a grid.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Littman '94, Markov games as a framework for multi-agent reinforcement learning](https://www2.cs.duke.edu/courses/spring07/cps296.3/littman94markov.pdf),<br>[He et al. '16, Opponent Modeling in Deep Reinforcement Learning](https://arxiv.org/abs/1609.05559)

### Matching Pennies (Three-player)

*   Players must predict and match/oppose another player. Designed to have an
    unstable Nash equilibrium.
*   Idiosyncratic format.
*   Research game.
*   Deterministic.
*   Imperfect information.
*   3 players.
*   "Three problems in learning mixed-strategy Nash equilibria"

### Mean Field Game : routing

*   Representative player chooses at each node where they go. They has an
    origin, a destination and a departure time and chooses their route to
    minimize their travel time. Time spent on each link is a function of the
    distribution of players on the link when the player reaches the link.
*   Network with choice of route.
*   Research game.
*   Mean-field (with a unique player).
*   Explicit stochastic game (only for initial node).
*   Perfect information.
*   [Cabannes et. al. '21, Solving N-player dynamic routing games with
    congestion: a mean field approach](https://arxiv.org/pdf/2110.11943.pdf).

### Mean Field Game : Linear-Quadratic

*   Players are uniformly distributed and are then incentivized to gather at the
    same point (The lower the distanbce wrt. the distribution mean position, the
    higher the reward). A mean-reverting term pushes the players towards the
    distribution, a gaussian noise term perturbs them. The players' actions
    alter their states linearly (alpha * a * dt) and the cost thereof is
    quadratic (K * a^2 * dt), hence the name. There exists an exact, closed form
    solution for the fully continuous version of this game.
*   Research game.
*   Mean-field (with a unique player).
*   Explicit stochastic game (only for initial node).
*   Perfect information.
*   [Perrin & al. 2019 (https://arxiv.org/abs/2007.03458)]

### Morpion Solitaire (4D)

*   A single player game where player aims to maximize lines drawn on a grid,
    under certain limitations.
*   Uses tokens on a grid.
*   Traditional game.
*   Deterministic
*   Perfect information.
*   1 player.
*   [Wikipedia](https://en.wikipedia.org/wiki/Join_Five)

### Negotiation

*   Agents with different utilities must negotiate an allocation of resources.
*   Idiosyncratic format.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   [Lewis et al. '17](https://arxiv.org/abs/1706.05125),
    [Cao et al. '18](https://arxiv.org/abs/1804.03980)

### Nim

*   Two agents take objects from distinct piles trying to either avoid taking
    the last one or take it. Any positive number of objects can be taken on each
    turn given they all come from the same pile.
*   Traditional mathematical game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Nim)

### Nine men's morris

*   Two players put and move stones on the board to try to form mills (three
    adjacent stones in a line) to capture the other player's stones.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Nine_men%27s_morris)

### Oh Hell

*   A card game where players try to win exactly a declared number of tricks. 
*   Card game.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   3-7 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Oh_Hell)

### Oshi-Zumo

*   Players must repeatedly bid to push a token off the other side of the board.
*   Idiosyncratic format.
*   Traditional game.
*   Deterministic.
*   Imperfect information.
*   2 players.
*   [Buro, 2004. Solving the oshi-zumo game](https://link.springer.com/chapter/10.1007/978-0-387-35706-5_23) <br> [Bosansky et al. '16, Algorithms for Computing Strategies in Two-Player Simultaneous Move Games](http://mlanctot.info/files/papers/aij-2psimmove.pdf)

### Oware

*   Players redistribute tokens from their half of the board to capture tokens
    in the opponent's part of the board.
*   Idiosyncratic format.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Oware)

### Pathfinding

*   Agents must move to their desitnation.
*   Agents on a grid. Single-agent game is the classic examples from Sutton &
    Barto.
*   Research game.
*   Non-deterministic (in multiagent, collisions resolved by chance nodes).
*   Perfect information.
*   1-10 players.
*   Similar games appeared in
    [Austerweil et al. '15](http://miaoliu.scripts.mit.edu/SSS-16/wp-content/uploads/2016/01/paper.pdf),
    [Greenwald & Hall '03](https://www.aaai.org/Papers/ICML/2003/ICML03-034.pdf),
    and [Littman '01](https://jmvidal.cse.sc.edu/library/littman01a.pdf).

### Pentago

*   Players place tokens on the board, then rotate part of the board to a new
    orientation.
*   Uses tokens on a grid.
*   Modern game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Pentago)

### Phantom Go

*   Go, except the opponent's stones are hidden. The analogue of Kriegspiel for
    Go.
*   Research game.
*   Deterministic.
*   Imperfect information.
*   2 players.
*   [Cazenave '05, A Phantom Go Program](https://link.springer.com/chapter/10.1007/11922155_9)

### Phantom Tic-Tac-Toe

*   Tic-tac-toe, except the opponent's tokens are hidden. Designed as a simple,
    imperfect-information game.
*   Uses tokens on a grid.
*   Research game.
*   Deterministic.
*   Imperfect information.
*   2 players.
*   [Auger '11, Multiple Tree for Partially Observable Monte-Carlo Tree Search](https://hal.archives-ouvertes.fr/hal-00563480v2/document),<br>[Lisy '14, Alternative Selection Functions for Information Set Monte Carlo Tree Search](https://core.ac.uk/download/pdf/81646968.pdf), <br>[Lanctot '13](http://mlanctot.info/files/papers/PhD_Thesis_MarcLanctot.pdf)

### Pig

*   Each player rolls a dice until they get a 1 or they 'hold'; the rolled total
    is added to their score.
*   Dice game.
*   Traditional game.
*   Non-deterministic.
*   Perfect information.
*   2-10 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Pig_\(dice_game\))

### Prisoner's Dilemma

*   Players decide on whether to cooperate or defect given a situation with
    different payoffs.
*   Simultaneous.
*   Traditional game.
*   Deterministic.
*   Perfect Information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Prisoner%27s_dilemma)

### Poker (Hold 'em)

*   Players bet on whether their hand of cards plus some communal cards will
    form a special set.
*   Cards with bidding.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   2-10 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Texas_hold_%27em)
*   Implemented via [ACPC](http://www.computerpokercompetition.org/).
*   ❌ Known issues: see issue [#1033](https://github.com/google-deepmind/open_spiel/issues/1033).

### Quoridor

*   Each turn, players can either move their agent or add a small wall to the
    board.
*   Idiosyncratic format.
*   Modern game.
*   Deterministic.
*   Perfect information.
*   2-4 players. (Note, from Wikipedia: "Though it can be played with 3 players,
    it's advised against. Since the 3rd player doesn't have player on the
    opposite side, they have an advantage.")
*   [Wikipedia](https://en.wikipedia.org/wiki/Quoridor)
*   ❌ Known issues: see [#1158](https://github.com/google-deepmind/open_spiel/issues/1158).

### Reconnaissance Blind Chess

*   Chess with opponent's pieces unknown, with sensing moves.
*   Chess variant, invented by John Hopkins University Applied Physics Lab. Used
    in NeurIPS competition and Hidden Information Game Competition.
*   Deterministic.
*   Imperfect information.
*   2 players.
*   [JHU APL Main site](https://rbc.jhuapl.edu/)
*   [Markowitz et al. '18, On the Complexity of Reconnaissance Blind Chess](https://arxiv.org/abs/1811.03119)
*   [Newman et al. '16, Reconnaissance blind multi-chess: an experimentation
    platform for ISR sensor fusion and resource
    management](https://www.spiedigitallibrary.org/conference-proceedings-of-spie/9842/984209/Reconnaissance-blind-multi-chess--an-experimentation-platform-for-ISR/10.1117/12.2228127.short?SSO=1)
*   ❌ Known issues: see [#811](https://github.com/google-deepmind/open_spiel/issues/811).

### Routing game

*   Players choose at each node where they go. They have an origin, a
    destination and a departure time and choose their route to minimize their
    travel time. Time spent on each link is a function of the number of players
    on the link when the player reaches the link.
*   Network with choice of route.
*   Research game.
*   Simultaneous.
*   Deterministic.
*   Perfect information.
*   Any number of players.
*   [Cabannes et. al. '21, Solving N-player dynamic routing games with
    congestion: a mean field approach](https://arxiv.org/pdf/2110.11943.pdf).

### Sheriff

*   Bargaining game.
*   Deterministic.
*   Imperfect information.
*   2 players.
*   Good for correlated equilibria.
*   [Farina et al. '19, Correlation in Extensive-Form Games: Saddle-Point
    Formulation and
    Benchmarks](https://papers.nips.cc/paper/9122-correlation-in-extensive-form-games-saddle-point-formulation-and-benchmarks.pdf).
*   Based on the board game "Sheriff of Nottingham"
    [(bbg)](https://boardgamegeek.com/boardgame/157969/sheriff-nottingham)

### Slovenian Tarok

*   Trick-based card game with bidding.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   3-4 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/K%C3%B6nigrufen#Slovenia)
*   [Luštrek et al. 2003, A program for playing Tarok](https://pdfs.semanticscholar.org/a920/70fe11f75f58c27ed907c4688747259cae15.pdf)

### Skat (simplified bidding)

*   Each turn, players bid to compete against the other two players.
*   Cards with bidding.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   3 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Skat_\(card_game\))

### Solitaire (K+)

*   A single-player card game.
*   Card game.
*   Traditional game.
*   Non-deterministic.
*   Imperfect information.
*   1 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Klondike_\(solitaire\)) and
    [Bjarnason et al. '07, Searching solitaire in real time](http://web.engr.oregonstate.edu/~afern/papers/solitaire.pdf)

### Tic-Tac-Toe

*   Players place tokens to try and form a pattern.
*   Uses tokens on a grid.
*   Traditional game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Tic-tac-toe)

### Tiny Bridge

*   Simplified Bridge with fewer cards and tricks.
*   Cards with bidding.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2, 4 players.
*   See implementation for details.

### Tiny Hanabi

*   Simplified Hanabi with just two turns.
*   Idiosyncratic format.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2-10 players.
*   [Foerster et al 2018, Bayesian Action Decoder for Deep Multi-Agent
    Reinforcement Learning](https://arxiv.org/abs/1811.01458)

### Trade Comm

*   Players with different utilities and items communicate and then trade.
*   Idiosyncratic format.
*   Research game.
*   Non-deterministic.
*   Imperfect information.
*   2 players.
*   A simple emergent communication game based on trading.

### TwixT

*   Players place pegs and links on a 24x24 square to connect a line between
    opposite sides.
*   pegs and links on a grid.
*   Modern game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/TwixT)

### Ultimate Tic-Tac-Toe

*   Players try and form a pattern in local boards and a meta-board.
*   Uses tokens on a grid.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Ultimate_tic-tac-toe)

### Weighted Voting Games

*   Classic coalitional game.
*   Players each have a weight w_i, and there is a quota q.
*   Denote p the binary vector representing a coalition over n players. The
    utility is 1 is p dot w >= q, 0 otherwise.
*   n players.
*   [Chalkiadakis, Elkind, &amp; Wooldridge '12](https://link.springer.com/book/10.1007/978-3-031-01558-8)

### Y

*   Players place tokens to try and connect sides of a triangular board.
*   Tokens on hex grid.
*   Modern game.
*   Deterministic.
*   Perfect information.
*   2 players.
*   [Wikipedia](https://en.wikipedia.org/wiki/Y_\(game\))