10.5.1 Game Trees
In most literature, sequential games are formulated in terms of game trees. A state-space representation, which is more in alignment
with the representations used in this chapter, will be presented in
Section 10.5.2. The tree representation is commonly referred
to as the extensive form of a game
(as opposed to the normal form, which is
the cost matrix representation used in Chapter 9).
The representation is helpful for visualizing many issues in game
theory. It is perhaps most helpful for visualizing information
states; this aspect of game trees will be deferred until Section
11.7, after information spaces have been formally
introduced. Here, game trees are presented for cases that are simple
to describe without going deeply into information spaces.
Figure 10.12: A $3 \times 3$ matrix game expressed using a game tree.
Before a sequential game is introduced, consider representing a single-stage game in a tree form. Recall Example 9.14, which is a zero-sum, $3 \times 3$ matrix game. It can be represented as a game tree as shown in Figure 10.12. At the root, $P_1$ has three choices. At the next level, $P_2$ has three choices. Based on the choices by both, one of nine possible leaves will be reached. At this point, a cost is obtained, which is written under the leaf. The entries of the cost matrix, (9.53), appear across the leaves of the tree. Every nonleaf vertex is called a decision vertex: One player must select an action.
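To make the tree structure concrete, here is a minimal Python sketch that expands a cost matrix into its extensive form; the matrix entries are placeholders, not the actual values of (9.53):

```python
# Minimal sketch: a 3x3 matrix game in extensive (tree) form.
# The cost entries are placeholders, not the actual values of (9.53).
A = [[3, 5, 1],
     [0, 4, 2],
     [6, 2, 3]]

# Root: P1 picks a row; next level: P2 picks a column; each of the
# nine leaves holds the corresponding cost.
tree = {u: {v: A[u][v] for v in range(3)} for u in range(3)}

# Following one branch from the root to a leaf yields a single cost.
u, v = 1, 2
print(tree[u][v])  # -> 2, the cost written under that leaf
```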
There are two possible interpretations of the game depicted in Figure
10.12:
- Before it makes its decision, $P_2$ knows which action was applied by $P_1$. This does not correspond to the zero-sum game formulation introduced in Section 9.3 because $P_2$ seems as powerful as nature. In this case, it is not equivalent to the game in Example 9.14.
- $P_2$ does not know the action applied by $P_1$. This is equivalent to assuming that both $P_1$ and $P_2$ make their decisions at the same time, which is consistent with Formulation 9.7. The tree could have alternatively been represented with $P_2$ acting first. (The two interpretations are contrasted numerically below.)
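As a sanity check, the following minimal sketch uses a placeholder cost matrix (again, not the entries of (9.53)); with $P_1$ minimizing the cost $L(u,v)$ and $P_2$ maximizing it, the first interpretation forces $P_1$ to accept the upper value $\min_u \max_v L(u,v)$, whereas under simultaneous play deterministic security strategies guarantee $P_2$ only the lower value $\max_v \min_u L(u,v)$:

```python
# Sketch contrasting the two interpretations of Figure 10.12.
# Placeholder cost matrix; P1 minimizes, P2 maximizes.
A = [[3, 5, 1],
     [0, 4, 2],
     [6, 2, 3]]

# Interpretation 1: P2 sees P1's action and responds, so P1 must
# plan for P2's best response: the upper value min_u max_v.
upper = min(max(row) for row in A)

# Interpretation 2: simultaneous play. A deterministic security
# strategy guarantees P2 only the lower value max_v min_u.
cols = list(zip(*A))
lower = max(min(col) for col in cols)

print(upper, lower)  # -> 4 2
```

Here the two values differ ($4$ versus $2$), so no deterministic saddle point exists for this placeholder matrix, and randomization becomes necessary, consistent with Section 9.3.3.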
Now imagine that $P_1$ and $P_2$ play a sequence of games. A sequential version of the zero-sum game from Section 9.3 will be defined by extending the game tree idea given so far to more levels. This will model the following sequential game:
Formulation 10.3 (Zero-Sum Sequential Game in Tree Form)
- Two players, $P_1$ and $P_2$, take turns playing a game. A stage as considered previously is now stretched into two substages, in which each player acts individually. It is usually assumed that $P_1$ always starts, followed by $P_2$, then $P_1$ again, and so on. Player alternations continue until the game ends. The model reflects the rules of many popular games such as chess or poker. Let $\mathcal{K} = \{1, \ldots, K\}$ denote the set of stages at which $P_1$ and $P_2$ both take a turn.
- As each player takes a turn, it chooses from a nonempty, finite
set of actions. The available set could depend on the decision vertex.
- At the end of the game, a cost for $P_1$ is incurred based on the sequence of actions chosen by each player. The cost is interpreted as a reward for $P_2$.
- The amount of information that each player has when making its decision must be specified. This is usually expressed by indicating what portions of the action histories are known. For example, if $P_1$ just acted, does $P_2$ know its choice? Does it know what action $P_1$ chose in some previous stage?
The game tree can now be described in detail. Figure 10.13 shows a particular example for two stages (hence, $K = 2$ and $\mathcal{K} = \{1, 2\}$). Every vertex corresponds to a point at which a decision needs to be made by one player. Each edge emanating from a vertex represents an action. The root of the tree indicates the beginning of the game, which usually means that $P_1$ chooses an action. The leaves of the tree represent the end of the game, which are the points at which a cost is received. The cost is usually shown below each leaf. One final concern is to specify the information available to each player, just prior to its decision. Which actions among those previously applied by itself or other players are known?
Figure 10.13: A two-player, two-stage game expressed using a game tree.
For the game tree in Figure 10.13, there are two players and two stages. Therefore, there are four levels of decision vertices. The action sets for the players are $U = V = \{L, R\}$, for ``left'' and ``right.'' Since there are always two actions, a binary tree is obtained. There are $16$ possible outcomes, which correspond to all pairwise combinations of four possible two-stage plans for each player.
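The count can be verified by brute force. The following sketch enumerates the four possible two-stage plans per player (viewing a plan, for the moment, as a fixed action sequence) and forms all pairwise combinations:

```python
from itertools import product

# Each two-stage plan, viewed as a fixed action sequence, is a
# pair of actions drawn from {L, R}.
actions = ['L', 'R']
plans = list(product(actions, repeat=2))   # 4 plans per player
print(len(plans))                          # -> 4

# Pairing a plan for P1 with a plan for P2 fixes the path through
# all four decision levels, ending at one of the tree's 16 leaves.
outcomes = [(p1, p2) for p1 in plans for p2 in plans]
print(len(outcomes))                       # -> 16
```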
For a single-stage game, both deterministic and randomized strategies
were defined to obtain saddle points. Recall from Section
9.3.3 that randomized strategies were needed to
guarantee the existence of a saddle point. For a sequential game,
these are extended to deterministic and randomized
plans, respectively. In Section
10.1.3, a (deterministic) plan was defined as a mapping
from the state space to an action space. This definition can be
applied here for each player; however, we must determine what is a
``state'' for the game tree. This depends on the information that
each player has available when it plays.
A general framework for representing information in game trees is
covered in Section 11.7. Three simple kinds of
information will be discussed here. In every case, each player knows
its own actions that were applied in previous stages. The differences
correspond to knowledge of actions applied by the other player. These
define the ``state'' that is used to make the decisions in a plan.
The three information models considered here are as follows; a sketch contrasting the resulting plan representations appears after the list.
- Alternating play: The players take turns playing, and all players know all actions that have been previously applied. This is the situation obtained, for example, in a game of chess. To define a plan, let $N_1$ and $N_2$ denote the sets of all vertices from which $P_1$ and $P_2$ must make a decision, respectively. In Figure 10.13, $N_1$ is the set of dark vertices and $N_2$ is the set of white vertices. Let $U(n_1)$ and $V(n_2)$ be the action spaces for $P_1$ and $P_2$, respectively, which depend on the vertex. A (deterministic) plan for $P_1$ is defined as a function, $\pi_1$, on $N_1$ that yields an action $u \in U(n_1)$ for each $n_1 \in N_1$. Similarly, a (deterministic) plan for $P_2$ is defined as a function, $\pi_2$, on $N_2$ that yields an action $v \in V(n_2)$ for each $n_2 \in N_2$. For the randomized case, let $W(n_1)$ and $Z(n_2)$ denote the sets of all probability distributions over $U(n_1)$ and $V(n_2)$, respectively. A randomized plan for $P_1$ is defined as a function that yields some $w \in W(n_1)$ for each $n_1 \in N_1$. Likewise, a randomized plan for $P_2$ is defined as a function that maps from $N_2$ into $Z(n_2)$.
- Stage-by-stage: Each player knows the actions applied by the other in all previous stages; however, there is no information about actions chosen by others in the current stage. This effectively means that both players act simultaneously in each stage. In this case, a deterministic or randomized plan for $P_1$ is defined as in the alternating play case; however, plans for $P_2$ are defined as functions on $N_1$, instead of $N_2$. This is because at the time it makes its decision, $P_2$ has available precisely the same information as $P_1$. The action spaces for $P_2$ must conform to be dependent on elements of $N_1$, instead of $N_2$; otherwise, $P_2$ would not know what actions are available. Therefore, they are defined as $V(n_1)$ for each $n_1 \in N_1$.
- Open loop: Each player has no knowledge of the previous actions of the other. They only know how many actions have been applied so far, which indicates the stage of the game. Plans are defined as functions on $\mathcal{K}$, the set of stages, because the particular vertex is not known. Note that an open-loop plan is just a sequence of actions in the deterministic case (as in Section 2.3) and a sequence of probability distributions in the randomized case. Again, the action spaces must conform to the information. Thus, they are $U(k)$ and $V(k)$ for each $k \in \mathcal{K}$.
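As an illustration of how the domain of a plan changes across the three models, the sketch below gives one hypothetical deterministic plan for $P_2$ under each of them, for a two-stage binary tree like Figure 10.13. A decision vertex is encoded here as the tuple of actions that reaches it; this encoding and the particular choices are illustrative assumptions, not notation from the text:

```python
# Sketch: deterministic plans for P2 under the three information
# models, for a two-stage binary tree. A decision vertex is encoded
# (hypothetically) as the tuple of actions leading to it.

# Alternating play: P2's plan is a function on N2, the vertices at
# which P2 moves, so it can react to P1's observed action.
def p2_alternating(n2):
    return 'R' if n2[-1] == 'L' else 'L'

# Stage-by-stage: P2's plan is instead a function on N1, since P1's
# current action is not yet known when P2 decides.
def p2_stage_by_stage(n1):
    return 'L' if len(n1) == 0 else 'R'

# Open loop: only the stage index k is known, so the plan is just a
# sequence of actions indexed by stage.
p2_open_loop = {1: 'L', 2: 'R'}

print(p2_alternating(('L',)))    # reacts to P1's move at stage 1
print(p2_stage_by_stage(()))     # uses only P1's information
print(p2_open_loop[1])           # fixed action for stage 1
```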
For a single-stage game, as in Figure 10.12, the
stage-by-stage and open-loop models are equivalent.