Since a game under Formulation 9.7 can be nicely
expressed as a matrix, it is tempting to use linear algebra to
conveniently express expected costs.  Let  and
 and  .  As
in Section 9.1.3, a randomized strategy for
.  As
in Section 9.1.3, a randomized strategy for 
 can be
represented as an
 can be
represented as an  -dimensional vector,
-dimensional vector,
| ![$\displaystyle w = [w_1 \;\; w_2 \; \ldots \; w_m].$](img3645.gif) | (9.55) | 
 for all
 for all 
 , and 2)
, and 2) 
 .  If
.  If  is considered as a point in
 is considered as a point in 
 , then
the two constraints imply that it must lie on an
, then
the two constraints imply that it must lie on an  -dimensional
simplex (recall Section 6.3.1).  If
-dimensional
simplex (recall Section 6.3.1).  If  , this means
that
, this means
that  lies in a triangular subset of
 lies in a triangular subset of 
 .  Similarly, let
.  Similarly, let  represent a randomized strategy for
represent a randomized strategy for 
 as an
 as an  -dimensional
vector,
-dimensional
vector,
 denotes transpose, which yields a column vector that
satisfies the dimensional constraints required for an upcoming matrix
multiplication.
 denotes transpose, which yields a column vector that
satisfies the dimensional constraints required for an upcoming matrix
multiplication.
Let 
 denote the expected cost that will be received if
 denote the expected cost that will be received if
 plays
 plays  and
 and 
 plays
 plays  .  This can be computed as
.  This can be computed as
 , makes use of the assumption in
Formulation 9.7 that the actions are consecutive
integers.  The expected cost can be alternatively expressed using the
cost matrix,
, makes use of the assumption in
Formulation 9.7 that the actions are consecutive
integers.  The expected cost can be alternatively expressed using the
cost matrix,  .  In this case
.  In this case
 yields a scalar value that is precisely
(9.57).  To see this, first consider the product
 yields a scalar value that is precisely
(9.57).  To see this, first consider the product  .
This yields an
.
This yields an  -dimensional vector in which the
-dimensional vector in which the  th element is
the expected cost that
th element is
the expected cost that 
 would receive if it tries
 would receive if it tries  .  Thus,
it appears that
.  Thus,
it appears that 
 views
 views 
 as a nature player under the
probabilistic model.  Once
 as a nature player under the
probabilistic model.  Once  and
 and  are multiplied, a scalar value
is obtained, which averages the costs in the vector
 are multiplied, a scalar value
is obtained, which averages the costs in the vector  according the
probabilities of
 according the
probabilities of  .
.
Let  and
 and  denote the set of all randomized strategies for
 denote the set of all randomized strategies for 
 and
and 
 , respectively.  These spaces include strategies that are
equivalent to the deterministic strategies considered in Section
9.3.2 by assigning probability one to a single action.
Thus,
, respectively.  These spaces include strategies that are
equivalent to the deterministic strategies considered in Section
9.3.2 by assigning probability one to a single action.
Thus,  and
 and  can be considered as expansions of the set of
possible strategies in comparison to what was available in the
deterministic setting.  Using
 can be considered as expansions of the set of
possible strategies in comparison to what was available in the
deterministic setting.  Using  and
 and  , randomized security
strategies for
, randomized security
strategies for 
 and
 and 
 are defined as
are defined as
The randomized upper value is defined as
 and
 and  include the deterministic security strategies,
 include the deterministic security strategies,
 and
 and 
 .  These
inequalities imply that the randomized security strategies may have
some hope in closing the gap between the two values in general.
.  These
inequalities imply that the randomized security strategies may have
some hope in closing the gap between the two values in general.
The most fundamental result in zero-sum game theory was shown by von
Neumann [956,957], and it states that 
 for any game in Formulation 9.7.  This yields
the randomized value
 for any game in Formulation 9.7.  This yields
the randomized value
 for the game.  This means that there
will never be expected regret if the players stay with their security
strategies.  If the players apply their randomized security
strategies, then a randomized saddle point is obtained.  This
saddle point cannot be seen as a simple pattern in the matrix
 for the game.  This means that there
will never be expected regret if the players stay with their security
strategies.  If the players apply their randomized security
strategies, then a randomized saddle point is obtained.  This
saddle point cannot be seen as a simple pattern in the matrix  because it instead exists over
because it instead exists over  and
 and  .
.
The guaranteed existence of a randomized saddle point is an important result because it demonstrates the value of randomization when making decisions against an intelligent opponent. In Example 9.7, it was intuitively argued that randomization seems to help when playing against an intelligent adversary. When playing the game repeatedly with a deterministic strategy, the other player could learn the strategy and win every time. Once a randomized strategy is used, the players will not experience regret.
Steven M LaValle 2020-08-14