Suppose that the players apply security strategies, and
. This results in a cost of
. How do the
players feel after the outcome?
may feel satisfied because
given that
selected
, it received the lowest cost
possible. On the other hand,
may regret its decision in light
of the action chosen by
. If it had known that
would
be chosen, then it could have picked
to receive cost
, which is better than
. If the game were to be
repeated, then
would want to change its strategy in hopes of
tricking
to obtain a higher reward.
Is there a way to keep both players satisfied? Any time there is a
gap between
and
, there is regret for one or both
players. If
and
denote the amount of regret experienced
by
and
, respectively, then the total regret is
Steven M LaValle 2020-08-14