Each player must now have its own plan. As in Section 10.1, it seems best to define a plan as a mapping from states to actions, because it may not be clear what actions will be taken by the other decision maker. In Section 10.1, the other decision maker was nature, and here it is a rational opponent. Let and denote plans for and , respectively. Since the number of stages in Formulation 10.4 is fixed, stage-dependent plans of the form and are appropriate (recall that stage-dependent plans were defined in Section 10.1.3). Each produces an action and , respectively.
Now consider different solution concepts for Formulation 10.4. For , a deterministic plan is a function , that produces an action , for each state and stage . For it is instead , which produces an action , for each and . Now consider defining a randomized plan. Let and denote the sets of all probability distributions over and , respectively. A randomized plan for yields some for each and . Likewise, a randomized plan for yields some for each and .
Steven M LaValle 2020-08-14