Game theory -  A Tutorial

Static: Matching Pennies

Alice and Bob each show one of the faces of a penny. If they show the same face, Bob gives Alice $1.00; otherwise, Alice gives Bob $1:00.  This game is summarized by the following reward matrix (left):

How should Alice and Bob play , possibly in a randomized way, to maximize their expected reward?

Not surprisingly, they should each choose H or T randomly with probability 1/2.

To see this, assume that Bob chooses H with probability b and Alice chooses H with probability a.

It is easy to see that if a > 0.5, then the best choice for b is b*(a) = 0, since Bob tries to present a face that does not match Alice’s.  The function b*(a), called the best response, is shown in the right-hand part of the figure, and so is a*(b).  These best response functions have only one intersection: (0.5, 0.5). What this means is that there is only one pair of choices for (a, b) where no player has an incentive to deviate unilaterally. Thus, if (a, b) = (0.4, 0.5), then Bob has an incentive to deviate from 0.5 because b*(0.4) is not equal to 0.5.  Only the intersection of the two best response functions is such that no player wants to deviate unilaterally: this point is called a Nash Equilibrium.

Thus, the game of Matching Pennies has a unique Nash Equilibrium, and it is randomized. One says that the strategies are mixed (for non-deterministic).