Generalized Monty Hall Problem
[edit]
We will consider generalizations of the classical problem with 3 doors and 1 prize discussed above to N doors with k prizes.
Pursuing the more general problems makes it difficult to use heuristic (and often erroneous) arguments and forces one to more precisely formulate exactly what problems are to be solved and what assumptions are being made to execute their solution. Rather than doors, we will formulate these problems in terms of boxes and prizes in boxes.
Rules:
(a) the player initially selects one of N boxes.
(b) The host cannot open a box with a prize and must open exactly p of the N-1 remaining boxes that do not contain a prize.
(c) The player must be then given an opportunity to exchange the box previously chosen with one of the q = N - 1 - p remaining unopened boxes.
The rules show that p + q = N-1 and that p ≤ N-1-k. Therefore q ≥ N-1 –(N – 1 –k) = k. For completeness, we allow the possibility that p = 0, i.e. the host opens no box.
Problems
(1) Find probability that the box initially chosen contains a prize.
(2) Find the conditional probability that the player wins a prize given that it is known that k specific boxes are empty and that the player does not exchange the previously chosen box with one of the remaining unopened boxes.
(3) Find the conditional probability that the player wins a prize given that it is known that k specific boxes are empty and that the player does exchange the previously chosen box with one of the remaining unopened boxes.
We assume the boxes are numbered 1, 2, ... , N. Since all probabilities to be considered are conditional on knowing the box initially chosen, and as this information is available to both player and host, we can assume with no loss of generality (by renumbering the boxes if necessary) that the player initially chose box 1.
Notation:
P(A): probability of A.
P(A|B): conditional probability of A given B.
Ac: complement of A.
A\B = A
Bc
Ck : the set of subsets of size k of {1, …, N}
X : a random variable with values in Ck. X is the set of boxes with prizes.
Rp(x) : is set of all subsets of size p of {2, …, N} The collection Rp(x) is set of all possible sets of boxes that can be opened if the prize set is x,
.
Dp(x) = {2, …, N} \Rp(x): is the set of all possible boxes that can remain if the prize set is x. A set s in Rp (x) has size q and contains x if
and x\ {1} if
.
:the conditional probability that player wins a prize player if the player switches given X = x and S = s.
For B a subset of {1,...,N}
A: the event the player switches and wins a prize.
Problems 1 – 3 in the notation just introduced are:
Solutions
The calculations of the probabilities needed in problems 1 – 3 depend of course on the distribution of X as well as the conditional distribution of S given X. Initially we will solve the problems making no assumptions about these distributions. These distributions are not actually made explicit even in the classical problem of 1 prize and 3 doors though the “usual assumptions” in the classical case are that X is uniformly distributed and that conditional distribution of S given X is either fixed by the rules or uniformly distributed over its possible values. In the generalization considered here we call this the “every thing uniform case” and we will present explicit solutions to the 3 problems in this case. In general the distributions of X as well as the conditional distribution of S given X would be known to the host and could, in fact, be considered a strategy of the host so certainly the host could compute the needed probabilities. However, generally the player would not know these distributions and therefore could not calculate the needed probabilities and therefore, except in certain special cases discussed below, could not determine if switching or remaining with the initially chosen box was a better choice.
We note that the rules demand that
.
Hence
.
Problem 1
.
To solve problems 2 and 3 we need to compute P(S = s). To this end let
and
.
Then
Problem 2
Problem 3
There are two possible routes to win if the player switches. One is that box 1 has a prize and the player exchges for one of the k-1 prize box in s and the other is that box 1 does not have a prize and the player switches for one of the k boxes in s that contain a prize. Reflecting the possibilities let
and
Then
Case where all distributions are uniform
In this special case the computations needed reduce to counting the various possiblilities. There are in all
subsets of size k of {1,...,N} of which
contain 1 and do not contain 1. Hence
.
Given an x which contains 1 there are q-(k-1)non prize boxes that are in s and these can be selected from N-1-(k-1) possible boxes. Thus
Similarly,given an x which does not contain 1 there are q-k non-prize boxes to be selected from N-1-k possiblities. Thus
The sum
is the number of choices for the set x \{1} of prize boxes contained in s. Thus
By (3),(11), and (13)
and by (4), (12), and (14)
.
Then a small calculation shows
Thus
.
Consequetly (6) shows
Thus in the uniform case considered here the additional information gained by the player knowing that various boxes are empty does not change the chance that the player wins a prize if the player remains with box initially selected.
To determine the solution to problem 3 we note that in the everything uniform case considered here
Consequently,
and
Thus
Hence by (9)
Comparison of (14 )and 15)shows
Thus, swiching has a
advantage. In the extreme case when no boxes are opened (q = N - 1) there is no advantag and the chance of winning with or without switching is the same as the prior probability of winning a prize. At the other extreme when q = k the player is
times more likely to win a prize by switching.
The case of N boxes and 1 prize with all distributions uniform seems to been discussed by D. L. Fergeson (1975) in a letter to Selvin (Selvin 1975b) but I know reference to the case of k > 1 prizes.
Host opens maximum number of boxes ( k = q)
When k = q the set s must be the prize set x if
. Additionally,
Therefore, in this case,
Specializing to the case when X is uniformly distributed,
and setting
we find using (6) and (9) that
Therefore,
Hence no matter how P(S =s |X = x) is determined it is always advantageous for the player to switch. However, in cases when X is not uniformly distributed it may be advantageous to to remain with the initially chosen box. For example, if k = 1 and P(X=N) = 0 then P(win switch|S={N}) = 0.
Mathematically, The extended Monty Hall problem discussed here is the same as the following extension of the * Three Prisoners Problem. <br\>
There are N prisoners of which k will be selected at random and set free and the remaining N - k will remain in jail. We refer to the prisoners as 1, …, N. Prisoner 1 knows the are to be k freed and asks the guard to give him the names of p of the prisoners who will remain claiming that information will not change his chance of being amongst those chosen to go free. We assume the guard does not lie and never tells prisoner 1 if he is to be freed if he is one of those to be freed. Is the prisoner correct? If the guard offers him the opportunity to switch with an unnamed prisoner is this advantageous? <br\>
If we identify the prisoners with the boxes and going free with the prizes we see that the prisoner problem is mathematically the same as the extended Monty Hall problem.
Further extension of the Monty Hall problem
[edit]
A further generalization of the Monty Hall problem allows the player to initially choose m >=1 boxes. The host then reveals p empty boxes and the player now has the option to exchange r of his initially chosen boxes with r of the q unopened boxes. However, the player can only keep 1 of the prizes if his final set of boxes contains more than one prize. As before, the player knows that there are k prizes and that the host will open p boxes. The player is at liberty to choose r. For mathematical completeness we allow the case r = 0, which of course, corresponds to the case that the player remains with the initial choice of boxes. In this generalization, the 2nd and 3rd problems coalesce with r =0 corresponding to not switching at all (problem 2) and an additional problem is suggested in determining what is an optimal r.
With this generalization it is convenient to the number the boxes 1, 2, …, m+N and let the players initial boxes be 1, … , m. The constraints on the quantities are:
Additional notation
Solutions in the everything uniform case
We will limit the consideration of the expanded problem to the case X is uniformly distributed, the conditional distribution of S given X is uniformly distributed on its permissible set of values, and the sets of size r chosen from {1, …, m} and S are conditional on S and X uniformly distributed. Additionally, it is somewhat easier to state the results in terms of the probability of not getting a prize.
For easy reference we list some facts:
Unconditional probability of no prize
Conditional probability of winning no prize given S
If we switch r boxes and there are j prizes in {1,…,m} then the player wins no prize if and only if all of the j prize boxes amongst those {1,…,m} are included in the r boxes switched and none of the k – j boxes in the set s are included in those switched from the set s. Call this the event Cr. Then
Case of k = 1 prize.
Formula (29) simplifies considerably for the special cases when m = 1 or k = 1. The case m = 1, reproduce the complement probability given in equation (17) for r = 0 and in equation (10) for r =1. Carrying out the calculations for the case k = 1 shows the denominator of (29) becomes
<br\>
while the numerator of (29) becomes
Since N – q ≥ 0 we see that the probability of losing is maximal for r = 0, i.e., not switching, and steadily declines with increasing r becoming minimal if the player exchanges the maximum number of boxes possible (r = min (m,q)).