12 The Poisson Distribution
Overview
In this lesson, we learn about another specially named discrete probability distribution, namely the Poisson distribution.
Objectives
Upon completion of this lesson, you should be able to:
- recognize the situation that makes a discrete random variable a Poisson random variable.
- understand the heuristic derivation of the probability mass function of a Poisson random variable.
- use the Poisson PMF to calculate probabilities for a Poisson random variable.
- use a standard Poisson cumulative probability table to calculate probabilities for a Poisson random variable.
- explore the key properties, such as the moment-generating function, mean and variance, of a Poisson random variable.
- use the Poisson distribution to approximate binomial probabilities.
- understand the steps involved in each of the proofs in the lesson.
- apply the methods learned in the lesson to new problems.
12.1 Poisson Distributions
Situation
Let the discrete random variable \(X\) denote the number of times an event occurs in an interval of time (or space). Then \(X\) may be a Poisson random variable with \(x=0, 1, 2, \ldots\).
Example 12.1
- Let \(X\) equal the number of typos on a printed page. (This is an example of an interval of space — the space being the printed page.)
- Let \(X\) equal the number of cars passing through the intersection of Allen Street and College Avenue in one minute. (This is an example of an interval of time — the time being one minute.)
- Let \(X\) equal the number of Alaskan salmon caught in a squid driftnet. (This is again an example of an interval of space — the space being the squid driftnet.)
- Let \(X\) equal the number of customers at an ATM in 10-minute intervals.
- Let \(X\) equal the number of students arriving during office hours.
Def. 12.1 (Poisson Random Variable) If \(X\) is a Poisson random variable, then the probability mass function is:
\[ f(x)=\dfrac{e^{-\lambda} \lambda^x}{x!} \]
for \(x=0, 1, 2, \ldots\) and \(\lambda>0\), where \(\lambda\) will be shown later to be both the mean and the variance of \(X\).
Recall that the mathematical constant e is the unique real number such that the value of the derivative (slope of the tangent line) of the function \(f(x)=e^x\) at the point \(x=0\) is equal to 1. It turns out that the constant is irrational, but to five decimal places, it equals:
\[ \mathbf{e} = 2.71828 \]
Also, note that there are (theoretically) an infinite number of possible Poisson distributions. Any specific Poisson distribution depends on the parameter \(\lambda\).
“Derivation” of the PMF
Let \(X\) denote the number of events in a given continuous interval. Then \(X\) follows an approximate Poisson process with parameter \(\lambda>0\) if:
- The number of events occurring in non-overlapping intervals are independent.
- The probability of exactly one event in a short interval of length \(h=\frac{1}{n}\) is approximately \(\lambda h = \lambda \left(\frac{1}{n}\right)=\frac{\lambda}{n}\).
- The probability of exactly two or more events in a short interval is essentially zero.
With these conditions in place, here’s how the derivation of the PMF of the Poisson distribution goes:
Now, let’s make the intervals even smaller. That is, take the limit as \(n\) approaches infinity \(n\rightarrow \infty\) for fixed \(x\). Doing so, we get:
12.2 Finding Poisson Probabilities
Example 12.2 
Let \(X\) equal the number of typos on a printed page with a mean of 3 typos per page. What is the probability that a randomly selected page has at least one typo on it?
Solution
We can find the requested probability directly from the PMF The probability that \(X\) is at least one is:
\[ P(X\ge 1)=1-P(X=0) \]
Therefore, using the PMF to find \(P(X=0)\), we get:
\[ P(X \geq 1)=1-\dfrac{e^{-3}3^0}{0!}=1-e^{-3}=1-0.0498=0.9502 \]
That is, there is just over a 95% chance of finding at least one typo on a randomly selected page when the average number of typos per page is 3.
What is the probability that a randomly selected page has at most 1 typo on it?
Solution
The probability that \(X\) is at most one is:
\[ P(X\le 1)=P(X=0)+P(X=1) \]
Therefore, using the PMF, we get:
\[ \begin{align*} P(X \leq 1)&=\dfrac{e^{-3}3^0}{0!}+\dfrac{e^{-3}3^1}{1!} \\ &=e^{-3}+3e^{-3} \\ &=4e^{-3} \\ &=4(0.0498) \\ &=0.1992 \end{align*} \]
That is, there is just under a 20% chance of finding at most 1 typo on a randomly selected page when the average number of typos per page is 3.
Just as we used a cumulative probability table when looking for binomial probabilities, we could alternatively use a cumulative Poisson probability table, such as Table III in the back of your textbook. You should be able to use the formulas as well as the tables. If you take a look at the table, you’ll see that it is three pages long. Let’s just take a look at the top of the first page of the table in order to get a feel for how the table works:
In summary, to use the table in the back of your textbook, as well as that found in the back of most probability textbooks, to find cumulative Poisson probabilities, do the following:
- Find the column headed by the relevant \(\lambda\). Note that there are three rows containing \(\lambda\) on the first page of the table, two rows containing \(\lambda\) on the second page of the table, and one row containing \(\lambda\) on the last page of the table.
- Find the \(x\) in the first column on the left for which you want to find \(F(x)=P(X\le x)\).
Let’s try it out on an example. If \(X\) equals the number of typos on a printed page with a mean of 3 typos per page, what is the probability that a randomly selected page has 4 typos on it?
Solution
The probability that a randomly selected page has 4 typos on it can be written as \(P(X=4)\). We can calculate \(P(X=4)\) by subtracting \(P(X\le 3)\) from \(P(X\le 4)\). To find \(P(X\le 3)\) and \(P(X\le 4)\) using the Poisson table, we:
- Find the column headed by \(\lambda=3\).
- Find the 3 in the first column on the left, since we want to find \(F(3)=P(X\le 3)\). And, find the 4 in the first column on the left, since we want to find \(F(4)=P(X\le 4)\).
Now, all we need to do is, first, read the probability value where the \(\lambda=3\) column and the \(x=3\) row intersect, and, second, read the probability value where the \(\lambda=3\) column and the \(x=4\) row intersect. What do you get?
| x | 2.2 | 2.4 | 2.6 | 2.8 | 3.0 | 3.2 | 3.4 | 3.6 | 3.8 | 4.0 |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.111 | 0.091 | 0.074 | 0.061 | 0.050 | 0.051 | 0.033 | 0.027 | 0.022 | 0.018 |
| 1 | 0.355 | 0.308 | 0.267 | 0.231 | 0.199 | 0.171 | 0.147 | 0.126 | 0.107 | 0.092 |
| 2 | 0.623 | 0.570 | 0.518 | 0.469 | 0.423 | 0.380 | 0.340 | 0.303 | 0.269 | 0.238 |
| 3 | 0.819 | 0.779 | 0.736 | 0.692 | 0.647 | 0.603 | 0.558 | 0.515 | 0.473 | 0.433 |
| 4 | 0.928 | 0.904 | 0.887 | 0.848 | 0.815 | 0.781 | 0.744 | 0.706 | 0.668 | 0.629 |
| 5 | 0.975 | 0.964 | 0.951 | 0.935 | 0.916 | 0.895 | 0.871 | 0.844 | 0.816 | 0.785 |
| 6 | 0.993 | 0.988 | 0.983 | 0.976 | 0.966 | 0.955 | 0.942 | 0.927 | 0.909 | 0.889 |
| 7 | 0.998 | 0.997 | 0.995 | 0.992 | 0.988 | 0.983 | 0.977 | 0.969 | 0.960 | 0.949 |
| 8 | 1.000 | 0.990 | 0.999 | 0.998 | 0.993 | 0.994 | 0.992 | 0.988 | 0.984 | 0.979 |
| 9 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 | 0.998 | 0.997 | 0.996 | 0.994 | 0.992 |
| 10 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 | 0.998 | 0.997 |
| 11 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 |
| 12 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
The cumulative Poisson probability table tells us that finding \(P(X\le 4)=0.815\) and \(P(X\le 3)=0.647\). Therefore:
\[P(X=4)=P(X\le 4)-P(X\le 3)=0.815-0.647=0.168\] That is, there is about a 17% chance that a randomly selected page would have 4 typos on it. Since it wouldn’t take a lot of work in this case, you might want to verify that you’d get the same answer using the Poisson PMF.
What is the probability that 3 randomly selected pages have more than 8 typos on it?
Solution
Solving this problem involves taking one additional step. Recall that \(X\) denotes the number of typos on 1 printed page. Then, let’s define a new random variable \(Y\) that equals the number of typos on 3 printed pages. If the mean of \(X\) is 3 typos per page, then the mean of \(Y\) is:
\[ \lambda_Y=3 \text{ typos per 1 page }\times 3\text{ pages }=9 \text{ typos per 3 pages} \]
Finding the desired probability then involves finding:
\[ P(Y>8)=1-P(Y\le 8) \]
where \(P(Y\le 8)\) is found by looking on the Poisson table under the column headed by \(\lambda=9.0\) and the row headed by \(x=8\). What do you get?
| x | 6.5 | 7.0 | 7.5 | 8.0 | 8.5 | 9.0 | 9.5 | 10.0 | 10.5 | 11.0 |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.002 | 0.001 | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| 1 | 0.011 | 0.007 | 0.005 | 0.003 | 0.002 | 0.001 | 0.001 | 0.000 | 0.000 | 0.000 |
| 2 | 0.043 | 0.030 | 0.020 | 0.014 | 0.009 | 0.006 | 0.004 | 0.003 | 0.002 | 0.001 |
| 3 | 0.112 | 0.082 | 0.059 | 0.042 | 0.030 | 0.021 | 0.015 | 0.010 | 0.007 | 0.005 |
| 4 | 0.224 | 0.173 | 0.132 | 0.100 | 0.074 | 0.055 | 0.040 | 0.029 | 0.021 | 0.015 |
| 5 | 0.369 | 0.301 | 0.241 | 0.191 | 0.150 | 0.116 | 0.089 | 0.067 | 0.050 | 0.015 |
| 6 | 0.527 | 0.450 | 0.378 | 0.313 | 0.256 | 0.207 | 0.165 | 0.130 | 0.102 | 0.079 |
| 7 | 0.673 | 0.599 | 0.525 | 0.453 | 0.386 | 0.324 | 0.269 | 0.220 | 0.179 | 0.143 |
| 8 | 0.792 | 0.729 | 0.662 | 0.593 | 0.523 | 0.456 | 0.392 | 0.333 | 0.279 | 0.232 |
| 9 | 0.877 | 0.830 | 0.776 | 0.717 | 0.653 | 0.587 | 0.522 | 0.458 | 0.397 | 0.341 |
| 10 | 0.933 | 0.901 | 0.862 | 0.816 | 0.763 | 0.706 | 0.645 | 0.583 | 0.521 | 0.460 |
| 11 | 0.966 | 0.947 | 0.921 | 0.888 | 0.849 | 0.803 | 0.752 | 0.697 | 0.639 | 0.579 |
| 12 | 0.984 | 0.973 | 0.957 | 0.936 | 0.909 | 0.876 | 0.836 | 0.792 | 0.742 | 0.689 |
| 13 | 0.993 | 0.987 | 0.978 | 0.966 | 0.949 | 0.926 | 0.898 | 0.864 | 0.825 | 0.781 |
| 14 | 0.997 | 0.994 | 0.990 | 0.983 | 0.973 | 0.959 | 0.940 | 0.917 | 0.888 | 0.854 |
| 15 | 0.999 | 0.998 | 0.995 | 0.992 | 0.986 | 0.978 | 0.967 | 0.951 | 0.932 | 0.907 |
| 16 | 1.000 | 0.999 | 0.998 | 0.996 | 0.993 | 0.989 | 0.982 | 0.973 | 0.960 | 0.944 |
| 17 | 1.000 | 1.000 | 0.999 | 0.998 | 0.997 | 0.995 | 0.991 | 0.986 | 0.978 | 0.968 |
| 18 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 | 0.998 | 0.096 | 0.993 | 0.988 | 0.982 |
| 19 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 | 0.998 | 0.997 | 0.994 | 0.991 |
| 20 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.998 | 0.997 | 0.995 |
| 21 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 | 0.998 |
| 22 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 |
| 23 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
The cumulative Poisson probability table tells us that finding \(P(X\le 8)=0.456\). Therefore:
\[P(Y>8)=1-P(Y\le 8)=1-0.456=0.544\]
That is, there is a 54.4% chance that three randomly selected pages would have more than 8 typos on it.
12.3 Poisson Properties
Just as we did for the other named discrete random variables we’ve studied, on this page, we present and verify four properties of a Poisson random variable.
12.4 Approximating the Binomial Distribution
Example 12.3 
Five percent (5%) of Christmas tree light bulbs manufactured by a company are defective. The company’s Quality Control Manager is quite concerned and therefore randomly samples 100 bulbs coming off of the assembly line. Let \(X\) denote the number in the sample that are defective. What is the probability that the sample contains at most three defective bulbs?
Solution
Can you convince yourself that \(X\) is a binomial random variable? Hmmm…. let’s see… there are two possible outcomes (defective or not), the 100 trials of selecting the bulbs from the assembly line can be assumed to be performed in an identical and independent manner, and the probability of getting a defective bulb can be assumed to be constant from trial to trial. So, \(X\) is indeed a binomial random variable. Well, calculating the probability is easy enough then… we just need to use the cumulative binomial table with \(n=100\) and \(p=0.05\)… Oops! The table won’t help us here, will it? Even many standard calculators would have trouble calculating the probability using the PMF:
\[ P(X\leq 3)=\dbinom{100}{0}(0.05)^0 (0.95)^{100}+\cdots+\dbinom{100}{3}(0.05)^3 (0.95)^{97} \]
Using a statistical software package (Minitab), I was able to use the binomial PMF to determine that:
\[ P(X\le 3)=0.0059205+0.0311607+0.0811818+0.1395757=0.25784 \]
But, if you recall the way that we derived the Poisson distribution, we started with the binomial distribution and took the limit as \(n\) approached infinity. So, it seems reasonable then that the Poisson PMF would serve as a reasonable approximation to the binomial PMF when your \(n\) is large (and therefore, \(p\) is small). Let’s calculate \(P(X\le 3)\) using the Poisson distribution and see how close we get. Well, the probability of success was defined to be:
\[ p=\dfrac{\lambda}{n} \]
Therefore, the mean \(\lambda\) is:
\[ \lambda=np \]
So, we need to use our Poisson table to find \(P(X\le 3)\) when \(\lambda=100(0.05)=5\). What do you get?
| x | 4.2 | 4.4 | 4.6 | 4.8 | 5.0 | 5.2 |
|---|---|---|---|---|---|---|
| 0 | 0.015 | 0.012 | 0.010 | 0.008 | 0.007 | 0.006 |
| 1 | 0.078 | 0.033 | 0.056 | 0.048 | 0.040 | 0.034 |
| 2 | 0.210 | 0.185 | 0.163 | 0.143 | 0.125 | 0.109 |
| 3 | 0.395 | 0.359 | 0.326 | 0.294 | 0.265 | 0.238 |
| 4 | 0.590 | 0.551 | 0.513 | 0.476 | 0.440 | 0.406 |
| 5 | 0.753 | 0.720 | 0.686 | 0.651 | 0.616 | 0.581 |
| 6 | 0.867 | 0.844 | 0.818 | 0.791 | 0.762 | 0.732 |
| 7 | 0.936 | 0.921 | 0.905 | 0.887 | 0.867 | 0.845 |
| 8 | 0.972 | 0.964 | 0.955 | 0.944 | 0.932 | 0.918 |
| 9 | 0.989 | 0.985 | 0.980 | 0.975 | 0.968 | 0.960 |
| 10 | 0.996 | 0.994 | 0.992 | 0.990 | 0.986 | 0.982 |
| 11 | 0.999 | 0.998 | 0.997 | 0.996 | 0.995 | 0.993 |
| 12 | 1.000 | 0.999 | 0.999 | 0.999 | 0.998 | 0.997 |
| 13 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 |
| 14 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| 15 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| 16 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
The cumulative Poisson probability table tells us that finding \(P(X\le 3)=0.265\). That is, if there is a 5% defective rate, then there is a 26.5% chance that the a randomly selected batch of 100 bulbs will contain at most 3 defective bulbs. More importantly, since we have been talking here about using the Poisson distribution to approximate the binomial distribution, we should probably compare our results. When we used the binomial distribution, we deemed \(P(X\le 3)=0.258\), and when we used the Poisson distribution, we deemed \(P(X\le 3)=0.265\). Not too bad of an approximation, eh?
It is important to keep in mind that the Poisson approximation to the binomial distribution works well only when n is large and p is small. In general, the approximation works well if \(n\ge 20\) and \(p\le 0.05\), or if \(n\ge 100\) and \(p\le 0.10\).