2 1. INTRODUCTION
a die; the probability of getting a three is 1/6. Select a card from a well-shuffled deck; the probability
of getting the queen of spades is 1/52 (assuming there are no jokers). One way to view probability
models that many people find intuitive is in terms of random sampling from a fixed population.
For example, the 52 cards form a fixed population and picking a card from a well-shuffled deck is
a means of randomly selecting one element of the population. While we will exploit this idea of
sampling from fixed populations, we should also note its limitations. For example, blood pressure is
a very useful medical indicator, but even with a fixed population of people it would be very difficult
to define a useful population of blood pressures. Blood pressure depends on the time of day, recent
diet, current emotional state, the technique of the person taking the reading, and many other factors.
Thinking about populations is very useful, but the concept can be very limiting both practically and
mathematically. For measurements such as blood pressures and heights, there are difficulties in even
specifying populations mathematically.
For mathematical reasons, probabilities are defined not on particular outcomes but on sets of
outcomes (events). This is done so that continuous measurements can be dealt with. It seems much
more natural to define probabilities on outcomes as we did in the previous paragraph, but consider
some of the problems with doing that. For example, consider the problem of measuring the height of
a corpse being kept in a morgue under controlled conditions. The only reason for getting morbid here
is to have some hope of defining what the height is. Living people, to some extent, stretch and con-
tract, so a height is a nebulous thing. But even given that someone has a fixed height, we can never
know what it is. When someone’s height is measured as 177.8 centimeters (5 feet 10 inches), their
height is not really 177.8 centimeters, but (hopefully) somewhere between 177.75 and 177.85 cen-
timeters. There is really no chance that anyone’s height is exactly 177.8 cm, or exactly 177.8001 cm,
or exactly 177.800000001 cm, or exactly 56.5955
π
cm, or exactly (76
√
5 + 4.5
√
3) cm. In any
neighborhood of 177.8, there are more numerical values than one could even imagine counting. The
height should be somewhere in the neighborhood, but it won’t be the particular value 177.8. The
point is simply that trying to specify all the possible heights and their probabilities is a hopeless
exercise. It simply cannot be done.
Even though individual heights cannot be measured exactly, when looking at a population of
heights they follow certain patterns. There are not too many people over 8 feet (244 cm) tall. There
are lots of males between 175.3 cm and 177.8 cm (5
9
and 5
10
). With continuous values, each
possible outcome has no chance of occurring, but outcomes do occur and occur with regularity. If
probabilities are defined for sets instead of outcomes, these regularities can be reproduced mathe-
matically. Nonetheless, initially the best way to learn about probabilities is to think about outcomes
and their probabilities.
There are five key facts about probabilities:
1. Probabilities are between 0 and 1.
2. Something that happens with probability 1 is a sure thing.
3. If something has no chance of occurring, it has probability 0.
4. If something occurs with probability, say, .25, the probability that it will not occur is 1 −.25 =
.75.
5. If two events are mutually exclusive, i.e., if they cannot possibly happen at the same time, then
the probability that either of them occurs is just the sum of their individual probabilities.
Individual outcomes are always mutually exclusive, e.g., you cannot flip a coin and get both heads
and tails, so probabilities for outcomes can always be added together. Just to be totally correct, I
should mention one other point. It may sound silly, but we need to assume that something occurring
is always a sure thing. If we flip a coin, we must get either heads or tails with probability 1. We
could even allow for the coin landing on its edge as long as the probabilities for all the outcomes
add up to 1.
E
XAMPLE 1.1.1. Consider the nine outcomes that are all combinations of three heights, tall (T),