Figure 4 assigns probabilities to the events;
and
.
The probability that
and
occur jointly is
. In Figure 4(a) there are no sample points in common so that the probability of
and
occurring together is zero. Figure 4(b) indicates that
.
The probability that
is written
and
The adjustment from the last term,
, removes the one unit of the shared contribution since it is in both
and
. If the two events are mutually exclusive and share no common information, their intersection is the empty set and with
,
In Figure 4(a) the events are mutually exclusive,
When
and
are independent, information about one does not affect the information about the other. For instance, the event of rain on Jan 1st does not allow estimation of whether it will rain on July 1st as these events are far enough apart in time to be regarded as independent. The probability that the 2 independent events both occur is,
This relationship would not hold for dependent events because some of the information contained in
is also contained in
and the product would ``double dip''. The maximum temperature on Jan 1st is a reliable predictor of the maximum temperature on Jan 2nd and the probability that say the temperature exceeds
C on both days is really determined by the Jan 1st event, Jan 2nd is not an entirely new event.
Independence and mutually exclusive are different concepts,
| Independence |
|
| Mutually exclusive |
|
The probability of
when
is known is termed probability of
, conditional on
and written as
.
Then the probability that both events occur is,
In Figure 4(b),
. That illustrates that if we gain extra information by knowing
and that
, the risk associated with
is reduced.
The relationships of the probability of events are collated in Table 1. This provides us with enough basic principles to start probability analysis. Whilst the introductory examples may appear artificial, they are metaphors for ``real'' problems.