Applied Biostatistics for the Health Sciences - Richard J. Rossi
2.3 Probability
In a data-based biomedical study, a random sample will be selected from the target population, and a well-designed sampling plan requires knowing the chance of drawing a particular observation or set of observations. For example, it might be important to know the chance of drawing a female individual or an individual between the ages of 30 and 60. In other studies, it might be important to determine the likelihood that a particular genetic trait will be passed from the parents to their offspring.
A probability is a number between 0 and 1 that measures how likely it is for an event to occur. Probabilities are associated with tasks or experiments where the outcome cannot be determined without actually carrying out the task. A task where the outcome cannot be predetermined is called a random experiment or a chance experiment. For example, prior to treatment it cannot be determined whether chemotherapy will improve a cancer patient’s health. Thus, the result of a chemotherapy treatment can be treated as a chance experiment before chemotherapy is started. Similarly, when drawing a random sample from the target population, the values in the sample will not be known until the sample is collected. Hence, drawing a random sample from the target population is a chance experiment.
Because statistical inferences are based on a sample from the population rather than a census of the population, the statistical inferences will have a degree of uncertainty associated with them. The measures of reliability for statistical inferences drawn from a sample are based on the underlying probabilities associated with the target population.
In a chance experiment, the actual outcome of the experiment cannot be predetermined, but it is important for the experimenter to identify all of the possible outcomes of the experiment before it is carried out. The set of all possible outcomes of a chance experiment is called the sample space and will be denoted by S. A subcollection of the outcomes in the sample space is called an event, and the probability of an event measures how likely the event is. An event is said to occur when a chance experiment is carried out and the chance experiment results in one of the outcomes in the event. For example, in a chance experiment consisting of randomly selecting an adult from a well-defined population, if A is the event that an individual between the ages of 30 and 60 is selected, then the event A will occur if and only if the age of the individual selected is between 30 and 60; if the age of the individual is not between 30 and 60, then the event A will not occur.
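The relationship between a sample space, an event, and the occurrence of an event can be sketched in a few lines of Python. This is only an illustration: the list of ages is hypothetical (it is not data from the text), the names `population_ages`, `sample_space`, `event_a`, and `occurs` are invented for the sketch, and treating the bounds 30 and 60 as inclusive is an assumption.

```python
# Hypothetical ages in a small adult population; illustration only, not data from the text.
population_ages = [22, 35, 47, 61, 58, 30, 74, 19]

# Sample space S: the set of distinct ages that could be observed when one adult is drawn.
sample_space = set(population_ages)

# Event A: the subcollection of outcomes with an age between 30 and 60
# (bounds assumed inclusive for this sketch).
event_a = {age for age in sample_space if 30 <= age <= 60}


def occurs(event: set, outcome: int) -> bool:
    """Return True when the observed outcome is one of the outcomes in the event."""
    return outcome in event


# An event is a subset of the sample space.
print(event_a <= sample_space)
# A occurs if the selected adult is 47 years old, but not if the adult is 61.
print(occurs(event_a, 47), occurs(event_a, 61))
```

Representing the event as a set makes the definition literal: the event occurs exactly when the observed outcome is a member of the set.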
Probabilities are often used to determine the most likely outcome of a chance experiment and to assess how likely it is for an observed data set to support a research hypothesis. The probability of an event A is denoted by P(A), and the probability of an event is always a number between 0 and 1. Probabilities near 0 indicate an event rarely occurs, and probabilities near 1 indicate an event is likely to occur. Probabilities are sometimes also expressed as percentages, in which case the percentage is simply the probability of the event times 100. When probabilities are expressed as percentages, they will be between 0% and 100%.
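The conversion between a probability and a percentage is simple arithmetic, and a small helper can also enforce the requirement that a probability lies between 0 and 1. The function name `to_percent` is hypothetical and chosen only for this sketch.

```python
def to_percent(p: float) -> float:
    """Convert a probability in [0, 1] to a percentage in [0, 100]."""
    if not 0.0 <= p <= 1.0:
        # A value outside [0, 1] cannot be a probability.
        raise ValueError(f"{p} is not a valid probability")
    return p * 100


# A probability of 0.25 corresponds to 25 percent.
print(to_percent(0.25))
```

Rejecting values outside the interval is a design choice for the sketch; it catches the common mistake of passing a percentage where a probability is expected.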
Example 2.19
Suppose an individual is to be drawn at random and their blood type is identified. Prior to drawing a blood sample and typing it, an individual’s blood type is unknown, and thus, this can be treated as a chance experiment. The four possible blood types are O, A, B, and AB, and hence, the sample space is S={O,A,B,AB}. Furthermore, according to the American Red Cross, the probabilities of each blood type are
Thus, if a person is drawn at random, the probability that the person will have blood type AB is 0.04.
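One way to build intuition for the value 0.04 is to simulate many random draws and record the fraction that come out as type AB. The sketch below uses only the AB probability quoted above; the long-run-frequency reading of a probability, the function name `simulate_ab_fraction`, and the fixed seed are assumptions made for the illustration and are not taken from the text.

```python
import random

P_AB = 0.04  # probability of blood type AB quoted in Example 2.19


def simulate_ab_fraction(n_draws: int, seed: int = 0) -> float:
    """Simulate n_draws random individuals and return the fraction with type AB."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Each draw is type AB with probability P_AB, independently of the others.
    hits = sum(1 for _ in range(n_draws) if rng.random() < P_AB)
    return hits / n_draws


# With a large number of draws, the observed fraction should be close to 0.04.
print(simulate_ab_fraction(100_000))
```

With only a handful of draws the observed fraction can differ noticeably from 0.04; the agreement improves as the number of draws grows.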
The probabilities associated with a chance experiment and a sample space S must satisfy the following four properties known as the Axioms of Probability.