- •Introduction to Statistics for Biomedical Engineers
- •ABSTRACT
- •Keywords
- •Contents
- •3.1 WHY DO WE COLLECT DATA?
- •3.2 WHY DO WE NEED STATISTICS?
- •3.3 WHAT QUESTIONS DO WE HOPE TO ADDRESS WITH OUR STATISTICAL ANALYSIS?
- •3.4.1 Scatterplots
- •3.4.2 Time Series
- •3.4.3 Box-and-Whisker Plots
- •3.4.4 Histogram
- •3.5 GENERAL APPROACH TO STATISTICAL ANALYSIS
- •3.6 DESCRIPTIVE STATISTICS
- •3.7 MEASURES OF CENTRAL TENDENCY
- •3.8 MEASURES OF VARIABILITY
- •4.1 THE STANDARD NORMAL distribution
- •4.2 The normal distribution and sample mean
- •4.3 CONFIDENCE INTERVAL FOR THE SAMPLE MEAN
- •5.1 COMPARISON OF POPULATION MEANS
- •5.1.1 The t Test
- •5.1.1.1 Hypothesis Testing
- •5.1.1.2 Applying the t Test
- •5.1.1.3 Unpaired t Test
- •5.2 COMPARISON OF TWO VARIANCES
- •5.3 COMPARISON OF THREE OR MORE POPULATION MEANS
- •5.4 ONE-FACTOR EXPERIMENTS
- •5.5 TWO-FACTOR EXPERIMENTS
- •5.6 TUKEY’S MULTIPLE COMPARISON PROCEDURE
- •The Correlation Coefficient
- •7.1 POWER OF A TEST
- •7.2 POWER TESTS TO DETERMINE SAMPLE SIZE
- •Bibliography
- •Author Biography
41
c h a p t e r 5
Statistical Inference
Now that we have collected our data, estimated some basic descriptive statistics and assumed a probability model for the underlying population or process, we are prepared to perform some statistical analysis that will allow us to compare populations and test hypotheses. For this text, we are only going to discuss statistical analysis for which the normal distribution is a good probability model for the underlying population(s). If the data or samples suggest that the underlying population or process is not well modeled by a normal distribution, then we will need to resort to other types of statistical analysis, such a nonparametric techniques [6], which do not assume an underlying probability model. Some of these tests include the Wilcoxon rank sum test, Mann–Whitney U test, Kruskal–Wallis test, and the runs test [6, 12]. More advanced texts provide details for administering these nonparametric tests. Keep in mind that if the statistical analysis presented in this text is used for populations or processes that are not normally distributed, the results may be of little value, and the investigator may miss important findings from the data.
Assuming our data represents a normally distributed population or process, we are now prepared to perform a variety of statistical tests that allow us to test hypotheses about the equality of means and variances across two or more populations. Remember that for normal distributions, only the mean and standard deviation are required to completely characterize the probabilistic nature of the population or process. Thus, if we are to compare two normally distributed populations, we need only compare the means and variances of the populations. If the populations or processes are not normally distributed, there may be other parameters, such as skew and kurtosis, which differentiate two or more populations or processes.
5.1COMPARISON OF POPULATION MEANS
One of the most fundamental questions asked by scientists or engineers performing experiments is whether two populations, methods, or treatments are really different in central tendency. More specifically, are the two population means, reflected in the sample means collected under two different experimental conditions, significantly different? Or, is the observed difference between two means simply because of chance alone?