An implicit factor may also separate the data into different distributions, each of which may be normal, but which produce a nonnormal composite distribution. For example, measurements for females may follow a normal distribution, and measurements for males may also follow a normal distribution, but the measurements for the entire population of both males and females may not follow a normal distribution. Depending on the relative proportions of sampled data from each underlying normal distribution, and on the means and variances of each distribution, the composite mixture distribution may appear to be skewed, or to have nonnormal kurtosis, or both. Separating the data into different subsamples based on the value of the implicit factor may reveal that, conditional on the value of the implicit factor (e.g., gender), the data are sampled from a normal distribution, even if it is a different distribution for each value of the implicit factor.
Of course, an implicit factor may also separate the data into different nonnormal distributions. And if one or more of the subsamples has a small sample size, the test may fail to detect nonnormality due to a lack of power.
The boxplot, histogram, and normal probability plot (normal Q-Q plot), along with the normality test, can provide information on the normality of the population distribution. However, if there are only a small number of data points, nonnormality can be hard to detect with any of these methods. If there are a great many data points, the normality test may detect statistically significant but trivial departures from normality that may be of no practical importance.
For data sampled from a normal distribution, normal probability plots should approximate straight lines, and boxplots should be symmetric (median and mean together, in the middle of the box) with few if any outliers.
Even if none of the test assumptions are violated, a normality test with small sample sizes may not have sufficient power to detect a significant departure from normality, even if it is present. Power decreases as the significance level is decreased (i.e., as the test is made more stringent), and increases as the sample size increases. If a statistical significance test with small sample sizes produces a surprisingly non-significant P value, then a lack of power may be the reason. The best time to avoid such problems is in the design stage of an experiment, perhaps in consultation with a statistician, when appropriate minimum sample sizes can be determined before data collection begins.
Examine the glossary.
Do a keyword search of PROPHET
StatGuide.
Back to StatGuide distribution tests page.
Back to StatGuide normal distribution tests page.
Back to StatGuide home page.
©1996 BBN Corporation All rights reserved.