If outliers are present, or if the data in fact come from a normal distribution, then the rank sum test may not be the most powerful test available, and this could mean the difference between detecting a true difference or not. Another nonparametric test, the unpaired two-sample t test, or employing a transformation may result in a more powerful test. If the population dispersions are unequal, a transformation may produce comparable dispersions.
Often, the effect of an assumption violation on the rank sum test result depends on the extent of the violation (such as the how unequal the population disperions are, or how skewed one or the other population distribution is). Some small violations may have little practical effect on the analysis, while other violations may render the rank sum test result uselessly incorrect or uninterpretable. In particular, small sample sizes can increase vulnerability to assumption violations.
Outliers tend to increase the estimate of sample variation, and might lead to an incorrect conclusion that the dispersions of the two samples are not equal if the outlier is the result of a recording or measurement error.
Because the statistic for the rank sum test is resistant, it will not be substantially affected by the presence of outliers unless the number of outliers becomes large relative to the sample size.
The boxplot and normal probability plot (normal Q-Q plot) may suggest the presence of outliers in the data.
If you find outliers in your data that are not due to correctable errors, you may wish to consult a statistician as to how to proceed.
If both outliers and unequal dispersions are present, employing a transformation may resolve both problems at once, and also promote normality. In this case, it may be preferable to perform an unpaired two-sample t test on the transformed data, as the t test has slightly more power than the rank sum test if the assumption of normality holds. (The rank sum test has about 95% efficiency compared to the unpaired t test if the assumption is in fact correct.)
The usual measurement for sample variance is not resistant to outliers, while the Ansari-Bradley test is less subject to influence by outliers. For this reason, the Ansari-Bradley test may not reject equality of dispersions even when the sample variances seem to be substantial different. A lack of power due to small sample sizes may also lead to this situation.
Differences in distributional shapes can be assessed by examination of the data, as with boxplots, histograms, and normal probability plots. Differing results for each sample for the normality test also suggest the possibility of differing distributional shapes.
A fan pattern like the profile of a megaphone, with a
noticeable flare either to the right or to the left
as shown in the picture (one of the "stacks" of data
points is much longer than the other), suggests that
the variation in the values increases in the direction
the fan pattern widens (usually as the sample mean increases), and this in
turn suggests that a transformation
may be needed.
Side-by-side boxplots of the two samples can
also reveal lack of homogeneity of dispersion
if one boxplot is much longer than the other, and reveal suspected outliers.
Even if none of the test assumptions are violated, a rank sum test with small sample sizes may not have sufficient power to detect a significant difference between the two samples, even if the medians are in fact different. Power decreases as the significance level is decreased (i.e., as the test is made more stringent), and increases as the sample size increases. With very small samples, even samples from populations with very different medians may not produce a significant rank sum test statistic. If a statistical significance test with small sample sizes produces a surprisingly non-significant P value, then a lack of power may be the reason. The best time to avoid such problems is in the design stage of an experiment, when appropriate minimum sample sizes can be determined, perhaps in consultation with a statistician, before data collection begins.
Examine the glossary.
Do a keyword search of PROPHET
StatGuide.
Back to StatGuide testing equality of means/location page.
Back to StatGuide nonparametric tests page.
Back to StatGuide rank sum test page.
Back to StatGuide home page.
©1996 BBN Corporation All rights reserved.