scipy.stats.levene¶
-
scipy.stats.
levene
(*args, **kwds)[source]¶ Perform Levene test for equal variances.
The Levene test tests the null hypothesis that all input samples are from populations with equal variances. Levene’s test is an alternative to Bartlett’s test
bartlett
in the case where there are significant deviations from normality.- Parameters
- sample1, sample2, …array_like
The sample data, possibly with different lengths. Only one-dimensional samples are accepted.
- center{‘mean’, ‘median’, ‘trimmed’}, optional
Which function of the data to use in the test. The default is ‘median’.
- proportiontocutfloat, optional
When center is ‘trimmed’, this gives the proportion of data points to cut from each end. (See
scipy.stats.trim_mean
.) Default is 0.05.
- Returns
- statisticfloat
The test statistic.
- pvaluefloat
The p-value for the test.
Notes
Three variations of Levene’s test are possible. The possibilities and their recommended usages are:
‘median’ : Recommended for skewed (non-normal) distributions>
‘mean’ : Recommended for symmetric, moderate-tailed distributions.
‘trimmed’ : Recommended for heavy-tailed distributions.
The test version using the mean was proposed in the original article of Levene ([2]) while the median and trimmed mean have been studied by Brown and Forsythe ([3]), sometimes also referred to as Brown-Forsythe test.
References
- 1
https://www.itl.nist.gov/div898/handbook/eda/section3/eda35a.htm
- 2
Levene, H. (1960). In Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling, I. Olkin et al. eds., Stanford University Press, pp. 278-292.
- 3
Brown, M. B. and Forsythe, A. B. (1974), Journal of the American Statistical Association, 69, 364-367
Examples
Test whether or not the lists a, b and c come from populations with equal variances.
>>> from scipy.stats import levene >>> a = [8.88, 9.12, 9.04, 8.98, 9.00, 9.08, 9.01, 8.85, 9.06, 8.99] >>> b = [8.88, 8.95, 9.29, 9.44, 9.15, 9.58, 8.36, 9.18, 8.67, 9.05] >>> c = [8.95, 9.12, 8.95, 8.85, 9.03, 8.84, 9.07, 8.98, 8.86, 8.98] >>> stat, p = levene(a, b, c) >>> p 0.002431505967249681
The small p-value suggests that the populations do not have equal variances.
This is not surprising, given that the sample variance of b is much larger than that of a and c:
>>> [np.var(x, ddof=1) for x in [a, b, c]] [0.007054444444444413, 0.13073888888888888, 0.008890000000000002]