scipy.stats.combine_pvalues¶

scipy.stats.combine_pvalues(pvalues, method='fisher', weights=None)[source]¶

Methods for combining the p-values of independent tests bearing upon the same hypothesis.

Parameters

pvaluesarray_like, 1-D

Array of p-values assumed to come from independent tests.

method{‘fisher’, ‘pearson’, ‘tippett’, ‘stouffer’, ‘mudholkar_george’},

optional.

Name of method to use to combine p-values. The following methods are available:

“fisher”: Fisher’s method (Fisher’s combined probability test), the default, the sum of the logarithm of the p-values.
“pearson”: Pearson’s method (similar to Fisher’s but uses sum of the complement of the p-values inside the logarithms).
“tippett”: Tippett’s method (minimum of p-values).
“stouffer”: Stouffer’s Z-score method.
“mudholkar_george”: the difference of Fisher’s and Pearson’s methods
divided by 2.

weightsarray_like, 1-D, optional

Optional array of weights used only for Stouffer’s Z-score method.

Returns

statistic: float: The statistic calculated by the specified method.
pval: float: The combined p-value.

Notes

Fisher’s method (also known as Fisher’s combined probability test) [1] uses a chi-squared statistic to compute a combined p-value. The closely related Stouffer’s Z-score method [2] uses Z-scores rather than p-values. The advantage of Stouffer’s method is that it is straightforward to introduce weights, which can make Stouffer’s method more powerful than Fisher’s method when the p-values are from studies of different size [6] [7]. The Pearson’s method uses \(log(1-p_i)\) inside the sum whereas Fisher’s method uses \(log(p_i)\) [4]. For Fisher’s and Pearson’s method, the sum of the logarithms is multiplied by -2 in the implementation. This quantity has a chisquare distribution that determines the p-value. The mudholkar_george method is the difference of the Fisher’s and Pearson’s test statistics, each of which include the -2 factor [4]. However, the mudholkar_george method does not include these -2 factors. The test statistic of mudholkar_george is the sum of logisitic random variables and equation 3.6 in [3] is used to approximate the p-value based on Student’s t-distribution.

Fisher’s method may be extended to combine p-values from dependent tests [5]. Extensions such as Brown’s method and Kost’s method are not currently implemented.

New in version 0.15.0.

References

1: https://en.wikipedia.org/wiki/Fisher%27s_method
2: https://en.wikipedia.org/wiki/Fisher%27s_method#Relation_to_Stouffer.27s_Z-score_method
3: George, E. O., and G. S. Mudholkar. “On the convolution of logistic random variables.” Metrika 30.1 (1983): 1-13.
4(1,2): Heard, N. and Rubin-Delanchey, P. “Choosing between methods of combining p-values.” Biometrika 105.1 (2018): 239-246.
5: Whitlock, M. C. “Combining probability from independent tests: the weighted Z-method is superior to Fisher’s approach.” Journal of Evolutionary Biology 18, no. 5 (2005): 1368-1373.
6: Zaykin, Dmitri V. “Optimally weighted Z-test is a powerful method for combining probabilities in meta-analysis.” Journal of Evolutionary Biology 24, no. 8 (2011): 1836-1841.
7: https://en.wikipedia.org/wiki/Extensions_of_Fisher%27s_method

Previous topic

scipy.stats.brunnermunzel

Next topic

scipy.stats.jarque_bera

scipy.stats.combine_pvalues¶

Previous topic

Next topic

Quick search