This module contains a large number of probability distributions as well as a growing library of statistical functions.
Each included distribution is an instance of the class rv_continous: For each given name the following methods are available:
| rv_continuous([momtype, a, b, xa, xb, xtol, ...]) | A generic continuous random variable class meant for subclassing. |
| rv_continuous.rvs(*args, **kwds) | Random variates of given type. |
| rv_continuous.pdf(x, *args, **kwds) | Probability density function at x of the given RV. |
| rv_continuous.logpdf(x, *args, **kwds) | Log of the probability density function at x of the given RV. |
| rv_continuous.cdf(x, *args, **kwds) | Cumulative distribution function at x of the given RV. |
| rv_continuous.logcdf(x, *args, **kwds) | Log of the cumulative distribution function at x of the given RV. |
| rv_continuous.sf(x, *args, **kwds) | Survival function (1-cdf) at x of the given RV. |
| rv_continuous.logsf(x, *args, **kwds) | Log of the survival function of the given RV. |
| rv_continuous.ppf(q, *args, **kwds) | Percent point function (inverse of cdf) at q of the given RV. |
| rv_continuous.isf(q, *args, **kwds) | Inverse survival function at q of the given RV. |
| rv_continuous.moment(n, *args, **kwds) | n’th order non-central moment of distribution |
| rv_continuous.stats(*args, **kwds) | Some statistics of the given RV |
| rv_continuous.entropy(*args, **kwds) | Differential entropy of the RV. |
| rv_continuous.fit(data, *args, **kwds) | Return MLEs for shape, location, and scale parameters from data. |
| rv_continuous.expect([func, args, loc, ...]) | calculate expected value of a function with respect to the distribution |
| rv_continuous.median(*args, **kwds) | Median of the distribution. |
| rv_continuous.mean(*args, **kwds) | Mean of the distribution |
| rv_continuous.var(*args, **kwds) | Variance of the distribution |
| rv_continuous.std(*args, **kwds) | Standard deviation of the distribution. |
| rv_continuous.interval(alpha, *args, **kwds) | Confidence interval with equal areas around the median |
Calling the instance as a function returns a frozen pdf whose shape, location, and scale parameters are fixed.
Similarly, each discrete distribution is an instance of the class rv_discrete:
| rv_discrete([a, b, name, badvalue, ...]) | A generic discrete random variable class meant for subclassing. |
| rv_discrete.rvs(*args, **kwargs) | Random variates of given type. |
| rv_discrete.pmf(k, *args, **kwds) | Probability mass function at k of the given RV. |
| rv_discrete.logpmf(k, *args, **kwds) | Log of the probability mass function at k of the given RV. |
| rv_discrete.cdf(k, *args, **kwds) | Cumulative distribution function at k of the given RV |
| rv_discrete.logcdf(k, *args, **kwds) | Log of the cumulative distribution function at k of the given RV |
| rv_discrete.sf(k, *args, **kwds) | Survival function (1-cdf) at k of the given RV |
| rv_discrete.logsf(k, *args, **kwds) | Log of the survival function (1-cdf) at k of the given RV |
| rv_discrete.ppf(q, *args, **kwds) | Percent point function (inverse of cdf) at q of the given RV |
| rv_discrete.isf(q, *args, **kwds) | Inverse survival function (1-sf) at q of the given RV |
| rv_discrete.stats(*args, **kwds) | Some statistics of the given discrete RV |
| rv_discrete.moment(n, *args, **kwds) | n’th non-central moment of the distribution |
| rv_discrete.entropy(*args, **kwds) | |
| rv_discrete.expect([func, args, loc, lb, ...]) | calculate expected value of a function with respect to the distribution |
| rv_discrete.median(*args, **kwds) | Median of the distribution. |
| rv_discrete.mean(*args, **kwds) | Mean of the distribution |
| rv_discrete.var(*args, **kwds) | Variance of the distribution |
| rv_discrete.std(*args, **kwds) | Standard deviation of the distribution. |
| rv_discrete.interval(alpha, *args, **kwds) | Confidence interval with equal areas around the median |
| norm | A normal continuous random variable. |
| alpha | An alpha continuous random variable. |
| anglit | An anglit continuous random variable. |
| arcsine | An arcsine continuous random variable. |
| beta | A beta continuous random variable. |
| betaprime | A beta prima continuous random variable. |
| bradford | A Bradford continuous random variable. |
| burr | A Burr continuous random variable. |
| cauchy | A Cauchy continuous random variable. |
| chi | A chi continuous random variable. |
| chi2 | A chi-squared continuous random variable. |
| cosine | A cosine continuous random variable. |
| dgamma | A double gamma continuous random variable. |
| dweibull | A double Weibull continuous random variable. |
| erlang | An Erlang continuous random variable. |
| expon | An exponential continuous random variable. |
| exponweib | An exponentiated Weibull continuous random variable. |
| exponpow | An exponential power continuous random variable. |
| f | An F continuous random variable. |
| fatiguelife | A fatigue-life (Birnbaum-Sanders) continuous random variable. |
| fisk | A Fisk continuous random variable. |
| foldcauchy | A folded Cauchy continuous random variable. |
| foldnorm | A folded normal continuous random variable. |
| frechet_r | A Frechet right (or Weibull minimum) continuous random variable. |
| frechet_l | A Frechet left (or Weibull maximum) continuous random variable. |
| genlogistic | A generalized logistic continuous random variable. |
| genpareto | A generalized Pareto continuous random variable. |
| genexpon | A generalized exponential continuous random variable. |
| genextreme | A generalized extreme value continuous random variable. |
| gausshyper | A Gauss hypergeometric continuous random variable. |
| gamma | A gamma continuous random variable. |
| gengamma | A generalized gamma continuous random variable. |
| genhalflogistic | A generalized half-logistic continuous random variable. |
| gilbrat | A Gilbrat continuous random variable. |
| gompertz | A Gompertz (or truncated Gumbel) continuous random variable. |
| gumbel_r | A right-skewed Gumbel continuous random variable. |
| gumbel_l | A left-skewed Gumbel continuous random variable. |
| halfcauchy | A Half-Cauchy continuous random variable. |
| halflogistic | A half-logistic continuous random variable. |
| halfnorm | A half-normal continuous random variable. |
| hypsecant | A hyperbolic secant continuous random variable. |
| invgamma | An inverted gamma continuous random variable. |
| invgauss | An inverse Gaussian continuous random variable. |
| invweibull | An inverted Weibull continuous random variable. |
| johnsonsb | A Johnson SB continuous random variable. |
| johnsonsu | A Johnson SU continuous random variable. |
| ksone | General Kolmogorov-Smirnov one-sided test. |
| kstwobign | Kolmogorov-Smirnov two-sided test for large N. |
| laplace | A Laplace continuous random variable. |
| logistic | A logistic continuous random variable. |
| loggamma | A log gamma continuous random variable. |
| loglaplace | A log-Laplace continuous random variable. |
| lognorm | A lognormal continuous random variable. |
| lomax | A Lomax (Pareto of the second kind) continuous random variable. |
| maxwell | A Maxwell continuous random variable. |
| mielke | A Mielke’s Beta-Kappa continuous random variable. |
| nakagami | A Nakagami continuous random variable. |
| ncx2 | A non-central chi-squared continuous random variable. |
| ncf | A non-central F distribution continuous random variable. |
| nct | A non-central Student’s T continuous random variable. |
| pareto | A Pareto continuous random variable. |
| powerlaw | A power-function continuous random variable. |
| powerlognorm | A power log-normal continuous random variable. |
| powernorm | A power normal continuous random variable. |
| rdist | An R-distributed continuous random variable. |
| reciprocal | A reciprocal continuous random variable. |
| rayleigh | A Rayleigh continuous random variable. |
| rice | A Rice continuous random variable. |
| recipinvgauss | A reciprocal inverse Gaussian continuous random variable. |
| semicircular | A semicircular continuous random variable. |
| t | A Student’s T continuous random variable. |
| triang | A triangular continuous random variable. |
| truncexpon | A truncated exponential continuous random variable. |
| truncnorm | A truncated normal continuous random variable. |
| tukeylambda | A Tukey-Lamdba continuous random variable. |
| uniform | A uniform continuous random variable. |
| vonmises | A Von Mises continuous random variable. |
| wald | A Wald continuous random variable. |
| weibull_min | A Frechet right (or Weibull minimum) continuous random variable. |
| weibull_max | A Frechet left (or Weibull maximum) continuous random variable. |
| wrapcauchy | A wrapped Cauchy continuous random variable. |
| binom | A binom discrete random variable. |
| bernoulli | A bernoulli discrete random variable. |
| nbinom | A negative binomial discrete random variable. |
| geom | A geometric discrete random variable. |
| hypergeom | A hypergeometric discrete random variable. |
| logser | A logarithmic discrete random variable. |
| poisson | A Poisson discrete random variable. |
| planck | A discrete exponential discrete random variable. |
| boltzmann | A truncated discrete exponential discrete random variable. |
| randint | A discrete uniform (random integer) discrete random variable. |
| zipf | A Zipf discrete random variable. |
| dlaplace | A discrete Laplacian discrete random variable. |
Several of these functions have a similar version in scipy.stats.mstats which work for masked arrays.
| gmean(a[, axis, dtype]) | Compute the geometric mean along the specified axis. |
| hmean(a[, axis, dtype]) | Calculates the harmonic mean along the specified axis. |
| mean | |
| cmedian(a[, numbins]) | Returns the computed median value of an array. |
| median | |
| mode(a[, axis]) | Returns an array of the modal (most common) value in the passed array. |
| tmean(a[, limits, inclusive]) | Compute the trimmed mean |
| tvar(a[, limits, inclusive]) | Compute the trimmed variance |
| tmin(a[, lowerlimit, axis, inclusive]) | Compute the trimmed minimum |
| tmax(a, upperlimit[, axis, inclusive]) | Compute the trimmed maximum |
| tstd(a[, limits, inclusive]) | Compute the trimmed sample standard deviation |
| tsem(a[, limits, inclusive]) | Compute the trimmed standard error of the mean |
| moment(a[, moment, axis]) | Calculates the nth moment about the mean for a sample. |
| variation(a[, axis]) | Computes the coefficient of variation, the ratio of the biased standard deviation to the mean. |
| skew(a[, axis, bias]) | Computes the skewness of a data set. |
| kurtosis(a[, axis, fisher, bias]) | Computes the kurtosis (Fisher or Pearson) of a dataset. |
| describe(a[, axis]) | Computes several descriptive statistics of the passed array. |
| skewtest(a[, axis]) | Tests whether the skew is different from the normal distribution. |
| kurtosistest(a[, axis]) | Tests whether a dataset has normal kurtosis |
| normaltest(a[, axis]) | Tests whether a sample differs from a normal distribution. |
| itemfreq(a) | Returns a 2D array of item frequencies. |
| scoreatpercentile(a, per[, limit]) | Calculate the score at the given per percentile of the sequence a. |
| percentileofscore(a, score[, kind]) | The percentile rank of a score relative to a list of scores. |
| histogram2(a, bins) | Compute histogram using divisions in bins. |
| histogram(a[, numbins, defaultlimits, ...]) | Separates the range into several bins and returns the number of instances of a in each bin. |
| cumfreq(a[, numbins, defaultreallimits, weights]) | Returns a cumulative frequency histogram, using the histogram function. |
| relfreq(a[, numbins, defaultreallimits, weights]) | Returns a relative frequency histogram, using the histogram function. |
| obrientransform(*args) | Computes a transform on input data (any number of columns). |
| signaltonoise(a[, axis, ddof]) | The signal-to-noise ratio of the input data. |
| bayes_mvs(data[, alpha]) | Bayesian confidence intervals for the mean, var, and std. |
| sem(a[, axis, ddof]) | Calculates the standard error of the mean (or standard error of measurement) of the values in the input array. |
| zmap(scores, compare[, axis, ddof]) | Calculates the relative z-scores. |
| zscore(a[, axis, ddof]) | Calculates the z score of each value in the sample, relative to the sample mean and standard deviation. |
| threshold(a[, threshmin, threshmax, newval]) | Clip array to a given value. |
| trimboth(a, proportiontocut) | Slices off a proportion of items from both ends of an array. |
| trim1(a, proportiontocut[, tail]) | Slices off a proportion of items from ONE end of the passed array |
| f_oneway(*args) | Performs a 1-way ANOVA. |
| pearsonr(x, y) | Calculates a Pearson correlation coefficient and the p-value for testing |
| spearmanr(a[, b, axis]) | Calculates a Spearman rank-order correlation coefficient and the p-value |
| pointbiserialr(x, y) | Calculates a point biserial correlation coefficient and the associated p-value. |
| kendalltau(x, y[, initial_lexsort]) | Calculates Kendall’s tau, a correlation measure for ordinal data. |
| linregress(x[, y]) | Calculate a regression line |
| ttest_1samp(a, popmean[, axis]) | Calculates the T-test for the mean of ONE group of scores a. |
| ttest_ind(a, b[, axis]) | Calculates the T-test for the means of TWO INDEPENDENT samples of scores. |
| ttest_rel(a, b[, axis]) | Calculates the T-test on TWO RELATED samples of scores, a and b. |
| kstest(rvs, cdf[, args, N, alternative, mode]) | Perform the Kolmogorov-Smirnov test for goodness of fit |
| chisquare(f_obs[, f_exp, ddof]) | Calculates a one-way chi square test. |
| ks_2samp(data1, data2) | Computes the Kolmogorov-Smirnof statistic on 2 samples. |
| mannwhitneyu(x, y[, use_continuity]) | Computes the Mann-Whitney rank test on samples x and y. |
| tiecorrect(rankvals) | Tie-corrector for ties in Mann Whitney U and Kruskal Wallis H tests. |
| ranksums(x, y) | Compute the Wilcoxon rank-sum statistic for two samples. |
| wilcoxon(x[, y]) | Calculate the Wilcoxon signed-rank test. |
| kruskal(*args) | Compute the Kruskal-Wallis H-test for independent samples |
| friedmanchisquare(*args) | Computes the Friedman test for repeated measurements |
| ansari(x, y) | Perform the Ansari-Bradley test for equal scale parameters |
| bartlett(*args) | Perform Bartlett’s test for equal variances |
| levene(*args, **kwds) | Perform Levene test for equal variances. |
| shapiro(x[, a, reta]) | Perform the Shapiro-Wilk test for normality. |
| anderson(x[, dist]) | Anderson-Darling test for data coming from a particular distribution |
| binom_test(x[, n, p]) | Perform a test that the probability of success is p. |
| fligner(*args, **kwds) | Perform Fligner’s test for equal variances. |
| mood(x, y) | Perform Mood’s test for equal scale parameters. |
| oneway(*args, **kwds) | Test for equal means in two or more samples from the normal distribution. |
| fisher_exact(table[, alternative]) | Performs a Fisher exact test on a 2x2 contingency table. |
| chi2_contingency(observed[, correction]) | Chi-square test of independence of variables in a contingency table. |
| contingency.expected_freq(observed) | Compute the expected frequencies from a contingency table. |
| contingency.margins(a) | Return a list of the marginal sums of the array a. |
| probplot(x[, sparams, dist, fit, plot]) | Calculate quantiles for a probability plot of sample data against a specified theoretical distribution. |
| ppcc_max(x[, brack, dist]) | Returns the shape parameter that maximizes the probability plot correlation coefficient for the given data to a one-parameter family of distributions. |
| ppcc_plot(x, a, b[, dist, plot, N]) | Returns (shape, ppcc), and optionally plots shape vs. |
| gaussian_kde(dataset) | Representation of a kernel-density estimate using Gaussian kernels. |
For many more stat related functions install the software R and the interface package rpy.