stats_test_from_scratch

One issue with Python is that there is no unified source for statistical tests like there is for ML with scikit-learn or Deep Learning with Keras.

Statistical tests are scattered across Scipy, Statsmodels, and other, more obscure Python packages, and it's not always clear which library carries which test (some tests even appear in more than one library). Additionally, there are some tests available in R that aren't currently supported by any of these libraries.

My goal is twofold:

  1. Implement the statistical tests from scratch, with an emphasis on making the code as presentable and easy to understand as possible. This is so anyone can see exactly what each test is measuring and how it measures it, even if that means sacrificing computational speed in the process (see the illustrative sketch after this list).
  2. Identify where in Python you can find each test, if anywhere. That way, those who want the fastest implementation will know where to find it.
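
As a flavor of the first goal, here is a minimal from-scratch sketch of a one-sample z-test; it is illustrative only (not the repository's actual code) and favors readability over speed.

```python
# Illustrative sketch of a from-scratch one-sample z-test (not the
# repository's code): compare a sample mean against a known population
# mean and standard deviation.
import math
from statistics import mean

from scipy.stats import norm


def one_sample_z_test(sample, pop_mean, pop_std):
    """Return the z statistic and two-sided p-value."""
    standard_error = pop_std / math.sqrt(len(sample))
    z = (mean(sample) - pop_mean) / standard_error
    p_value = 2 * norm.sf(abs(z))  # two-sided tail probability
    return z, p_value


if __name__ == "__main__":
    data = [2.1, 1.9, 2.4, 2.3, 2.0, 2.2, 1.8, 2.5]
    print(one_sample_z_test(data, pop_mean=2.0, pop_std=0.3))
```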

Statistical Tests currently supported and where to find them:

Sample Tests

  1. One and two sample Z Tests: Statsmodels through ztest. Used to determine if a sample differs significantly from the normally distributed population we are evaluating, or if the distributions of two samples from normally distributed populations differ.
  2. One and two sample T Tests: Scipy through ttest_1samp and ttest_ind. Used to determine if the sample differs significantly from the normally distributed population (with unknown sample variance), or if the means of two samples from a normally distributed population differ.
  3. Trimmed Means T Test: Not found in either scipy or statsmodels. Used to measure central tendency when our two samples violate the assumption of normality.
  4. Yuen-Welch Test: Not found in either scipy or statsmodels. Used to measure central tendency when our two samples violate the assumptions of normality and equality of variances.
  5. Two Sample F Test: Not found in either scipy or statsmodels. Used to determine if the variances of two populations are equal.
  6. Binomial Sign Test: Statsmodels through sign_test. Used to determine if there are consistent significant differences between pairs of data, such as before-and-after treatments.
  7. Wald-Wolfowitz Test: Statsmodels through runstest_1samp. Used to determine if the elements of a dataset are mutually independent.
  8. Trinomial Test: Not found in either scipy or statsmodels. Used as a replacement for the sign test when there are ties in the data.
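
A hedged usage sketch for the sample tests above that do ship with scipy or statsmodels (the data below are made up for illustration):

```python
import numpy as np
from scipy import stats
from statsmodels.stats.weightstats import ztest
from statsmodels.stats.descriptivestats import sign_test
from statsmodels.sandbox.stats.runs import runstest_1samp

# Two made-up samples drawn from normal distributions.
x = np.random.default_rng(0).normal(loc=0.0, scale=1.0, size=50)
y = np.random.default_rng(1).normal(loc=0.3, scale=1.0, size=50)

print(ztest(x, value=0.0))               # one-sample z-test
print(ztest(x, y))                       # two-sample z-test
print(stats.ttest_1samp(x, 0.0))         # one-sample t-test
print(stats.ttest_ind(x, y))             # two-sample t-test
print(sign_test(x, mu0=0.0))             # binomial sign test
print(runstest_1samp(x, cutoff="mean"))  # Wald-Wolfowitz runs test
```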

Rank Tests

  1. Wilcoxon Signed-Rank Test: Scipy through wilcoxon. Used to determine if two related or paired samples have different mean ranks.
  2. Mann-Whitney-U Test: Scipy through mannwhitneyu. Used to determine if a randomly selected value from one ordinal population will be less or greater than a randomly selected value from a second ordinal population.
  3. Friedman Test: Scipy through friedmanchisquare. Used to determine if there are any differences in treatments across multiple test attempts.
  4. Quade Test: Not found in either scipy or statsmodels. Used to determine if there is at least one treatment that is different from the others.
  5. Page's Trend Test: Not found in either scipy or statsmodels. Used to determine if the central tendency for all treatments is the same, or there is an order to them.
  6. Kruskal-Wallis Test: Scipy through kruskal. Used to determine if two or more samples originate from the same distribution.
  7. Fligner-Killeen Test: Scipy through fligner. Used to determine if two or more samples have the same variances without the assumption of normality.
  8. Ansari-Bradley Test: Scipy through ansari. Used to determine if two samples have the same dispersion (distance from the median).
  9. Mood Test for Dispersion: Scipy through mood. Used to determine if two samples have the same dispersion for their ranks.
  10. Cucconi Test: Not found in either scipy or statsmodels. Used to determine if the central tendency and variability of two samples are the same.
  11. Lepage Test: Not found in either scipy or statsmodels. Used to determine if the central tendency and variability of two samples are the same.
  12. Conover Test: Not found in either scipy or statsmodels. Used to determine if the variances of multiple groups are the same.
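
A hedged usage sketch for the rank tests above that scipy does provide (the three small samples are made up for illustration):

```python
from scipy import stats

# Three made-up samples of equal length (Friedman requires equal lengths).
a = [1.83, 0.50, 1.62, 2.48, 1.68, 1.88, 1.55, 3.06, 1.30]
b = [0.878, 0.647, 0.598, 2.05, 1.06, 1.29, 1.06, 3.14, 1.29]
c = [1.02, 0.90, 0.70, 2.10, 1.10, 1.20, 0.95, 2.90, 1.15]

print(stats.wilcoxon(a, b))              # Wilcoxon signed-rank (paired)
print(stats.mannwhitneyu(a, b))          # Mann-Whitney U
print(stats.friedmanchisquare(a, b, c))  # Friedman test
print(stats.kruskal(a, b, c))            # Kruskal-Wallis
print(stats.fligner(a, b, c))            # Fligner-Killeen
print(stats.ansari(a, b))                # Ansari-Bradley
print(stats.mood(a, b))                  # Mood test for dispersion
```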

Categorical Tests

  1. Chi Square Test: Scipy through chi2_contingency. Used to determine if the distribution of our contingency table is consistent with its row and column sums.
  2. G Test: Scipy through chi2_contingency(lambda_="log-likelihood"). Used to determine the likelihood that our contingency table follows the distribution implied by its row and column sums.
  3. Fisher Test: Scipy through fisher_exact. Used to determine the exact likelihood that we would observe a measurement that is more extreme than our expected results.
  4. McNemar Test: Statsmodels through mcnemar. Used to determine if the marginal row and column probabilities are equal.
  5. Cochran–Mantel–Haenszel Test: Statsmodels through StratifiedTable.test_null_odds. Used to determine if there is an association between a binary predictor/treatment and a binary outcome across all strata.
  6. Woolf Test: Not found in either scipy or statsmodels. Used to determine if there exists the same log odds across all strata.
  7. Breslow-Day Test: Found in statsmodels as StratifiedTable.test_equal_odds(). Used to determine if there exists the same odds ratio across all strata.
  8. Bowker Test: Found in statsmodels as TableSymmetry or as bowker_symmetry. Used to determine if the proportions between two treatments are symmetrical.
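
A hedged usage sketch for the categorical tests above that scipy and statsmodels provide (the 2x2 tables are made up for illustration):

```python
import numpy as np
from scipy.stats import chi2_contingency, fisher_exact
from statsmodels.stats.contingency_tables import StratifiedTable, mcnemar

table = np.array([[10, 20],
                  [30, 25]])

print(chi2_contingency(table))                            # chi square test
print(chi2_contingency(table, lambda_="log-likelihood"))  # G test
print(fisher_exact(table))                                # Fisher exact test
print(mcnemar(table))                                     # McNemar test

# Two strata of 2x2 tables for the CMH and Breslow-Day tests.
strata = [np.array([[10, 20], [30, 25]]),
          np.array([[12, 18], [28, 30]])]
st = StratifiedTable(strata)
print(st.test_null_odds())   # Cochran-Mantel-Haenszel
print(st.test_equal_odds())  # Breslow-Day
```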

Multi-Group Tests

  1. Levene Test: Scipy through levene(center='mean'). Used to determine the equality of group variances using the distance from the mean.
  2. Brown-Forsythe Test: Scipy through levene(center='median'). Used to determine the equality of group variances using the distance from the median.
  3. One Way F-Test: Scipy through f_oneway. Used to determine the equality of group means.
  4. Bartlett Test: Scipy through bartlett. Used to determine the equality of group variances using the likelihood ratio.
  5. Tukey Range Test: Statsmodels through pairwise_tukeyhsd. Used to determine the equality of means for all sample pairs.
  6. Cochran's Q Test: Statsmodels through cochrans_q. Used to determine if the treatments (as measured by a binary response variable) have identical effects, i.e. are equally effective.
  7. Jonckheere Trend Test: Not found in either scipy or statsmodels. Used to determine if the group medians follow an a priori ordering.
  8. Mood Median Test: Scipy through median_test. Used to test the equality of group medians.
  9. Dunnett Test: Not found in either scipy or statsmodels. Used as a post-hoc test after ANOVA to determine which groups differ significantly from the control group.
  10. Duncan's New Multiple Range Test: Not found in either scipy or statsmodels. Used as a post-hoc test after ANOVA to determine which group means differ significantly from one another.
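
A hedged usage sketch for the multi-group tests above that scipy and statsmodels provide (the groups and binary responses are made up for illustration):

```python
import numpy as np
from scipy import stats
from statsmodels.stats.contingency_tables import cochrans_q
from statsmodels.stats.multicomp import pairwise_tukeyhsd

g1 = [24.5, 23.5, 26.4, 27.1, 29.9]
g2 = [28.4, 34.2, 29.5, 32.2, 30.1]
g3 = [26.1, 28.3, 24.3, 26.2, 27.8]

print(stats.levene(g1, g2, g3, center="mean"))    # Levene
print(stats.levene(g1, g2, g3, center="median"))  # Brown-Forsythe
print(stats.f_oneway(g1, g2, g3))                 # one-way F-test
print(stats.bartlett(g1, g2, g3))                 # Bartlett
print(stats.median_test(g1, g2, g3))              # Mood median test

values = np.concatenate([g1, g2, g3])
labels = ["g1"] * 5 + ["g2"] * 5 + ["g3"] * 5
print(pairwise_tukeyhsd(values, labels))          # Tukey range test

# Cochran's Q expects a subjects-by-treatments matrix of binary responses.
binary = np.array([[1, 1, 0], [1, 0, 0], [0, 1, 0], [1, 1, 1],
                   [1, 0, 1], [0, 1, 0], [1, 1, 0], [1, 0, 0]])
print(cochrans_q(binary))
```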

Proportion Tests

  1. One and two sample Proportion Z Tests: Statsmodels through proportions_ztest. Used to determine if a sample proportion differs from a hypothesized population proportion, or if two sample proportions differ from each other.
  2. Binomial Test: Scipy through binom_test (binomtest in newer versions of scipy). Used to determine if the sample follows a given binomial distribution.
  3. Chi Square Proportion Test: Not found in either scipy or statsmodels. Used to determine if the proportion within groups follows a population distribution.
  4. G Proportion Test: Not found in either scipy or statsmodels. Used to determine if the distribution of groups follows a population distribution.
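
A hedged usage sketch for the proportion tests above that are available in these libraries (the counts are made up for illustration):

```python
from scipy import stats
from statsmodels.stats.proportion import proportions_ztest

# One-sample: 42 successes out of 100 trials against a null proportion of 0.5.
print(proportions_ztest(count=42, nobs=100, value=0.5))

# Two-sample: compare 42/100 against 57/110.
print(proportions_ztest(count=[42, 57], nobs=[100, 110]))

# Binomial test (binomtest in newer scipy releases, binom_test in older ones).
print(stats.binomtest(42, n=100, p=0.5))
```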

Goodness of Fit Tests

  1. Shapiro-Wilk Test: Scipy through shapiro. Used to determine if a random sample is derived from a normal distribution.
  2. Chi Goodness of Fit Test: Scipy through chisquare. Used to determine if the distribution of groups follows an expected result.
  3. G Goodness of Fit Test: Scipy through power_divergence(lambda_="log-likelihood"). Used to determine if the distribution of groups follows an expected result.
  4. Jarque-Bera Test: Statsmodels through jarque_bera. Used to determine if the sample's skew and kurtosis follow that of a normal distribution.
  5. Ljung-Box Test: Statsmodels through acorr_ljungbox(boxpierce=False). Used to determine if the autocorrelations of a series are equal to 0.
  6. Box-Pierce Test: Statsmodels through acorr_ljungbox(boxpierce=True). Used to determine if the autocorrelations of a series are equal to 0.
  7. Skew Test: Scipy through skewtest. Used to determine if the sample is normally distributed through its skew.
  8. Kurtosis Test: Scipy through kurtosistest. Used to determine if a sample is normally distributed through its kurtosis.
  9. K-Squared Test: Scipy through normaltest. Used to determine if a sample is normally distributed through its skew and kurtosis.
  10. Lilliefors Test: Statsmodels through lilliefors. Used to determine if a sample is normally distributed.
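
A hedged usage sketch for the goodness-of-fit tests above (the sample and observed counts are made up for illustration):

```python
import numpy as np
from scipy import stats
from statsmodels.stats.diagnostic import acorr_ljungbox, lilliefors
from statsmodels.stats.stattools import jarque_bera

x = np.random.default_rng(0).normal(size=200)  # made-up sample
observed = [16, 18, 16, 14, 12, 12]            # made-up category counts

print(stats.shapiro(x))                                            # Shapiro-Wilk
print(stats.chisquare(observed))                                   # chi goodness of fit
print(stats.power_divergence(observed, lambda_="log-likelihood"))  # G goodness of fit
print(jarque_bera(x))                                              # Jarque-Bera
print(acorr_ljungbox(x, lags=[10], boxpierce=True))                # Ljung-Box + Box-Pierce
print(stats.skewtest(x))                                           # skew test
print(stats.kurtosistest(x))                                       # kurtosis test
print(stats.normaltest(x))                                         # K-squared test
print(lilliefors(x))                                               # Lilliefors
```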

Correlation Tests

  1. Pearson Test: Scipy through pearsonr. Used to determine the linear correlation between two variables.
  2. Spearman Rank Test: Scipy through spearmanr. Used to determine the correlation between the ranks of two variables.
  3. Kendall-Tau Test: Scipy through kendalltau. Used to determine the correlation between two ordinal variables.
  4. Point Biserial Correlation: Scipy through pointbiserialr. Used to determine the correlation between two variables when one of them is dichotomous.
  5. Rank Biserial Correlation: Not found in either scipy or statsmodels. Used to determine the correlation between two variables when one of them is dichotomous and the other consists of ranks.
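
A hedged usage sketch for the correlation tests above that scipy provides (the paired data are made up for illustration):

```python
from scipy import stats

x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
y = [2.1, 1.9, 3.5, 4.4, 4.9, 6.3, 6.8, 8.2]
group = [0, 0, 0, 1, 0, 1, 1, 1]  # dichotomous variable for point biserial

print(stats.pearsonr(x, y))            # Pearson
print(stats.spearmanr(x, y))           # Spearman rank
print(stats.kendalltau(x, y))          # Kendall tau
print(stats.pointbiserialr(group, y))  # point biserial
```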

Outliers Tests

  1. Tukey's Fence Test: Not found in either scipy or statsmodels. Used to determine outliers based on their distance from the first or third quartile.
  2. Grubbs' Test: Not found in either scipy or statsmodels. Used to determine if there exists a single outlier in the dataset.
  3. Extreme Studentized Deviate (ESD) Test: Not found in either scipy or statsmodels. Used to determine if there exist up to k outliers in the dataset, with k specified by the user.
  4. Tietjen-Moore Test: Not found in either scipy or statsmodels. Used to determine if there exist exactly k outliers in the dataset, with k specified by the user.
  5. Chauvenet Test: Not found in either scipy or statsmodels. Used to determine outliers based on Chauvenet's criterion.
  6. Peirce Test: Not found in either scipy or statsmodels. Used to determine outliers based on Peirce's criterion.
  7. Dixon's Q Test: Not found in either scipy or statsmodels. Used to determine outliers based on the Q values.
  8. Thompson-Tau Test: Not found in either scipy or statsmodels. Used to determine outliers based on the Thompson-Tau criteria.
  9. MAD-Median Test: Not found in either scipy or statsmodels. Used to determine outliers based on the median absolute deviation (MAD)-median rule.
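
Since none of the outlier tests above ship with scipy or statsmodels, here is a minimal from-scratch sketch of the simplest one, Tukey's fences; it is illustrative only, not the repository's actual code.

```python
# Illustrative sketch (not the repository's code): Tukey's fences flag points
# more than k interquartile ranges outside the first or third quartile.
import numpy as np


def tukeys_fences(data, k=1.5):
    """Return the values in `data` lying outside Tukey's fences."""
    q1, q3 = np.percentile(data, [25, 75])
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [x for x in data if x < lower or x > upper]


print(tukeys_fences([2.1, 2.3, 2.2, 2.4, 2.5, 2.3, 9.7]))  # -> [9.7]
```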
