Skip to content

walkabout is a package for rapid exploratory data analysis

License

Notifications You must be signed in to change notification settings

yadevi/Walkabout-An_EDA_Package

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

67 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Walkabout

Walkabout is a Python package to rapidly perform common exploratory data analysis on all features of a Pandas DataFrame. A single line of code is all you need to plot the distribution of all features, view detailed summary stats, find which feature interactions are likely to be important, and more. Walkabout is all about quick, easy discovery about your entire dataframe.

What's in a name?

A Walkabout is a term from Australian Aborigines and is a journey of discovery and of self. In earlier times, it was known as a rite of passage where male Aborigine adolescents would embark on a journey into the wilderness, sometimes for as long as six months.

Installation

Use the package manager test.pypi (https://test.pypi.org/project/walkabout/)

pip install -i https://test.pypi.org/simple/ walkabout

Usage

If you would like to see all the functionality available in walkabout, check out the example notebook:

Example Notebook

import walkabout as wa

# Display summary statistics including nulls, data types, unqiue values, and shape
wa.report.rundown(df)

# Plot distribution graphs for all features
wa.plot.univariate_distribution(df)

# Display skew, kurtosis, and basic translation of skew value for all numeric features.
wa.plot.numeric_distribution(df)

# Plot box plots for all features, either univariate or bivariate
wa.plot.boxplot(df, 'age')

# Quickly find which features might be important
wa.report.simple_feature_importance(X, y, model='clas')

Contributing

If you are interested in contributing, have feature requests, or bugs - please reach out to me.

You may contact me via email at bundickm@gmail.com.

License

MIT

About

walkabout is a package for rapid exploratory data analysis

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%