forked from kristiewirth/datto

Data Tools (Dat To) is a package with various data tools to help in data analysis and data science work, such as natural language processing and machine learning techniques.


benhummel/datto

Installation

pip install datto

Overview

datto is a package with various data tools to help in data analysis and data science work.

You can find the documentation here.

Some examples of what you can do:

  • Remove links from some text
  • Extract body of an email only (no greeting or signature)
  • Easily load/save data from S3
  • Run SQL from Python
  • Explore data - check for mistyped data, find correlated data
  • Assign a given user to an experimental condition
  • Create an HTML dropdown from a DataFrame
  • Find the most common phrases by a category
  • Classify free text responses into any number of meaningful groups (e.g. find survey themes)
  • Make a simple Python logger with default options
  • Take some data and test a bunch of machine learning models on it
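As a rough illustration of the first item in the list above, removing links from text can be sketched with the standard library alone. This is not datto's API: the `remove_links` helper and the `URL_PATTERN` regular expression are hypothetical names chosen for this example, and the pattern only covers `http://`, `https://`, and `www.` links.

```python
import re

# Hypothetical helper for illustration only; this is NOT datto's API.
# Matches runs of non-whitespace that start with http://, https://, or www.
URL_PATTERN = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)


def remove_links(text: str) -> str:
    """Strip links from text, then collapse the whitespace left behind."""
    without_links = URL_PATTERN.sub("", text)
    return re.sub(r"\s+", " ", without_links).strip()


if __name__ == "__main__":
    print(remove_links("Visit https://example.com now"))
```

Because the pattern consumes everything up to the next whitespace, punctuation attached to the end of a link is removed along with it. For the documented behavior of the package itself, see the documentation linked above.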

For detailed examples of how to use it, check out this Jupyter notebook.
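One of the tasks listed above, assigning a given user to an experimental condition, is commonly done by hashing the user ID so that the same user always lands in the same condition. The sketch below shows that general technique under stated assumptions. It is not taken from datto: `assign_condition`, its parameters, and the `salt` default are hypothetical.

```python
import hashlib
from typing import Sequence


def assign_condition(user_id: str, conditions: Sequence[str], salt: str = "experiment-1") -> str:
    """Deterministically map a user ID to one of the given conditions.

    Hypothetical helper for illustration only; this is NOT datto's API.
    The salt lets different experiments bucket the same user independently.
    """
    if not conditions:
        raise ValueError("conditions must not be empty")
    digest = hashlib.sha256(f"{salt}:{user_id}".encode("utf-8")).hexdigest()
    # Interpret the hex digest as an integer and reduce it to a list index.
    return conditions[int(digest, 16) % len(conditions)]


if __name__ == "__main__":
    print(assign_condition("user-42", ["control", "treatment"]))
```

Hashing, rather than calling a random number generator, means no assignment table has to be stored: recomputing the hash for the same user ID and salt always yields the same condition.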

