Introduction to Machine Learning

This tutorial is modeled after a series of tutorials by Jake Vanderplas. The text, code, and licenses for those tutorials may be found here.

Ian Rose (Project Jupyter) February 1, 2019

This tutorial can be read and executed at https://tinyurl.com/la-ml-demo

What is a model?

We are often faced with a pile of data and little notion of what to do with it. We know that the data reflects some information about the real world, but not necessarily what that information is or how to get at it.

A model is a representation of a real-world process that we want to understand. Models can be qualitative or quantitative, complex or simple. Broadly speaking, they are useful in two ways:

They provide understanding. Models are simpler than the real world, and the specific "physics" of a model can often be interpreted in terms of plain language. Frequently the interpretability of a model is among the most attractive of its attributes.
They have predictive power. You can use a good model of the real world to predict the behavior of hypothetical data or data you have not seen before. You can use these predictions to provide guidance for future data collection, policy, and model refinements.

What is machine learning?

Machine learning is the process of building statistical models using computers. These models have tunable parameters (usually numbers), which are adjusted to fit existing data. You can then use that model to predict values from data that the model has not seen before.

There are many different classes of model, and many different methods of fitting and predicting, but they all follow this general pattern.
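
As a rough illustration of this fit-then-predict pattern, here is a minimal sketch in plain Python. It is not taken from the tutorial notebooks: the data values and the helper names fit_line and predict are hypothetical, and a straight line fit by ordinary least squares stands in for whatever model you might actually choose. The two tunable parameters are the slope and the intercept.

```python
# Hypothetical toy data, made up for illustration only.
xs = [1.0, 2.0, 3.0, 4.0]   # inputs the model is fit on
ys = [3.1, 4.9, 7.2, 8.8]   # observed values for those inputs


def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares.

    Returns the two tunable parameters as a (slope, intercept) tuple.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of co-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(params, x):
    """Use fitted parameters to predict a value for an unseen input."""
    slope, intercept = params
    return slope * x + intercept


# Step 1: adjust the parameters to fit the existing data.
params = fit_line(xs, ys)

# Step 2: predict a value for an input the model has not seen.
prediction = predict(params, 5.0)
print("fitted parameters:", params)
print("prediction for x = 5.0:", prediction)
```

Libraries such as scikit-learn wrap the same two steps behind fit and predict methods, so the shape of this sketch carries over even when the model is far more complex.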

This tutorial

This tutorial consists of several exercises that introduce the user to simple machine learning. They use the de facto standard Python scientific software stack, principally numpy, scipy, matplotlib, pandas, and scikit-learn.

First is an introduction to machine learning and some motivating examples:

Introduction and Motivation

Second is an introduction to scikit-learn, the workhorse of Python machine learning:

Introduction to scikit-learn

Finally, we perform an example analysis of ridership data for Metro bikeshare:

Metro Bikeshare
