Skip to content

tommysiu/udacity-data-analyst

Repository files navigation

Udacity Data Analyst Nanodegree Program

This repository hosts all my Udacity Data Analyst Nanodegree projects.

Certificate

Project 1 - Analyzing the NYC Subway Dataset

A Python project to find out whether there is statistical difference of NYC subway ridership when it rains and when it does not. It involves the usage of some basic statistical tests, linear regression and visualization techniques.

Project 2 - Open Street Map Data Wrangling

A Python+MongoDB project to clean up a part of the OpenStreetMap data. The region used in this project is Hong Kong.

Proejct 3 - Data Analysis with R

A project using R to do some basic exploratory data analysis on a red wine dataset.

Project 4 - Identify Fraud from Enron Email

A machine learning project using Python to build a model from existing Enron Fraud training dataset such that we can predict if an individual was a "Person of Interest" (POI) in the fraud by his/her financial and email data.

Project 5 - Data Visualization and D3

A HTML+D3.js project to explore a dataset containing 113,937 loan records from Prosper. The visualization focuses on the relationship between the original loan amount, state of address of the borrowers and the year of original loan.

Project 6 - A/B Testing

A project that demonstrates different considerations when doing a A/B testing, including invariant and evaluation metrics selection, sizing and power, sanity checks, effect size tests and sign tests.

About

Udacity Data Analyst Nanodegree

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages