Geralt0714/TaxiDataAnalysis

NYC Taxi Transportation Data Analysis

This is a personal analysis of NYC taxi data, based on a previous course group project. For data collection and cleaning, see the previous project.
In this project, I want to explore the information hidden in this data in more depth. I will also use the data as a source for learning PySpark. To make sure the PySpark code is correct, its results should match the results of the Hadoop MapReduce jobs.
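
One way to run that comparison is sketched below. It is a minimal illustration, not the code used in this repository. It assumes the MapReduce job wrote `key<TAB>value` text lines (Hadoop's default text output layout) and that the PySpark result has already been brought to the driver as a plain dict, for example with `collectAsMap()` on a pair RDD. The function names and the sample keys and values are hypothetical.

```python
import math


def parse_mapreduce_output(lines):
    """Parse 'key<TAB>value' lines into a dict of key -> float value."""
    result = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        key, value = line.split("\t", 1)
        result[key] = float(value)
    return result


def diff_results(mapreduce, spark, rel_tol=1e-9):
    """Return keys whose values disagree, mapped to (mapreduce_value, spark_value).

    A key present on only one side is reported with None for the missing value.
    """
    mismatches = {}
    for key in sorted(set(mapreduce) | set(spark)):
        a = mapreduce.get(key)
        b = spark.get(key)
        if a is None or b is None or not math.isclose(a, b, rel_tol=rel_tol):
            mismatches[key] = (a, b)
    return mismatches


# Hypothetical sample data, standing in for a part-r-00000 file and a
# PySpark result collected to the driver.
mr_lines = ["zone_a\t10", "zone_b\t25", "zone_c\t7"]
spark_result = {"zone_a": 10.0, "zone_b": 25.0, "zone_c": 8.0}

mismatches = diff_results(parse_mapreduce_output(mr_lines), spark_result)
for key, (mr_value, spark_value) in mismatches.items():
    print(f"{key}: MapReduce={mr_value} PySpark={spark_value}")
```

Comparing with a relative tolerance rather than strict equality is a deliberate choice here: averages and other floating-point aggregates can differ in the last digits between the two frameworks because the values are summed in a different order.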

About

Advanced data mining
