This is a personal analysis of NYC taxi data, based on previous course group project. For data collection and cleaning, see the previous project
In this project, I want to explore more about the information hiding in these data. Moreover, I will learn to use pyspark as well and using these data as source. To make sure the pyspark code is right,the result of pyspark should be same as the result of hadoop MapReduce.
-
Notifications
You must be signed in to change notification settings - Fork 0
Geralt0714/TaxiDataAnalysis
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Advanced data mining
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published