Skip to content

Elixeus/Machine-Learning-Project

 
 

Repository files navigation

MIT licensed

About our project

Our main goal of this project is to apply the machine learning techniques to data available in the urban context. For this project, we are particularly interested in the restaurant hygene condition in New York City. The Big Apple has more than 20,000 restaurants by July of 2015. But not every restaurant is ready to provide the clients with a service both safe and delicious. According to the Department of Health and Mental Hygene (DOHMH), only 72% of the restaurants among all the restaurants inspected had an A grade. So the food safety remains a concern for New Yorkers who choose to eat outside.

On the other hand, researchers has applied machine learning techniques to data available from other sources, especially mass-based restaurant recommendation software such as Yelp or Foursquare. See Jin and Leslie 2003, Jin and Leslie 2009, Kang et al. 2013. These previous research provided a new approach: combine the restaurant ratings and other features from the restaurant recommendation software data, and find correlation between these features and restaurant inspection grades. For our project, we use data provided by foursquare and try to establish correlation between ratings and inspection results. Also we would like to visualize the risky areas in New York and try to find spatial patterns. We would also like to run a spatial autocorrelation analysis to see if restaurants nearby affect each other's food hygene performance.

Why this repo exists

This is the repo for our Machine Learning final group project at CUSP, NYU. It serves as a platform of code version control, code exchange, code safehouse and reproducibility.

License:

This project is licensed under the MIT License.

TODO:

  • Scrape data from Foursquare
  • Select 2015 Inspection Data
  • Geocode Restaurant Addresses
  • Merge rating data with DOHMH inspection data
  • Neural network classification
  • Spatial autocorrelation analysis

About

Restaurant rating vs inspection

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 81.0%
  • Python 19.0%