limphilip/soen498-diamond-dataset-analysis

SOEN 498 Diamond Dataset Analysis

Techniques

Dataset analysis using:

  1. ALS collaborative filtering for cut prediction (ALS price prediction was tried only as an experiment and is outside the scope of this project)
  2. Random forests to perform cut prediction and price prediction

Team members

  • Ideawin-Bunthy Koun (26314155)
  • Philip Lim (27485506)

Dataset

Diamond dataset taken from: https://www.kaggle.com/shivam2503/diamonds/data
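
Both techniques need numeric inputs, while the dataset stores cut, color and clarity as text. The sketch below shows one possible way to turn rows into numeric features plus a cut label and a price target. It is an illustration only, not the project's actual preprocessing: the function names, the ordinal encoding and the choice of feature columns are assumptions, and the two sample rows are merely written in the dataset's column layout.

```python
import csv
import io

# Category levels ordered from worst to best (assumed ordinal encoding).
CUT_LEVELS = ["Fair", "Good", "Very Good", "Premium", "Ideal"]
COLOR_LEVELS = ["J", "I", "H", "G", "F", "E", "D"]
CLARITY_LEVELS = ["I1", "SI2", "SI1", "VS2", "VS1", "VVS2", "VVS1", "IF"]

# Columns that are already numeric in the CSV.
NUMERIC_COLUMNS = ["carat", "depth", "table", "x", "y", "z"]


def encode_row(row):
    """Return (features, cut_label, price) for one CSV row given as a dict."""
    features = [float(row[name]) for name in NUMERIC_COLUMNS]
    # Append color and clarity as their position in the ordered level lists.
    features.append(float(COLOR_LEVELS.index(row["color"])))
    features.append(float(CLARITY_LEVELS.index(row["clarity"])))
    cut_label = CUT_LEVELS.index(row["cut"])  # target for cut prediction
    price = float(row["price"])               # target for price prediction
    return features, cut_label, price


def load_diamonds(handle):
    """Encode every row of an open CSV file (or file-like object)."""
    return [encode_row(row) for row in csv.DictReader(handle)]


# Illustrative rows in the dataset's column layout, used here instead of the real file.
SAMPLE_CSV = """carat,cut,color,clarity,depth,table,price,x,y,z
0.23,Ideal,E,SI2,61.5,55,326,3.95,3.98,2.43
0.21,Premium,E,SI1,59.8,61,326,3.89,3.84,2.31
"""

rows = load_diamonds(io.StringIO(SAMPLE_CSV))
for features, cut_label, price in rows:
    print(features, CUT_LEVELS[cut_label], price)
```

To run this on the downloaded file, pass an open file handle instead, for example load_diamonds(open("diamonds.csv", newline="")), where the file name is an assumption. Any extra index column in the file is ignored because only named columns are read.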

Environment setup

This dataset analysis was performed on the orwell.encs.concordia.ca machine. Run source env.sh in the project folder to load the software required to run the code.