basic Spark exercises
E1
for basic exercise
E2 data source
https://www.kaggle.com/c/stumbleupon/data
decision tree and Random Forest
E3
data source
same as E2
Logistic Regression
E4
data source https://tianchi.aliyun.com/datalab/dataSet.html?spm=5176.100073.0.0.eff86fc1Y9AsEE&dataId=649
User Behavior Data from Taobao for Recommendation use sparksql and dataframe
E5
data source
same as E4
FPGrowth