tianyic/Bigdata_proj_yanif

Scalable Machine Learning

### Prerequisites

  • numpy
  • Pandas
  • spyLearn
  • Scikit-Learn
  • PIL
  • matplotlib
  • pyspark

### How to run

  • Set up your Spark path

    • Export SPARK_HOME

        export SPARK_HOME=YOURPATH/spark-1.1.1-bin-hadoop2.4
      
    • Add PySpark to PYTHONPATH

        export PYTHONPATH=$SPARK_HOME/python/:$PYTHONPATH
      
  • Place the data file in the same directory as the scripts

  • Run the PCA script

    • python pca.py
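
Before running the script, it can help to confirm that the two exports above took effect. The following is a minimal sketch, not part of this repository: the helper name `check_spark_env` and its messages are hypothetical, and it only checks that `SPARK_HOME` is an existing directory and that `$SPARK_HOME/python` appears on `PYTHONPATH`.

```python
import os


def check_spark_env(env=None):
    """Return a list of problems found in the Spark environment setup.

    `env` is any dict-like mapping of environment variables; it defaults
    to os.environ. An empty list means the basic checks passed.
    """
    env = os.environ if env is None else env
    problems = []

    spark_home = env.get("SPARK_HOME", "")
    if not spark_home:
        problems.append("SPARK_HOME is not set")
        return problems
    if not os.path.isdir(spark_home):
        problems.append("SPARK_HOME does not point to a directory: " + spark_home)
        return problems

    # The README adds $SPARK_HOME/python/ to PYTHONPATH; compare normalized
    # paths so a trailing slash does not matter.
    expected = os.path.normpath(os.path.join(spark_home, "python"))
    entries = [
        os.path.normpath(p)
        for p in env.get("PYTHONPATH", "").split(os.pathsep)
        if p
    ]
    if expected not in entries:
        problems.append("PYTHONPATH does not include " + expected)
    return problems
```

Calling `check_spark_env()` with no arguments inspects the current shell environment; any returned message points at the export that still needs fixing.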

### Sample result plot

PCA result
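
For readers who want to see what a PCA projection computes, here is a minimal single-machine sketch using NumPy only. It is an illustration under stated assumptions and not the repository's `pca.py`, which may differ (for example by distributing the computation with Spark). The function name `pca` and the `n_components` parameter are hypothetical.

```python
import numpy as np


def pca(X, n_components=2):
    """Project the rows of X onto the top principal components.

    Returns (projected, components, explained_variance).
    """
    X = np.asarray(X, dtype=float)
    # Center each feature so the components describe variance around the mean.
    Xc = X - X.mean(axis=0)
    # Rows of Vt are the principal directions, ordered by singular value.
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components]
    # Coordinates of each sample in the reduced space.
    projected = np.dot(Xc, components.T)
    # Sample variance captured along each retained component.
    explained_variance = (S[:n_components] ** 2) / (X.shape[0] - 1)
    return projected, components, explained_variance
```

The first two columns of `projected` are the kind of values one would typically scatter-plot with matplotlib to visualize a PCA result.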

About

hadoop, django, spark
