This is a cloud computing project.
Goal: The purpose of this individual assignment is to learn how to develop parallel machine learning (ML) applications on the Amazon AWS cloud platform. Specifically, you will learn: (1) how to use Apache Spark to train an ML model in parallel on multiple EC2 instances; (2) how to use Spark’s MLlib to develop and use an ML model in the cloud; (3) how to use Docker to create a container for your ML model to simplify model deployment.
Input: TrainingDtaset.csv for training the model and validationDataset.csv for validating the model.
Output: The output of your application will be a measure of the prediction performance, specifically the F1 score, which is available in MLlib.
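To make the output metric concrete, here is a minimal sketch in plain Python of a support-weighted F1 score, which is what MLlib's MulticlassClassificationEvaluator reports when its metric name is set to "f1". The function name `weighted_f1` and the sample labels are assumptions made for illustration only; they are not part of the assignment, and the real application would obtain the score from MLlib rather than compute it by hand.

```python
from collections import Counter


def weighted_f1(labels, predictions):
    """Per-class F1 averaged by class support (hypothetical helper, not an MLlib API)."""
    if len(labels) != len(predictions):
        raise ValueError("labels and predictions must have the same length")
    if not labels:
        raise ValueError("at least one example is required")

    support = Counter(labels)            # number of true examples per class
    predicted = Counter(predictions)     # number of predictions per class
    true_pos = Counter(l for l, p in zip(labels, predictions) if l == p)

    total = len(labels)
    score = 0.0
    for cls, n in support.items():
        tp = true_pos[cls]
        precision = tp / predicted[cls] if predicted[cls] else 0.0
        recall = tp / n
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        score += f1 * n / total          # weight each class by its share of the data
    return score


if __name__ == "__main__":
    # Illustrative labels only; real values come from validationDataset.csv.
    y_true = [5, 5, 6, 6, 7]
    y_pred = [5, 6, 6, 6, 7]
    print(f"F1 score: {weighted_f1(y_true, y_pred):.4f}")
```

Weighting by support means that classes with more validation examples contribute more to the final score, so a model that does well only on rare classes will not score highly.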