Skip to content

LaDracG/Cloud-Computing-Fa2018-MapReduce

Repository files navigation

Cloud-Computing-Fa2018-MapReduce

MapReduce Assignment

Defined class mapreduce, with functions wordCount, doubleWordCount and findFreq, dealing with problems 1, 2 and 3, respectively.

Used AWS EMR service to run the program on Spark, and saved output as text files using saveAsTextFile function.

Link to AWS output data: Problem1: https://s3.console.aws.amazon.com/s3/buckets/aws-logs-815402967860-us-east-2/elasticmapreduce/JY-CC/Assignment1/wordCountResult

Problem2: https://s3.console.aws.amazon.com/s3/buckets/aws-logs-815402967860-us-east-2/elasticmapreduce/JY-CC/Assignment1/doubleWordCountResult

Problem3: https://s3.console.aws.amazon.com/s3/buckets/aws-logs-815402967860-us-east-2/elasticmapreduce/JY-CC/Assignment1/findFreqResult

About

MapReduce Assignment

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages