##Coursework for information retrieval including parts of the Final Project
###HW1 --- Process XML Data ###HW2 --- SQL Queries ###HW3 --- Store Twitter Stream in MongoDB and plot top 50 most common words in tweets... No stopwords used. ###HW4 --- Simple MapReduce Code for calculating top users with more than 2 posts ###HW5 --- Pig Script for processing a log file
###Final-Project --- MrJob code to process ~50 GB of Census data on AWS S3 using EMR across 15 EC2 Instances