sunxichen/e63-coursework

 
 

CSCI E-63 Big Data Analytics

Professional Graduate Data Science Coursework - Fall semester 2017

Professor: Zoran B. Djordjević, PhD, Senior Enterprise Architect, NTT Data, Inc.

Description:

The emphasis of this course is on mastering two important big data technologies: Spark 2 and TensorFlow. The focus is on Spark Core, Spark ML (machine learning), and Spark Streaming, which allows analysis of data in flight, that is, in near real time. The course also examines NoSQL storage solutions, exemplified by Cassandra. An additional focus lies on memory-resident databases, graph technologies (Spark GraphX and Neo4j), and scalable messaging systems such as Kafka and Amazon Kinesis.

File Layout

The hw directory structure is as follows:

DIRECTORY    DESCRIPTION
.            Files such as README and .gitignore
./docs/      Assorted documents and presentations
./data/      All the necessary data
./scripts/   All the code

Website

You can access all the coursework etc. here.

About

CSCI E-63: Big Data Analytics course taught at Harvard - Fall semester 2017


Languages

  • HTML 57.0%
  • Jupyter Notebook 40.1%
  • Java 0.7%
  • Python 0.7%
  • Scala 0.6%
  • Shell 0.4%
  • Other 0.5%