Data Engineering Nanodegree -- Udacity

This is repository is for all of the work completed in the Udacity Data Engineering Nanodegree. This readme provides an overview and acts as a table of contents for the individual projects/submissions.

It is divided into two sections:

Data engineering for the fictional Sparkify app
Capstone project

SparkifyDB

A startup called Sparkify wants to analyze the data they've been collecting on songs and user activity on their new music streaming app. The following projects are data engineering projects to support their ongoing evloution of their business processes and app.

Projects

Data Modeling in Postgres. This data enigneering project is to create the infrastructure to support the Sparkify data analytics team.
Data Modeling in Cassandra (Jupyter notebook). This jupyter notebook was created to help the data analytics team collect and analyze song play data. The song play data is modeled in the notebook and placed in a Cassandra database.
Data Warehousing in AWS. This project supports the Sparkify initiative to move their data analytics and data to the cloud.
Data Lake in AWS. This project was to support the further evloution of the user base and data analytics requirements of the Sparkify team by transitioning their cloud data warehouse into a data lake.
Data pipelines with Airflow. This data pipeline was completed to further automate the ETL pipelines and monitoring of the data warehouses of Sparkify.

Capstone Project

This capstone project was a chance to combine all of the data engineering learnings of this program into one project.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
Data_Lake		Data_Lake
Data_Modeling_Cassandra		Data_Modeling_Cassandra
Data_Warehouse_in_AWS		Data_Warehouse_in_AWS
capstone_project		capstone_project
data_modeling_postgres		data_modeling_postgres
data_pipeline_with_airflow/home/airflow		data_pipeline_with_airflow/home/airflow
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data_Lake

Data_Lake

Data_Modeling_Cassandra

Data_Modeling_Cassandra

Data_Warehouse_in_AWS

Data_Warehouse_in_AWS

capstone_project

capstone_project

data_modeling_postgres

data_modeling_postgres

data_pipeline_with_airflow/home/airflow

data_pipeline_with_airflow/home/airflow

.DS_Store

.DS_Store

README.md

README.md

Repository files navigation

Data Engineering Nanodegree -- Udacity

SparkifyDB

Projects

Capstone Project

About

Releases

Packages

Languages

borbert/Data_Engineering_Nanodegree

Folders and files

Latest commit

History

Repository files navigation

Data Engineering Nanodegree -- Udacity

SparkifyDB

Projects

About

Topics

Resources

Stars

Watchers

Forks

Languages