Skip to content

sp3006/Udacity-Nano_Degree_Data_Engineering

 
 

Repository files navigation

Udacity - Nano Degree : Data Engineering

This is a highly intensive learning program - atleast for me

Core Curriculum

  1. Data Wrangling ( Completed on Sept 12, 2019 )

    • Project 1 : Wrangle and Analyze Data
      Technicals : Python, Juypter, Twitter API
  2. Data Modeling

    • Project 2 : Data Modeling with Postgres ( Completed on Sept 18, 2019 )
      Technicals : Python, Juypter, Postgres(psycopg2)

    • Project 3 : Data Modeling with Apache Cassandra ( Completed on Sept 22, 2019 )
      Technicals : Python, Juypter, Casandra(psycopg2)

  3. Cloud Data Warehouses ( Completed on October 17, 2019 )

    • Project 4 : Data Warehouse
      Technicals : Python, Juypter, Geopy, AWS(S3, Redshift, IAM)
  4. Data Lakes with Spark ( Completed on November 5, 2019 )

    • Project 5 : Data Lake
      Technicals : Python, Jupyter, Spark, AWS(EMR, S3, EMR Notebooks, EC2, Athena)
  5. Data Pipelines with Airflow ( Completed on December 19, 2019 )

    • Project 6 : Data Pipelines
      Technicals : Python, Redshift, Airflow, S3
  6. Capstone Project ( Completed on February 23, 2020 )

    • Project 7 : Data Engineering Capstone Project
      Technicals : Python, Redshift, S3, Pyspark

Udacity Data Engineering Nano Degree Certificate



Who are Data Engineers ?

They are software professionals who can collect, assess, design, build, clean and intergrate data from various sources a.k.a people who are highly skilled, experienced, able to work in pressurized workplace, sportive to see lots of errors, able to find workaround for problems, capable of redesigning the entire system because they have found a efficient way to do the same thing, keep continously reskilling themselves, self-managed/self-disciplined , agile by nature, addicted programmers basically people who dream about vacations which they always miss.


About

My Udacity Data Engineer Nano Degree Projects aka Udacity DEND

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 81.6%
  • PLpgSQL 10.8%
  • TSQL 6.4%
  • Python 1.1%
  • Other 0.1%