Welcome to Data Science! We are building a global community of lifelong learners who are excited about using data to solve real world problems.
In this program, you’ll take on real world problems by analyzing data sets for insights and presenting findings using statistics, programming, data modeling, and business knowledge.
This is the official baseline site for General Assembly's Data Science Immersive Course. We're still in the process of assembling an amazing course, so be sure to check back regularly!
- v 1.0 Projects + Weeks 1-11 Complete: 6/21
- v 1.1 Revisions: 8/1
This course is designed to give you the deep dive into the world of Data Science, focusing on the ability to analyze and convey data-driven facts in order to predict what happens next using modeling and pattern recognition. Our course prepares students to take full-time roles as Data Analysis, Data Scientists, Business Intelligence Analysts, and other roles that require advanced fluency with data. Our projects immerse students in formal data-driven scenarios in order to help them create a polished portfolio of work showcasing their ability to create and communicate machine learning insights.
- Data Analysis & Python:
- Perform visual and statistical analysis on data using Python and its associated libraries and tools.
- Machine Learning & Modeling Techniques:
- Explore the differences between supervised and unsupervised learning through the application of various modeling techniques such as classification, regression, and clustering.
- Git, SQL, & Relational Databases:
- Gather, store, and organize your data using the data science toolkit: SQL, Git, and UNIX.
- Critical Thinking & Synthesis:
- Apply your analysis and modeling skills to real world data problems in fields like finance, marketing, and public policy.
- Visualization, Presentation, & Reporting:
- Learn to create reproducible presentations and reports and use data visualisation tools to present your findings to key stakeholders.
- Collect, extract, query, clean, and aggregate data for analysis
- Perform visual and statistical analysis on data using Python and its associated libraries and tools.
- Build, implement, and evaluate data science problems using appropriate machine learning models and algorithms
- Use appropriate data visualization tools to communicate findings
- Present clear and reproducible reports to stakeholders
- Identify big data problems and understand how distributed systems and parallel computing technologies are solving these challenges.
- Apply question, modeling, and validation problem solving processes to datasets from various industries to gain insight into real-world problems and solutions.
Please take at least 1 hour to read through the following on-boarding documents, in the order provided, to get a better understanding of your responsibilities as an instructor, student responsibilities, and the scope, sequence, and value proposition of this course. Each document links to the next at the bottom of the file!
Document | Description |
---|---|
Students | Student personas and course demographics |
Materials | What we provide and what you should build |
Format | Course syllabus and schedule |
Projects & Assessments | Course projects and grading expectations |
Expectations | Planning and communication responsibilities |
Technology | Tools used in this course |
Supplemental Resources | Common course issues and suggestions |
After reading these docs, we welcome you to jump into the #dsi-instructors
channel on Slack and join the conversation!
The structure of this repository provides a way for us to organize our information and resources.
We encourage the teaching team for each cohort to fork this repository directly, and use it to create resources for your own instance. Please make sure to submit new materials back to the master so we can share them with students and instructors world-wide!
If you have any questions about the organization of resources, or about the scope of our curriculum, feel free to open an issue.
Please check out our contributing guidelines for more details.
- All content is licensed under a CC-BY-NC-SA 4.0 license.
- All software code is licensed under GNU GPLv3. For commercial use or alternative licensing, please contact legal@ga.co.