Skip to content

dquyen/STAT_5014_2018

 
 

Repository files navigation

title author date output header-includes
Stat 5014
Bob Settlage
August 2018
html_document fig_caption fig_height fig_width pdf_document classoption
default
true
6
7
landscape
\setlength\parindent{24pt} \usepackage{MnSymbol} \usepackage{mathrsfs}

This course is an introduction to computing for statistics. Recently, open source platforms have proliferated and are becoming the de facto standard for Data Scientists. R and Python are arguably the two most important languages in a Data Scientists toolbox. In this class, we introduce both languages with a focus on R. We will use Markdown and touch on notebook style analysis for enabling and performing Reproducibile Research. Throughout the course, we will use $LaTeX$ via Markdown for type setting. In many collaborative environments, version control and collaborative development is a crucial technology and concept. Here, we will use Git as the backbone of the course for homework submission, collaborative learning via code sharing, and crowd sourced design. Finally, we end the class with a discussion of GUIs usesful in discussions with collaborators, e.g. BlueSky, Orange, JMP, and SAS, especially in looking at potential data issues.

Course learning objectives:

  • Good programming practices
  • Reproducible research concepts
  • Data cleaning and munging
  • R programming
  • Git fundamentals
  • Markdown
  • Python basics

About

site for STAT5014 course material for 2018

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 100.0%