Do you want to analyse data in a structured, documented and well organized way? Then you want to learn R – and RStudio – a competitive and modern data science environment and programming language. With an unprecedented back catalogue of packages, R is extremely versatile, and it has statistical methods available for most tasks. R is, moreover, suitable for automatizing boring repetitive tasks, for making sure that your analyses are correct and reproducible, and for customizing your analysis to your needs.
You will learn to build a complete data analysis pipeline in R. This includes learning R programming techniques for:
In addition to the technical programming skills, you will also learn a conceptual framework for data analysis, where all steps of the data analysis are automatized via a programmatic pipeline.
The course is based on RStudio and a collection of modern R packages. The focus will be on learning to exploit the full potential of these tools, which can serve as an infrastructure for almost any perceivable data analysis in R. Generalized additive models will be treated as a non-trivial example of how to build a predictive regression model in R.
With this course you will learn to build a complete data analysis pipeline in R.