
✏️ Information
Welcome to Data Science with R, where we outline a course of study for learning to use the programming language R for data science and statistical modelling.
The aim of this course is to provide a thorough introduction to programming in R for individuals who may have never programmed before. First, we outline how to install R on our operating system, how to download and use an Integrated Development Environment (IDE) such as RStudio, Positron or VS Code, and how to install packages on our system. We will then look at various ways to explore, manage and analyze data using both the built-in functionality of R and available libraries such as tidyverse.
✍️ Notes
This course is based fundamentally on the PSTAT10 Data Science Principles class I have taught in the past at UCSB. The course is split into the following topics:
Installing and using R and RStudio.
Operators, Logic & Data Types.
Atomic Data Structures.
Dataframes and Lists.
Functions.
Looping and Branching.
Fundamentals of Probability Theory.
Simulation and MC Methods.
Data Handling with the tidyverse package.
Plotting with ggplot2.
SQL Basics.
SQL Aggregation and Joins.
📚 Materials
Each topic links to a website post with the relevant material. A PDF copy of the combined course notes can be downloaded here. Furthermore, each website post links to the corresponding YouTube video going through the material.
For this course you will need to download the language R, your chosen IDE and Quarto using the following links:
Some helpful resources and additional guides are linked below:
Past Teaching Material
- PSTAT10 Summer 2025
🙏 Thanks
If you found any of this material helpful, consider buying me a coffee, but only if you can afford to! Thank you for visiting this course page. 😊