✏️ Information
Welcome to Data Science with R, an introductory course on using the programming language R for data science and statistical modelling.
The aim of this course is to provide a thorough introduction to programming in R for individuals who may never have written or used a programming language before. First, we outline how to install R on our operating system, how to download and use an Integrated Development Environment (IDE) such as RStudio, Positron or VSCode, and how to install packages on our system. We will then look at various ways to explore, manage and analyze data using both the built-in functionality of R and available libraries such as tidyverse.
✍️ Topics
This course is based on the PSTAT10 Data Science Principles class that I previously taught at UCSB. The course is split into the following topics:
R Basics IV - Functions
R Basics V - Looping and Branching
Basic Simulation with R
Data Handling with the tidyverse package
Plotting with ggplot2
SQL Basics
SQL Aggregation and Joins
📚 Materials
Each topic links to a website post with the relevant material. A PDF copy of the combined course notes can be downloaded here. Each website post also links to the corresponding YouTube video that goes through the material.
For this course you will need to download the language R, your chosen IDE and Quarto using the following links:
Some helpful resources and additional guides are linked below:
If you found any of this material helpful, consider buying me a coffee, but only if you can afford to! Thank you for visiting this course page. 😊