“I was immensely impressed by how well the math is taught on Dataquest, anyone can learn it. Only the most important concepts are used so you don’t waste time.”

Ashray Adappa

Data Analyst Consultant @Fractal

Path overview

In this path, you’ll learn the fundamentals of R and build upon them with more advanced skills. You’ll learn how to use RStudio, applications and tools, tidyverse, DataFrames, tibbles, operators, expressions, and much more — as well as data visualization, graphs, plots, and charts.
Best of all, you’ll learn by doing — you’ll write code and get feedback directly in the browser. You’ll apply your skills to several guided projects involving realistic business scenarios to build your portfolio and prepare for your next interview.

Key skills

  • Programming with R to perform complex statistical analysis of large datasets
  • Performing SQL queries and web-scraping to explore and extract data from databases and websites
  • Performing efficient data analysis from start to finish
  • Building insightful data visualizations to tell stories

Path outline

Part 1: Introduction to R [4 courses]

Introduction to Data Analysis in R 3h

  • Define R programming syntax
  • Define variable use and naming rules
  • Perform calculations using arithmetic operators

Data Structures in R 6h

  • Create a data structure
  • Index a data structure
  • Perform operations over a data structure

Control Flow, Iteration, and Functions in R 4h

  • Employ control flow with if-else statements
  • Replicate your code using iteration
  • Write functions

Specialized Data Processing in R 4h

  • Manipulate strings from the stringr package
  • Manipulate strings from the lubridate package
  • Employ the map function from the purrr package

Part 2: Data Visualization in R [1 course]

Introduction to Data Visualization in R 4h

  • Visualize changes over time using line graphs
  • Analyze data distributions using histograms
  • Compare groups using bar charts and box plots
  • Identify the relationships between variables using scatter plots

Part 3: Data Cleaning in R [2 courses]

Introduction to Data Cleaning in R 7h

  • Manipulate DataFrames
  • Define relational data
  • Resolve missing data
  • Reshape data using the tidyr package

Advanced Data Cleaning in R 6h

  • Employ regular expressions to clean and manipulate text data
  • Employ the map and anonymous functions
  • Resolve missing data

Part 4: Working with Data Sources [4 courses]

SQL Fundamentals 5h

  • Analyze data using SQL
  • Organize data using SQL
  • Write SQL queries to estimate summary statistics

Intermediate SQL in R 4h

  • Query data across multiple tables
  • Answer business questions using SQL
  • Define table relations

Introduction to APIs in R 3h

  • Query external data sources using an API
  • Query using an API with authentication

Introduction to Web Scraping in R 3h

  • Scrape data from the web
  • Identify tools for complex web pages

Part 5: Probability and Statistics [5 courses]

Introduction to Statistics in R 5h

  • Sample data using simple random sampling, stratified sampling, and cluster sampling
  • Measure variables in statistics
  • Build, visualize, and compare frequency distribution tables

Intermediate Statistics in R 2h

  • Summarize a distribution using the mean, the weighted mean, the median, or the mode
  • Measure the variability of a distribution using the variance and the standard deviation
  • Compare values using z-scores

Introduction to Probability in R 1h

  • Estimate theoretical and empirical probabilities
  • Define the fundamental rules of probability
  • Identify combinations and permutations

Conditional Probability in R 2h

  • Assign probabilities based on conditions
  • Assign probabilities based on event independence
  • Assign probabilities based on prior knowledge
  • Create spam filters using multinomial Naive Bayes

Hypothesis Testing in R 1h

  • Implement probability density functions
  • Create testable hypotheses
  • Decide which hypotheses to support based on your data

Part 6: Predictive Modeling and Machine Learning in R [2 courses]

Linear Regression Modeling in R 2h

  • Define predictive modeling
  • Build linear regression models
  • Interpret linear regression models
  • Assess model fit and accuracy

Introduction to Machine Learning in R 1h

  • Identify a proper machine learning workflow
  • Implement the k-nearest neighbors algorithm
  • Employ the caret library

Part 7: Shiny Applications in R [1 course]

Introduction to Interactive Web Applications in Shiny 2h

  • Read the structure of a Shiny app
  • Program inputs and outputs in a Shiny interface
  • Extend your Shiny apps

The Dataquest guarantee


Dataquest has helped thousands of people start new careers in data. If you put in the work and follow our path, you’ll master data skills and grow your career.


We believe so strongly in our paths that we offer a full satisfaction guarantee. If you complete a career path on Dataquest and aren’t satisfied with your outcome, we’ll give you a refund.

Master skills faster with Dataquest

Go from zero to job-ready

Go from zero to job-ready

Learn exactly what you need to achieve your goal. Don’t waste time on unrelated lessons.

Build your project portfolio

Build your project portfolio

Build confidence with our in-depth projects, and show off your data skills.

Challenge yourself with exercises

Challenge yourself with exercises

Work with real data from day one with interactive lessons and hands-on exercises.

Showcase your path certification

Showcase your path certification

Impress employers by completing a capstone project and certifying it with an expert review.

Projects in this path

Project: Install RStudio

Learn how to install and use RStudio, a free and open-source development environment for R.

Guided Project: Investigating COVID-19 Virus Trends

Learn to combine the skills you learned in this course to perform practical data analysis.

Guided Project: Creating An Efficient Data Analysis Workflow

Apply control flow, loops and functions to create a reusable data workflow.

Guided Project: Creating An Efficient Data Analysis Workflow, Part 2

Employ even more programming techniques to create a reusable data workflow.

Guided Project: Analyzing Forest Fire Data

Use data visualization techniques to explore data on forest fires.

Plus 14 more projects

Build your project portfolio with the Data Analyst in Python path.

Learning resources

In this guide, you'll learn how to install PostgreSQL 1...
Read Article
In this tutorial, you'll learn how to install PostgreSQ...
Read Article
In this tutorial, you'll learn how to install PostgreSQ...
Read Article
There are lots of great reasons to learn Microsoft Powe...
Read Article

Grow your career with

of learners recommend
Dataquest for career advancement
Dataquest rating on
G2Crowd and SwitchUp
Average salary boost
for learners who complete a path

Aaron Melton

Business Analyst at Aditi Consulting

“Dataquest starts at the most basic level, so a beginner can understand the concepts. I tried learning to code before, using Codecademy and Coursera. I struggled because I had no background in coding, and I was spending a lot of time Googling. Dataquest helped me actually learn.”


Jessica Ko

Machine Learning Engineer at Twitter

“I liked the interactive environment on Dataquest. The material was clear and well organized. I spent more time practicing then watching videos and it made me want to keep learning.”


Victoria E. Guzik

Associate Data Scientist at Callisto Media

“I really love learning on Dataquest. I looked into a couple of other options and I found that they were much too handhold-y and fill in the blank relative to Dataquest’s method. The projects on Dataquest were key to getting my job. I doubled my income!”

Join 1M+ data learners on


Sign up for a free account


Choose a course or path


Learn with hands-on exercises


Apply your skills

Start learning with a free account today.