Course

You'll learn how to:

Start this course today

Build hands-on data skills with interactive exercises and projects.

Sign up

About this course

Although there are many datasets available in convenient formats, there’s also a lot of data that’s stored in tougher-to-access ways, like as a table on web page. To get at this data, we’ll need to use web scraping. In R, we can do that with the rvest scraping package.

In this course, you’ll learn how to scrape the web using R and rvest. You’ll start by learning the basics of web page structure, and a little about HTML and CSS. You’ll also learn how to get the code from a page into your R workflow for further parsing and cleaning.

Then, you’ll dig deeper into scraping, learning to use the CSS Selector to get precisely the data you want (and none of the other content or code you don’t).

The course wraps up with a challenge that’ll get you more familiar with how web scraping can work across different sites. Then, you’ll dive into a guided project that’ll help you put your own unique spin on web-scraped data analysis.

By the end of this course, you’ll be able to:

  • Scrape data from a web page using R and rvest.
  • Parse sites using the CSS Selector to get only the data you need.
  • Tackle more complex web scraping challenges.

Lessons in this course

Loading lessons....

Thousands of learners have changed their careers with Dataquest

97%

Learners who recommend
Dataquest for career advancement

4.9 stars

Dataquest rating on
G2Crowd and SwitchUp

$30k

Average salary boost
for learners who complete a path

Join a community of 1M+ data learners on Dataquest

1

Sign up for a free account

Get access to hundreds of free lessons.

2

Choose a course or path

Start anywhere, from beginner topics to advanced concepts.

3

Learn with hands-on exercises

Learn with real data and build your experience.

Apply your skills

Create projects, build your portfolio, and build your career.

Sign up today

or