About this course
Although there are many datasets available in convenient formats, there’s also a lot of data that’s stored in tougher-to-access ways, like as a table on web page. To get at this data, we’ll need to use web scraping. In R, we can do that with the rvest scraping package.
In this course, you’ll learn how to scrape the web using R and rvest. You’ll start by learning the basics of web page structure, and a little about HTML and CSS. You’ll also learn how to get the code from a page into your R workflow for further parsing and cleaning.
Then, you’ll dig deeper into scraping, learning to use the CSS Selector to get precisely the data you want (and none of the other content or code you don’t).
The course wraps up with a challenge that’ll get you more familiar with how web scraping can work across different sites. Then, you’ll dive into a guided project that’ll help you put your own unique spin on web-scraped data analysis.
By the end of this course, you’ll be able to:
- Scrape data from a web page using R and rvest.
- Parse sites using the CSS Selector to get only the data you need.
- Tackle more complex web scraping challenges.
Lessons in this course
Thousands of learners have changed their careers with Dataquest
Learners who recommend
Dataquest for career advancement
Dataquest rating on
G2Crowd and SwitchUp
Average salary boost
for learners who complete a path
Join a community of 1M+ data learners on Dataquest
Sign up for a free account
Get access to hundreds of free lessons.
Choose a course or path
Start anywhere, from beginner topics to advanced concepts.
Learn with hands-on exercises
Learn with real data and build your experience.
Apply your skills
Create projects, build your portfolio, and build your career.