Although there are many datasets available in convenient formats, there’s also a lot of data that’s more difficult to access, like a table on a web page. To get this data, we’ll need to use web scraping. In R, we can do that with the rvest scraping package.
In this course, you’ll learn about web page structure, including the basics of HTML and CSS. You’ll also learn how to get the code from a page into your R workflow for further parsing and cleaning. Then, you’ll dig deeper into scraping, learning to use CSS selectors to extract precisely the data you want.
Best of all, you’ll learn by doing: you’ll practice and get feedback directly in the browser. At the end of the course, you’ll complete a guided project that asks you to use web scraping to analyze movie ratings.
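To give a rough sense of the workflow described above, here is a minimal sketch using rvest. The HTML snippet, the `title` class, the `ratings` id, and the movie names are made up for illustration and are not taken from the course. The page is inlined as a string so the example runs without a network connection; for a real page you would pass a URL to `read_html()` instead.

```r
library(rvest)

# Hypothetical page content, inlined as a string (an assumption for this sketch).
html <- '<html><body>
  <h1 class="title">Movie ratings</h1>
  <table id="ratings">
    <tr><th>Title</th><th>Rating</th></tr>
    <tr><td>Movie A</td><td>8.1</td></tr>
    <tr><td>Movie B</td><td>6.4</td></tr>
  </table>
</body></html>'

# Step 1: parse the HTML into a document that R can query.
page <- read_html(html)

# Step 2: use CSS selectors to pick out elements.
# ".title" matches by class; "#ratings" matches by id.
heading_node <- html_element(page, ".title")
table_node <- html_element(page, "#ratings")

# Step 3: extract the text and convert the table into a data frame.
heading <- html_text2(heading_node)
ratings <- html_table(table_node)

print(heading)
print(ratings)
```

Once the table is a data frame, it can be cleaned and analyzed like any other dataset in R.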
Dataquest has helped thousands of people start new careers in data. If you put in the work and follow our course, you'll master data skills and grow your career.
We believe so strongly in our courses that we offer a full satisfaction guarantee. If you complete a career course on Dataquest and aren't satisfied with your outcome, we'll give you a refund.
Learn exactly what you need to achieve your goal. Don’t waste time on unrelated lessons.
Build confidence with our in-depth projects, and show off your data skills.
Work with real data from day one with interactive lessons and hands-on exercises.
Impress employers by completing a capstone project and certifying it with an expert review.