3 Mighty Good Reasons to Learn R for Data Science

Ahoy, mateys! Happy International Talk Like A Pirate Day!

You may think a pirate’s life sounds like fun, but it isn’t all buried treasure and singing yo-ho-ho. Pirates have lots to do:

  • Predicting profitability of plundering events based on crew size and ship features
  • Optimizing trade routes to avoid the law, storms, and other pirates
  • Analyzing large data sets containing millions of hints to hone in on buried treasure chests

Aye, you see, the best pirates are also data scientists. And, of course, they use R for their data science!

Here are some of our favorite reasons to learn R:

  1. Yarrr, statistics: R was originally designed by statisticians for doing statistical analysis, and it remains the programming choice of most statisticians today. R’s syntax makes it easy to create complex statistical models with just a few lines of code. Since so many statisticians use and contribute to R packages, you’re likely to be able to find support for any statistical analysis you need to perform, such as:
  • Looking for correlations between peg leg length and plundering ability to take the guesswork out of crew recruiting
  • Comparing the amount of treasure you found while plundering several islands to decide which one to return to using an analysis of variance model
  • Building a linear regression model to predict the amount of booty in a treasure chest based on its dimensions and burial depth
  1. Shiver me timbers! Data visualizations to impress: R’s streamlined syntax for creating visualizations allows you to quickly construct charts for exploratory data visualization, as well as create high-quality, publication-worthy visuals to communicate your findings:
  • You can make a fine map to buried treasure for yer ship’s captain using ggmap
  • Your crew will love seeing plunder results summarized using graphics made with ggplot2
  1. A swashbucklin’ community of data buccaneers: Like buried treasure, R is free, and anyone who knows where to look can download it and contribute to it in the form of packages. As a result, R’s capabilities are always growing as exciting new packages are added. For example:
  • You can take the “Blimey!” out of data manipulation and cleaning using the dplyr package
  • Using the purrr package can help you avoid repetition and write efficient code.

Speaking of R, did we mention that we just totally revamped and relaunched our introductory R courses? If you want to learn modern, production-ready R, look no further!