Learn data science with Dataquest

Why learn data science online?

Traditional education didn’t worked particularly well for me. When I graduated from college in 2008, I had a 2.1 GPA and still no real idea of what I wanted to do. I bounced from job to job, working for UPS, Pepsi, and the US Foreign Service. None of it clicked.

After an interest in predicting the stock market got me into data science, coding, and machine learning, I ended up in a job as a machine learning engineer at online education company edX.

The job appealed to me because edX was using technology to provide an alternative to the traditional education system. But the longer I worked there, the more I saw that the company’s approach was still anchored to the traditional lecture-based teaching format of the institutions that founded it (Harvard, MIT, and Berkeley).

Video Featured Image

Having learned data science on my own, I knew that it was possible for self-learning students to take a different path. I wanted to create a learning platform that combined the advantages of traditional education (like a well-thought-out curriculum sequence) with the advantages of self-study (like working at your own pace on projects you are personally interested in).

When I started Dataquest in 2015, my lesson was simple — help anyone learn data science, create projects they were passionate about, and use those projects to improve their careers and lives. Since then, we’ve had over 500,000 students take our courses and we've helped thousands start data science careers at companies including SpaceX, Accenture, and Amazon.
Vik Paruchuri Founder, Dataquest

Why are projects the best way to learn data science?

At Dataquest, our lessons are all built around answering real-world data science questions using real-world data. You’ll learn machine learning by predicting airline accidents, and you’ll learn statistics by finding the best players in the NBA.

Most courses also end with a guided project that asks you to synthesize the coding concepts you’ve just learned to solve data science problems just like a working data scientist would.

Working on these real-world projects is more interesting than answering practice problems or working with made-up data sets, and that helps keep students interested and motivated to learn. But there’s another advantage, too: building a portfolio of great data science projects is one of the best ways you can find a job and advance your data science career.

In fact, when we spoke to data science recruiters and hiring managers all over the world, we heard the same thing over and over again: data science portfolios and Github pages are among the first things they look at. Employers want to see if you can really do the job you’re being hired for, so having real-world projects to prove your skills you’re claiming on your resume is a must, particularly if you don’t have a fancy degree.

So while our projects are powerful learning tools that will keep you engaged and motivated as you work through our courses, they’re also a resource you can turn back to when it’s time to start the job hunt to showcase your skills to prospective employers.

What are the Paths?

One of the problems with online learning can be figuring out how to handle the sheer quantity of information that’s out there. Let’s say, for example, that you want to learn data science using Python. You should probably start with the basics of Python, and there are dozens if not hundreds of online courses out there that’ll teach you basic Python syntax.

But then what? Should you study some statistics next? Maybe try to learn `pandas` or `numpy`? Dive straight into machine learning? Focus on doing data viz? Whatever you choose, you may find that your next course requires some learning that wasn’t covered in your last course. Sifting through all of these choices and figuring out all the prerequisites is difficult and exhausting.

That’s why we created Paths: to help students go from zero to job-ready without having to waste time figuring out what they need to learn next.

Our data science courses are organized into four paths: Data Analyst in Python, Data Analyst in R, Data Scientist in Python, and Data Engineer. Each path is a sequence of courses, carefully arranged so that each course builds on the previous one. You won’t encounter prerequisites you don’t meet or have to search outside of our site for what to learn next. You can move smoothly from one course to the next and be confident that your learning is on the right track.

Of course, if you prefer a more a-la-carte learning style, that’s fine! The paths are totally optional and you can take all of our courses in any sequence you’d like. But many students find that the paths help them focus on actually learning and doing data science, since they don’t have to waste any time searching for courses, trying to figure out what skills they need, or Googling around to fill in prerequisites.

Why don’t we use video lectures?

Video lectures are the foundation of a lot of online courses, but to be frank, we don’t think they work very well.

First, with videos, there’s too much of a time gap between learning and doing. We find that most students learn best when they’re presented with a concept and allowed to apply and experiment with it immediately. That’s why our platform features a split browser window, with information presented on the left side and a live code environment on the right/ That way, there’s no gap at all between learning how to do something and actually doing it.

Second, videos waste too much of your time. When you’re asked to watch a 30-minute lecture before you put your hands on the keyboard and start coding, it’s inevitable that you’re going to miss or forget things. That means that when you start coding and hit a roadblock, you have to waste a lot of time scrubbing back and forth across the video or scanning through the transcript, trying to find the part that’s relevant. We think searching through text is faster, and with our platform’s search feature, it shouldn’t take more than a second or two to find exactly what you’re looking for.

Third, doing data science in the real world requires English-language reading skills. When you’re working as a data scientist, the documentation for that API you need to use or that package you’d like to learn is going to be available in text format, not as video lectures. So while you’re reading through our text-based lessons, you’re also getting practice reading code documentation and getting exposed to the terms and phrases you’re going to see in the documentation as a working professional.

Finally, we’ve found that videos can trick you into feeling like you learned something without actually teaching you. We’ve all had the experience of watching somebody else write and explain their code and thinking “That looks easy enough, I get it,” and then sitting down to reproduce it later and realizing it didn’t sink in as well as we thought. That’s the reason our courses are broken down into smaller lessons and then even smaller screens: we’ve found that going hands-on and applying each new concept as you learn it helps you retain the information better.

And of course, there’s plenty of science to suggest that the lecture format in general doesn’t work particularly well, whether it’s in video format or lectures in real life.

Meet our students

At any given moment, Dataquest has thousands of students who are learning from all over the globe. In just the past week, we’ve had sessions from 185 different countries, and our students include everyone from total coding newbies to experienced developers and data scientists looking to brush up or add to their skill set.

She found Dataquest when she realized she’d need Python programming skills to make the career change to data science that she wanted, and ended up finding an awesome job in Silicon Valley.
After being inspired by the data science room at the Miraikan museum and deciding to learn data science, he came to Dataquest, and now he has his dream job at SpaceX!
The projects on Dataquest gave me the confidence to know I could do this on my own. I created a website to showcase them and the companies I interviewed with really liked them.
Dataquest made learning programming less intimidating, by breaking projects down to simple terms. At Genentech, at least 20% of her time was spent using Python to combine data and perform predictive analytics.