In the previous courses of this path, we learned how to perform basic data analysis and data visualization. We also learned about some fundamental statistical metrics like the mean and the median, and we plotted histograms, bar graphs, and line plots.

In this first lesson of our Statistics Fundamentals course, we'll focus on the details of gathering data for analysis. As usual, we'll work with a real-world dataset. Before we dive into the technical details and start working with the data, we'll begin by getting a sense of what statistics is.

You will start this lesson by learning how to differentiate between a population and a sample, one of the foundational concepts of statistics. You will also learn how to use simple random sampling to select data from a population when conducting statistical tests for your data science work.

You will learn not only how to perform each sampling method, but also what the method is and how it works, so you can make informed decisions when sampling data.
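To make the population/sample distinction concrete before the lesson begins, here is a minimal sketch of simple random sampling using only Python's standard library. The population below is made-up illustrative data (the integers 1 to 1000), not the dataset used in the lesson, and the sample size and seed are arbitrary assumptions.

```python
import random
import statistics

# Hypothetical population: the integers 1..1000 (illustrative data only,
# not the real-world dataset this lesson works with).
population = list(range(1, 1001))

# A seeded generator so the sketch is reproducible; the seed is arbitrary.
rng = random.Random(0)

# Simple random sampling: draw 50 units without replacement, with every
# unit in the population equally likely to be chosen.
sample = rng.sample(population, k=50)

population_mean = statistics.mean(population)
sample_mean = statistics.mean(sample)

print("Population mean:", population_mean)
print("Sample mean:", sample_mean)
print("Difference:", population_mean - sample_mean)
```

The sample mean will usually land close to the population mean without matching it exactly, and changing the seed or the sample size changes how large that gap is.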

Knowing how and what to sample can be very useful. If you wish to learn about different sampling methods and how to strategically pick data points to look at, this lesson is definitely the place to start!

As you work through each concept, you'll apply what you've learned directly in your browser; there's no need to use your own machine to do the exercises. The Python environment in this course includes answer-checking to ensure you've fully mastered each concept before moving on to the next.


  • Learn about populations and samples.
  • Learn various sampling methods.

Lesson Outline

1. Introduction
2. Solving Problems with Statistics
3. Populations and Samples
4. Explore the Dataset
5. Sampling Error
6. Simple Random Sampling
7. Generating Numerous Random Samples
8. Visualizing Random Samples
9. The Importance of Sample Size
10. Recap
11. Takeaways