CPU Bound Programs
In the previous course on large data sets in pandas, we covered the idea of memory limitations, and figured out some strategies for overcoming them with pandas. As a quick refresher, a memory limitation is when a dataset won't fit into the memory available on your computer. When this happens, we need to rely on workarounds — approaches like processing the data in batches or relying on tools like SQLite that keep the data on disk instead of in memory while doing processing.
In this course, we'll cover the idea of program bounds and how they affect code performance. A program bound is similar to a limitation in that it affects how you're able to process your data. However, a program bound isn't a hard limitation -- if your program is bound, your computer will still be able to eventually process the data. A program bound mostly limits how quickly the program can be executed.
In this lesson, we'll learn more about CPU-bound programs, and how we can understand and improve CPU performance. While learning more about CPU-bound programs, we'll be working with a data set of search terms and matching products.
As you work through understanding CPU performance, you’ll get to apply what you’ve learned from within your browser so that there's no need to use your own machine to do the exercises. The Python environment inside of this course includes answer checking so you can ensure that you've fully mastered each concept before learning the next concept.
1. Bounds vs Limitations
2. The Dataset
3. Finding duplicate values
4. Big O notation
6. Timing code runs
7. Stable time estimates
9. Alternate profiling strategies
10. Alternate profiling strategies
11. Practicing writing efficient algorithms
12. Big O Notation practice
13. Next Steps
CPU Bound Programs
Learn how to process data more quickly be being aware of CPU bounds.
- 1Bounds vs. Limitations
- 2The Dataset
- 3The Dataset
- 4Finding duplicate values
- 5Finding duplicate values
- 6Big O notation
- 8Timing code runs
- 9Stable time estimates
- 11Alternate profiling strategies
- 12Alternate profiling strategies
- 13Practicing writing efficient algorithms
- 14Big O Notation practice
- 15Next Steps
I made a list of the skills I had to acquire to become a Data Scientist, after looking around it was clear that Dataquest was the quickest way to learn them. Dataquest guided projects showed me exactly what to do.