In our Parallel Processing course, you’ll learn how to improve the performance of your code by processing data in parallel rather that iterating through rows sequentially.
You’ll learn how to start multiple processes and run functions on multiple processes at the same time, as well as share data between multiple processes. You’ll learn how use a process pool executor, practicing all of these skills as you dig into some data about the demand for data engineering jobs.
Then you’ll learn about MapReduce. You’ll learn how to use process pools, how to actually implement MapReduce, and how to effectively process data with it.
At the end of the course, you’ll complete a project using your new skills that challenges you to dig into data from Wikipedia pages and analyze them quickly and efficiently using MapReduce.
In this parallel processing course, you will:
- Learn how to process data in parallel
- Learn how to implement MapReduce
- Learn how to solve problems with MapReduce
Parallel Processing Lessons List
Learn how to start and share data between multiple processes.
Learn how to gather the results of a function that is executed on several processes.
Learn how to implement MapReduce and use process pools.
Practice working with the MapReduce framework.
Use threads and MapReduce to analyze Wikipedia pages more quickly.