In our Python Intermediate course and our Pandas and Numpy Fundamentals course, you've mostly been manipulating and working with data that are represented as tables. Microsoft Excel, the pandas library in Python, and the CSV file format used in our datasets were all developed around this representation.
Because a table neatly organizes values into rows and columns, we can easily look up specific values at the intersection of a row value and a column value. This makes tables great for data cleaning and some kinds of analysis. Unfortunately, it's very difficult to explore a dataset to uncover patterns when it's represented as a table, especially when that dataset contains many values. We need a different way to represent data that can help us identify patterns more easily.
In this lesson, you'll learn the basics of data visualization, a discipline that focuses on the visual representation of data.
As humans, our brains have evolved to develop powerful visual processing capabilities. We can quickly find patterns in the visual information we encounter, which was incredibly important from a survivability standpoint. Unfortunately, when data is represented as tables of values, we can't really take advantage of our visual pattern matching capabilities. This is because our ability to quickly process symbolic values (like numbers and words) is very poor. Data visualization focuses on transforming data from table representations to visual ones, making it much easier for humans to understand.
In this first lesson, we'll use Python, pandas, and matplotlib to create line charts to analyze data from the United States Bureau of Labor Statistics to visualize unemployment data and how it changes over time.
1. Representation Of Data
2. Introduction To The Data
3. Table Representation
4. Observations From The Table Representation
5. Visual Representation
6. Introduction to Matplotlib
7. Adding Data
8. Fixing Axis Ticks
9. Adding Axis Labels And A Title
10. Next Steps