Exploratory Data Analysis with Pandas and Matplotlib


Exploratory data analysis is a practical approach to working with real data, and Pandas and Matplotlib are excellent tools for reading, validating, and visualizing data. In this tutorial, we use data from the General Social Survey (GSS) to learn about these tools by exploring changes in political affiliation and attitudes in the United States over the last 50 years.


Allen Downey

Allen Downey is a professor of Computer Science at Olin College and the author of a series of open-source textbooks related to software and data science, including Think Python, Think Bayes, and Think Complexity, which are also published by O’Reilly Media. His blog, Probably Overthinking It, features articles on Bayesian probability and statistics. He holds a Ph.D. in computer science from U.C. Berkeley, and M.S. and B.S. degrees from MIT.