This two-day workshop will teach the basics of the Python programming language with a focus on managing data. Day one will introduce Python and the Jupyter Notebook. Day two will introduce Pandas, a library for data management and analysis. Participants will receive hands-on training and leave with introductory knowledge of Python and Pandas.
Dates:
- May 22nd: Introduction to Jupyter and Python
- May 23rd: Introduction to Pandas and Web Data
Instructors:
- Matt Burton (mcburton@pitt.edu)
Bring a laptop (and power). You do not need to install any software; everything will be running in the cloud.
We will be using the Binder service for hosting the course.
- Introduction to Jupyter
- Python Programming
  - Variables and Data Types
  - Control Flow
  - Data Containers
  - Iteration
  - Functions
  - File Input and Output
- Python Data Manipulation
  - PEP 8
  - String Methods
  - List Comprehensions
  - Traversing the filesystem
  - Working with CSV Files
  - Putting it Together - QuadraSTAR Parsing
- Introduction to Pandas
  - What and Why Pandas
  - Introduction to Pandas Data Structures
  - Loading & Saving Data
  - Basic Data Manipulation
- Fetching & Parsing Data from the Web
  - APIs and JSON
  - Getting Data from the Web
  - Putting it Together - Collections as Data
- Python for Everybody is a great book oriented toward working with data.
- Whirlwind Tour of Python, along with its GitHub repository of notebooks, is a great resource for people who already program in other languages or need a quick Python refresher.
- Here is a good, comprehensive Python Cheatsheet.
- Pitt also has access to Lynda.com, which is a MASSIVE collection of videos and online learning resources, including a bunch on Python programming.
- Python Data Science Handbook is a GREAT book that introduces data manipulation with pandas, visualization with matplotlib, and machine learning with scikit-learn.
- Python Data Analysis 2nd Edition: this book isn't Open Access, but it is available via the University Library System (ULS).
- Chris Albon, author of Machine Learning with Python Cookbook, has a great website of Python and Pandas tricks. The ML Cookbook is also available from ULS.
- Lynda.com has a very comprehensive Python for Data Science Essential Training that is over 6 hours long! There is also a shorter Python: Data Analysis course.
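
For a small taste of several day-one topics from the outline (string methods, list comprehensions, and working with CSV files), here is a minimal sketch. The sample data, the column names, and the `clean_rows` helper are invented for illustration and are not taken from the workshop materials.

```python
import csv
import io

# Hypothetical sample data standing in for a CSV file on disk.
SAMPLE_CSV = """name,city,visits
  ada lovelace ,London,3
grace hopper,New York,5
"""


def clean_rows(text):
    """Parse CSV text and tidy each row (illustrative helper, not workshop code)."""
    reader = csv.DictReader(io.StringIO(text))
    # List comprehension: build one cleaned dictionary per CSV row.
    return [
        {
            "name": row["name"].strip().title(),  # string methods tidy the text
            "city": row["city"].strip(),
            "visits": int(row["visits"]),  # convert the text field to a number
        }
        for row in reader
    ]


rows = clean_rows(SAMPLE_CSV)
for row in rows:
    print(f"{row['name']} ({row['city']}): {row['visits']} visits")
```

With a real file, `csv.DictReader` can be given an open file object directly instead of the `io.StringIO` wrapper used here.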