Course materials for the Code/Astro workshop.
Workshop website: https://semaphorep.github.io/codeastro/
Workshop "cheatsheet" of links & information: https://github.com/semaphoreP/codeastro/blob/main/workshop_info.md
Recording of the 2022 Code/Astro workshop: https://www.youtube.com/playlist?list=PLb1880Rn0qkK7zTWcqGaXNbKZbxkpvUuH.
Before the workshop, please complete the following:
-
Follow the installation instructions to get the necessary packages installed.
-
Do the diagnostic assignment to get familiarized with some Python fundamentals.
Throughout this week, you’ll be working with a small group of collaborators (1-4 others) on a final project, a real open-source Python package for astronomy. If you already have group partners in mind, please email us! If not, we will assign you to a group based on timezone, career level, and any personal preferences you've communicated to us. This is an opportunity to put the knowledge you’ve learned from Code/Astro intro practice. On the final day of the workshop, you’ll (optionally) have a chance to present your package to the other Code/Astro participants.
Develop an open-source Python package for astronomy from the ground up. Your project should:
- be maintained on GitHub
- be thoroughly documented using docstrings
- contain unit tests and end-to-end tests
- be pip installable and runnable on other computers
- be a tool with an API (i.e. not a single script or notebook). We should be able to import your package and call it.
Each of these goals will be covered on a different day of the workshop, so it’s completely fine if you don’t know how to do each of these right now.
You should aim to spend 10 hours (~3.3 hours per day) working on the project. Days 1 and 2 are the main time to develop the core functionality of your package. Writing docs/tests/packaging will keep you busy on Day 4 (we don't nomianlly have anything planned on Day 3).
Here’s a more detailed suggested timeline:
Day 1: decide on the purpose of the package. Write one central function or class and put it on Github. Example: we’re working on a package that plots the most likely galactic orbits for a user-specified subset of the Gaia catalog. Today, me and partner 1 will write a function that uses galpy to compute and plot orbits for a list of stars, and partners 2 and 3 will write a class for constructing the list of stars using user input. We'll use two separate branches to do this.
Day 2: get a feature working and committed to Github. You'll be busy later this week doing documentation and other scaffolding, so try to finish something you can demo today. Example: We’ll continue working on the core functionality we sketched out yesterday so that we can produce a plot of likely galactic orbits given a Gaia star ID.
Day 3: Nothing is scheduled for today!
Day 4: The final stretch! Document your code, add tests, release it, and get ready to present tomorrow! Example: we’ll write docstrings for a couple core classes and functions (all we have time for) and add a tutorial to our README. We’ll write unit tests that evaluate the performance of key functions (e.g. if a user sets a degree range keyword, does the Gaia query work appropriately?), and end-to-end tests that make sure the whole package is working. We'll put our code on PyPI and try pip installing it on each of our machines to work through installation problems.
https://github.com/sonithls/GaiaCurves
https://github.com/JulienNGirard/SEDition
https://github.com/Bhavesh012/TCalc
https://github.com/laldoroty/snlcpy
During the final presentation, you will have the opportunity to give a short (5 minute) demonstration of the package your developed. Remember to talk about the motivation behind the package, any challenges or difficulties you encountered, and show off what the package does! We'll send around a sign-up sheet on Day 4, so please sign up for a slot!
To do this project, you’ll be working with others to program. This is likely a new experience for most of you. We suggest you use “pair programming” to collaborate, especially towards the beginning of the week:
- Pair programming
- One person is the “driver” who types code
- One person is the “navigator” who watches what gets typed and offers directions
- The driver thinks more about having the right syntax, while the navigator thinks more big picture about how this code should work or function
- Switch back and forth constantly (e.g., every 10 minutes or after finishing a function)
- In three person groups, two people are navigators
- Use Discord/Zoom/Skype to share screens if working together virtually
- Code individually when there are separate tasks to be done
- Use git branches (which we will cover on Day 2) to prevent overriding each others’ work
Many of you came to this workshop with ideas for open-source projects that are relevant to your research. If you’re in that category, please use this project as an opportunity to make something useful for your research!
Many of you don’t have research experience, and/or did not come to this workshop with an idea. That’s completely fine too! If you’re struggling to think of an idea for a project, here’s a list of ideas to give you some inspiration:
- write an educational visualization of a physical process (e.g. check out Show Me the Orbit)
- write something that randomly generates a joke or song lyrics given the name of a user
- write a quadratic equation solver that prints out the steps for solving by hand
- write a image processing tool that does one main task (e.g., make color images from astronomical data taken at different wavelengths, a tool that automatically looks for stars in an image).