by Katie Masiello, Ryan Johnson
🗓️ August 12, 2024
⏰ 09:00 - 17:00
🏨 ROOM 505 | Queets
✍️ pos.it/conf
In this R-focused workshop, we will discuss ways to improve your data science workflows! During the course, we will review packages for data validation, alerting, modeling, and more. We'll use Posit's open source and professional tools to string all the pieces together for an efficient workflow. We'll discuss environments, managing deployed content, working with databases, and interoperability across data products.
This course is for you if you:
- Build finished data products starting from raw data and are looking to improve your workflow
- Are looking to expand your knowledge of Posit open source and professional tools
- Want to improve interoperability between data products in your work or on your team
- Have experience developing in R. An analogous course with a Python focus is also offered
Most important prework
- Bring your laptop
- Sign up for a GitHub account (Click here to create one if you do not already have one). You will not be able to access our training environment without a valid GitHub account
- Get an Access Code for the https://wsdot.wa.gov/traffic/api/ API. We will be working with this data source throughout the workshop. Make sure to save your access code!
Time | Activity |
---|---|
9:00 - 10:30 | Workshop Introduction |
Reading, Cleaning, Writing and Validating Data | |
10:30 - 11:00 | Coffee break |
11:00 - 12:30 | Creating, Delivering, and Monitoring a Tidymodel |
12:30 - 1:30 | Lunch Break |
1:30-3:00 | Reporting |
3:00-3:30 | Coffee break |
3:30-5:00 | Advancing your Workflow |
Ryan Johnson, Data Science Advisor, Posit
Ryan Johnson is a Data Science Advisor at Posit with a background in Microbiology and Bioinformatics. He obtained his PhD from the Uniformed Services University in Maryland and did his postdoctoral training at the National Human Genome Research Institute, NIH. The only thing that rivals his love for infectious diseases is generating 'super cool' visualizations from large data sets using R and RStudio. In his free time, you can find Ryan running marathons/ultramarathons in the DC area or hiking miles along the Appalachian Trail. Ryan resides in Gaithersburg with his wife and two feline co-workers.
Katie Masiello, Senior Solutions Engineer, Posit
Katie Masiello is a Solutions Engineer at Posit. A mechanical engineer by training, she found her calling in data science while working statistical analysis in the aerospace industry. A good cup of coffee, reproducibility, and making life easier for the next user are three things she loves most. Katie is an avid knitter and knitr, and she can often be found trying to tame her ridiculously overgrown garden, collecting pebbles, or thinking about taking up running as a hobby.
The sample data science project used for this workshop provides assets and applications using data that have been modified for use from their original sources, WSDOT Traveler Information API and Open Meteo, and are used for educational purposes. The authors and original sources make no claims as to the content, accuracy, timeliness, or completeness of any of the data provided.
This work is licensed under a Creative Commons Attribution 4.0 International License.