generated from jamesmbaazam/QuartoPresentationTemplate
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
917cee2
commit b43a234
Showing
2 changed files
with
40 additions
and
61 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
--- | ||
title: Introduction to Infectious Disease Dynamics Modelling | ||
output: github_document | ||
--- | ||
|
||
<!-- README.md is generated from README.Rmd. Please edit that file --> | ||
|
||
```{r, include = FALSE} | ||
knitr::opts_chunk$set( | ||
collapse = TRUE, | ||
comment = "#>" | ||
) | ||
``` | ||
|
||
<!-- badges: start --> | ||
|
||
<!-- badges: end --> | ||
|
||
This repository contains the materials for the course "Introduction to Infectious Disease Dynamics Modelling" taught by [Dr. James Azam](https://jamesmbaazam.github.io/jamesmbaazam/) on behalf of the German West African Centre for Global Health and Pandemic Prevention (G-WAC) at the Kwame Nkrumah University of Science and Technology. | ||
|
||
The course is designed to provide an introduction to infectious disease dynamics modelling using R. The course is aimed at students and researchers in the field of public health, epidemiology, and related disciplines. The course is designed to be interactive and hands-on, with a focus on practical applications of infectious disease dynamics modelling to pandemic response. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,61 +1,19 @@ | ||
# Intro to Arrow Workshop | ||
|
||
by Steph Hazlitt & Nic Crane | ||
|
||
|
||
### Workshop Website | ||
|
||
This repository contains materials for the **Intro to Arrow** workshop. | ||
|
||
### Workshop Overview | ||
|
||
This workshop will focus on using the arrow R package---a mature R interface to Apache Arrow---to process larger-than-memory files and multi-file datasets with arrow using familiar dplyr syntax. You'll learn to create and use interoperable data file formats like Parquet for efficient data storage and access, and also how to exercise fine control over data types to avoid common large data pipeline problems. This workshop will provide a foundation for using Arrow, giving you access to a powerful suite of tools for performant analysis of larger-than-memory data in R. | ||
|
||
*This course is for you if you:* | ||
|
||
- want to learn how to work with tabular data that is too large to fit in memory using existing R and tidyverse syntax implemented in Arrow | ||
- want to learn about Parquet and other file formats that are powerful alternatives to CSV files | ||
- want to learn how to engineer your tabular data storage for more performant access and analysis with Apache Arrow | ||
|
||
### Workshop Prework | ||
|
||
Detailed instructions for software requirements and data sources are shown below. | ||
|
||
#### Packages | ||
|
||
To install the required core packages for the workshop, run the following: | ||
|
||
```{r} | ||
install.packages(c( | ||
"arrow", "dplyr", "stringr", "lubridate", "tictoc" | ||
)) | ||
``` | ||
#### Seattle Checkouts by Title Data | ||
|
||
This is the data we will use in the workshop. It's a good-sized, single CSV file---*9GB* on-disk in total, which can be downloaded from an AWS S3 bucket via https: | ||
|
||
```{r} | ||
options(timeout = 1800) | ||
download.file( | ||
url = "https://r4ds.s3.us-west-2.amazonaws.com/seattle-library-checkouts.csv", | ||
destfile = "./data/seattle-library-checkouts.csv" | ||
) | ||
``` | ||
|
||
#### Tiny Data Option | ||
|
||
If you don't have time or disk space to download the 9Gb dataset (and still have disk space to do the exercises), you can run the code in the workshop with "tiny" version of this data. Although the focus in this course is working with larger-than-memory data, you can still learn about the concepts and workflows with smaller data---although note you may not see the same performance improvements that you would get when working with larger data. | ||
|
||
```{r} | ||
options(timeout = 1800) | ||
download.file( | ||
url = "https://github.com/posit-conf-2023/arrow/releases/download/v0.1.0/seattle-library-checkouts-tiny.csv", | ||
destfile = "./data/seattle-library-checkouts-tiny.csv" | ||
) | ||
``` | ||
|
||
If you want to participate in the coding exercise or follow along, please try your very best to begin the workshop ready with the required software & packages installed and the data downloaded on to your laptop. | ||
|
||
------------------------------------------------------------------------ | ||
|
||
![](https://i.creativecommons.org/l/by/4.0/88x31.png) This work is licensed under a [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/). | ||
Introduction to Infectious Disease Dynamics Modelling | ||
================ | ||
|
||
<!-- README.md is generated from README.Rmd. Please edit that file --> | ||
<!-- badges: start --> | ||
<!-- badges: end --> | ||
|
||
This repository contains the materials for the course “Introduction to | ||
Infectious Disease Dynamics Modelling” taught by [Dr. James | ||
Azam](https://jamesmbaazam.github.io/jamesmbaazam/) on behalf of the | ||
German West African Centre for Global Health and Pandemic Prevention | ||
(G-WAC) at the Kwame Nkrumah University of Science and Technology. | ||
|
||
The course is designed to provide an introduction to infectious disease | ||
dynamics modelling using R. The course is aimed at students and | ||
researchers in the field of public health, epidemiology, and related | ||
disciplines. The course is designed to be interactive and hands-on, with | ||
a focus on practical applications of infectious disease dynamics | ||
modelling to pandemic response. |