The file `data/predictions.RData` contains the output of the model. All variables are described in more depth in the paper.
- The columns `lon` and `lat` are WGS84 longitude and latitude.
- `gridid` is a grid cell identifier.
- `bsm_weight_pubs` is the model’s output weighted with publications (“observed”).
- `bsm_weight_pop` is the model’s output weighted with population (“bias factored out”).
- `bsm_response` is the model’s output weighted with neither.
- The other columns are generally predictor variables used in the model.

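As a minimal sketch of how the file might be inspected in R: the helper `load_rdata` below is hypothetical (it is not part of this repository), and the names of the objects stored inside the file are not documented here, so the code only lists whatever it finds. It loads into a fresh environment so that nothing in the global workspace is overwritten.

```r
# Hypothetical helper: load an .RData file into a fresh environment
# and return the objects it contains as a named list.
load_rdata <- function(path) {
  env <- new.env()
  load(path, envir = env)
  as.list(env)
}

path <- "data/predictions.RData"
if (file.exists(path)) {
  objs <- load_rdata(path)
  # Names of the objects stored in the file (not documented in this README).
  print(names(objs))
}
```

If one of the loaded objects is a data frame, `str()` or `names()` on it should show the columns listed above.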
The weighted versions are generated by multiplying the model output by the publication and population variables, each normalized to sum to 1 over the entire grid.
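The weighting described above can be sketched on a toy grid. This is only an illustration under stated assumptions: the column names `pubs` and `pop` and all numeric values are made up, and the repository's own scripts may compute the weights differently.

```r
# Toy grid; all values are invented for illustration only.
grid <- data.frame(
  gridid = 1:4,
  bsm_response = c(0.2, 0.5, 0.1, 0.8),
  pubs = c(10, 0, 5, 25),     # hypothetical publication variable
  pop  = c(100, 300, 50, 50)  # hypothetical population variable
)

# Normalize a weight vector so that it sums to 1 over the entire grid.
normalize <- function(x) x / sum(x, na.rm = TRUE)

# Weighted outputs: unweighted response times the normalized weight.
grid$bsm_weight_pubs <- grid$bsm_response * normalize(grid$pubs)
grid$bsm_weight_pop  <- grid$bsm_response * normalize(grid$pop)
```

Because each weight vector sums to 1, a grid cell with no publications contributes nothing to `bsm_weight_pubs`, whatever its unweighted response.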
The `data-raw` directory contains all the scripts needed to import the data into the repository. The raw datasets themselves are not in that directory. These scripts build all the working datasets into the `data` directory, which is under version control, so all of the code should run. The original raw data is stored on EHA's AWS S3 buckets, or can be found attached to the `data-raw` release of this repository.