Benchmarking with GEE: example with Wolf replication #144
Replies: 6 comments
-
@Shirobakaidou just sent me a snippet that helps computing deforestation per year on GEE. I'll incorporate it on this code to facilitate comparison of GEE vs. mapme for annual deforestation. Thanks @Shirobakaidou! |
Beta Was this translation helpful? Give feedback.
-
awesome analysis. Helps us to advance in several aspects and we should discuss them in detail @melvinhlwong. Some spontaneous thoughts:
|
Beta Was this translation helpful? Give feedback.
-
Ok. Just saw that Curtis study maps drivers from 2001-2015 so we might consider them ... but it's a pity that they are not ongoing... |
Beta Was this translation helpful? Give feedback.
-
Wait for my proposal on Monday! (it's already on the concept board) :-) |
Beta Was this translation helpful? Give feedback.
-
I think this thread is better suited in the discussion section? Or is there a direct link to an issue with the package that I am currently missing? |
Beta Was this translation helpful? Give feedback.
-
Yeah I think that too. Is it possible to convert? |
Beta Was this translation helpful? Give feedback.
-
TL;DR: The paper by Wolf et al. published in 2021 in Nature Ecolology and Evolution represent I think the state of the art that we are trying to improve with this package. I am currently working on a replication of this study (see work in progress here) I found that the original code cannot run (see issues on the repo where the code is published) so I re-wrote everything. I think that we could use the code written for this replication to cross-validate result accuracy and benchmark performance with mapme.biodiversity. It could also inform best practices for PA impact estimation too (although that speaks more to the work hosted on the KfW repo for reproducible workflows. I'll try to sumarize here the insights we could glean from this ongoing replication.
Background: For the initial study, the authors used GEE to resize and fetch raster files (elevation, population density, time travel and GFC cover, loss, lossyear and gain)., to later on process it with different scripts combining python, R and Julia to prepare the data. There is at least one coding typo so the original code cannot run as is. The code used deprecated version of several packages, but even fetching older version, I get cryptic errors when running the preparation + matching script in Julia. I contacted the author, but he replied that he didn't know what the error message went and was not able to locate the package/software version used for the calculations of the initial study.
Data computation on GEE: I re-implemented the processing workflow by Wolf et al, but using GEE more extensively, thanks to the {rgee} package that interfaces with GEE API. The code is here. It runs in less than an hour and spits out rasters per country, that is small enough to be ingested by a normal computer.
Take-aways for the Mapme effort:
I think there are a few strong points that we could incorporate in our current work:
Wolf et al. use WWF ecoregions + Curtis et al. deforestation drivers as matching variables, which I think is very sound. The mapme.biodiversity packages computes ecoregions, but I think that it is not used as a matching variable in KfW analysis. I think that it would be a good idea to do it. I think that the deforestation drivers are not computed by the package (this could be a feature request I think) and I think it is not included in KfW portfolio analysis either.
I think there are important limitations in Wolf et al. that the package mapme.biodiversity is designed to avoid them (in particular 1, 3 and 4.):
Sorry for the long thread, it's Friday.
Beta Was this translation helpful? Give feedback.
All reactions