This repository contains dated records of curated Marburg case data from the 2023 outbreak in Equatorial Guinea. Data are curated from openly accessible sources only. We update the line list as new data become available.
Latest data and archives | Summary report
There may be discrepancies, and data remain limited at this stage of the outbreak. If you find additional detail or have comments about the accuracy of the information supplied here, please write to info@global.health.
There are two datasets: a line list and a timeseries aggregation. The timeseries shows only cases that have a Date_onset, or whose Date_onset can be estimated from Date_death or Date_of_first_consult.
Python
import pandas as pd
# line list
df = pd.read_csv("https://l66noa47nk.execute-api.eu-central-1.amazonaws.com/web/url?folder=&file_name=latest.csv")
# aggregate timeseries data by location and status
ts = pd.read_csv("https://marburg-aggregates.s3.eu-central-1.amazonaws.com/timeseries-location-status/latest.csv")
R
# line list
df <- read.csv("https://l66noa47nk.execute-api.eu-central-1.amazonaws.com/web/url?folder=&file_name=latest.csv")
# aggregate timeseries data by location and status
ts <- read.csv("https://marburg-aggregates.s3.eu-central-1.amazonaws.com/timeseries-location-status/latest.csv")
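The rule for which cases are visible in the timeseries can be sketched in Python as follows. This is a minimal illustration, not the project's actual aggregation code: the rows are made up, the `ID` column name and the `in_timeseries` flag are assumptions, and only the three date column names come from the description above. The sketch flags cases as eligible; it does not attempt to reproduce how an onset date is estimated.

```python
import pandas as pd

# Made-up rows for illustration only; these are not real case records.
cases = pd.DataFrame(
    {
        "ID": [1, 2, 3, 4],  # hypothetical column name
        "Date_onset": ["2023-02-01", None, None, None],
        "Date_death": [None, "2023-02-10", None, None],
        "Date_of_first_consult": [None, None, "2023-02-05", None],
    }
)

# Parse the date columns; missing or unparseable values become NaT.
date_columns = ["Date_onset", "Date_death", "Date_of_first_consult"]
for column in date_columns:
    cases[column] = pd.to_datetime(cases[column], errors="coerce")

# A case is eligible if an onset date is recorded, or if at least one
# of the two fallback dates exists from which onset could be estimated.
has_onset = cases["Date_onset"].notna()
can_estimate = cases[["Date_death", "Date_of_first_consult"]].notna().any(axis=1)
cases["in_timeseries"] = has_onset | can_estimate

print(cases[["ID", "in_timeseries"]])
```

In this made-up example, case 4 has none of the three dates and would therefore not appear in the timeseries.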
UPDATE 2023-04-24: Following updates from the WHO, Global.health added 3 new probable cases to our line-list (ID# 41, 42, 43), bringing our total number of probable cases to 23. All probable cases have died. Global.health acknowledges differences in location data for probable cases between MINSABS and WHO reporting, as discussed in our 2023-04-19 update. We are unable to reconcile location data for probable cases at this time.
UPDATE 2023-04-19: Confirmed case counts are updated from MINSABS and WHO reporting. Location data for the first 15 confirmed cases is consistent with MINSABS reporting through the Epidemiological update on 2023-04-16 [n=15. Bata 9; Ebibeyin 3; Evinayong 2; Nsork 1]. Confirmed case number 16 was first added to our line-list using data from the WHO Director-General media briefing.
Probable cases (n=20) are included in our line-list using data from MINSABS Official Statement No. 3, Page 7. WHO reported an update on 2023-04-15 increasing the probable count to 23. Global.health is working to reconcile our case and location data; however, due to the presentation of aggregated data in official reporting, we may be unable to update or continue to track probable case data in our line-list moving forward.
Three suspected cases, now discarded, are included in our line-list using data from MINSABS Official Statement No. 3, Page 7. Suspected cases are not included in our line-list after this report.
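A small check like the one below may help when comparing a tally from the line-list against the district breakdown quoted in the 2023-04-19 update for the first 15 confirmed cases. It is a sketch only: `compare_counts` is a hypothetical helper, and the tallied counts are a stand-in for values you would compute yourself from the line-list.

```python
# District counts for the first 15 confirmed cases, as quoted in the update.
reported_confirmed = {"Bata": 9, "Ebibeyin": 3, "Evinayong": 2, "Nsork": 1}


def compare_counts(reported, tallied):
    """Return {location: (reported, tallied)} for locations whose counts differ."""
    locations = sorted(set(reported) | set(tallied))
    return {
        loc: (reported.get(loc, 0), tallied.get(loc, 0))
        for loc in locations
        if reported.get(loc, 0) != tallied.get(loc, 0)
    }


# The quoted breakdown should add up to the stated n=15.
assert sum(reported_confirmed.values()) == 15

# Stand-in tally; replace with counts derived from the line-list.
tallied_confirmed = dict(reported_confirmed)
print(compare_counts(reported_confirmed, tallied_confirmed))
```

An empty result means the two breakdowns agree; any entry in the result names a location to look at more closely.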
This section gives an overview of the data curation process and discusses its limitations and assumptions.
The Marburg line-list is built by checking a collection of sources, listed here, which will be updated as new sources become available: https://github.com/globaldothealth/marburg/wiki. The original source(s) of information is provided for each line-list ID in our database. Data released from Ministerio de Sanidad y Bienestar Social de la República de Guinea Ecuatorial (MINSABS) has been our primary source of information (https://guineasalud.org/). Our line-list also includes publicly available data from the World Health Organization.
Metadata are added at any time, as information becomes available and as our time and resources permit. When a case is changed, it is recorded as modified, with the date of the change. Multiple curators review each data point, and any discrepancies are resolved in conversation between them. We remain limited by inconsistent, aggregated, or missing case information; changes in reporting format; data reconciliation; reporting delays; and changes in case definitions, among other factors. We make assumptions that may compromise the accuracy of the data.
Users should refer to our data dictionary for a description of each variable. Limitations and assumptions for select variables are briefly discussed below.
Case_status: Only confirmed and probable cases are logged at this time.
Date_onset: Information is available for a selection of cases.
Outcome. Type: Death: The report date is used when the source does not specify a Date_death.
Outcome. Type: Recovered: The report date is used when the source does not specify a Date_recovered.
Healthcare_worker: Due to the limited availability of information, we have not been able to log every HCW case or outcome.
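The report-date fallback described for outcome dates can be illustrated with a short Python sketch. It is not the curators' actual procedure: the rows are made up, and the `Outcome` values and the `Report_date` column name are assumptions chosen for the example. Only `Date_death` and `Date_recovered` are names taken from the text above.

```python
import pandas as pd

# Made-up rows; "Report_date" is a hypothetical name for the date of the source report.
outcomes = pd.DataFrame(
    {
        "Outcome": ["death", "death", "recovered"],  # hypothetical values
        "Date_death": ["2023-03-01", None, None],
        "Date_recovered": [None, None, None],
        "Report_date": ["2023-03-03", "2023-03-10", "2023-03-20"],
    }
)
for column in ["Date_death", "Date_recovered", "Report_date"]:
    outcomes[column] = pd.to_datetime(outcomes[column], errors="coerce")

is_death = outcomes["Outcome"] == "death"
is_recovered = outcomes["Outcome"] == "recovered"

# Where the source gives no outcome date, fall back to the report date.
# Dates that the source did specify are left unchanged.
outcomes.loc[is_death, "Date_death"] = (
    outcomes.loc[is_death, "Date_death"].fillna(outcomes["Report_date"])
)
outcomes.loc[is_recovered, "Date_recovered"] = (
    outcomes.loc[is_recovered, "Date_recovered"].fillna(outcomes["Report_date"])
)

print(outcomes)
```

One consequence worth keeping in mind when analysing the data: an outcome date filled this way marks when the outcome was reported, which may be later than when it occurred.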
Data are hand-curated. The process and methods to create, organize, and maintain data have been applied with consistency; however, we’re human and mistakes happen. As stated above, line-list data may change due to ongoing data reconciliation and validation. We welcome your contributions and feedback. Get involved!
If you would like to request changes, open an issue on this repository and we will happily consider your request.
If requesting a fix please include steps to reproduce undesirable behaviors.
If you would like to contribute, assign an issue to yourself and/or reach out to a contributor and we will happily help you help us.
This repository is published under MIT License and data exports are published under the CC BY 4.0 license.
Please cite as: "Global.health Marburg (accessed on YYYY-MM-DD)". In publications and derivatives that use these data, please also credit the appropriate agency, paper, and/or individual, contact them regarding the legal use of these data, and remember to pass forward any existing license, warranty, or copyright information.
Please also refer to the original sources of the data: Ministerio de Sanidad y Bienestar Social de la República de Guinea Ecuatorial (https://guineasalud.org/) and World Health Organization (https://www.who.int/emergencies/emergency-events/item/2023-e000057)
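If you generate citations programmatically, the access date in the requested citation string can be filled in as in the sketch below. The `citation` function is a hypothetical helper written for this example; only the citation wording comes from the text above.

```python
from datetime import date


def citation(accessed=None):
    """Build the requested citation string, using today's date if none is given."""
    accessed = accessed or date.today()
    return f"Global.health Marburg (accessed on {accessed.isoformat()})"


print(citation(date(2023, 4, 24)))
# prints: Global.health Marburg (accessed on 2023-04-24)
```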