Skip to content

Commit

Permalink
prepare abstract for SwissRN conference
Browse files Browse the repository at this point in the history
  • Loading branch information
larnsce committed Mar 8, 2024
1 parent 2e512b2 commit 94819ed
Show file tree
Hide file tree
Showing 3 changed files with 659 additions and 0 deletions.
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -10,3 +10,4 @@ _extensions
.Rdata
.httr-oauth
.quarto
*.docx
41 changes: 41 additions & 0 deletions 3-blog/posts/2024-03-08-swissrn-abstract/index.qmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,41 @@
---
title: "Research Data Management in Practice - How hiring a data steward ..."
subtitle: "Abstract for submission to Swiss Reproducibility conference 2024"
author:
- name: Lars Schöbitz
url: https://lse.de/
affiliation: Global Health Engineering, ETH Zurich
affiliation-url: https://ghe.ethz.ch
date: today
draft: true
---

# Conference info

## Selected Topic: Transparency and Open Scholarship

Explore the transformative wave of open scholarship, emphasizing the importance of transparency in the research lifecycle. From pre-registration and registered reports to open access publications, research data, and code—this session illuminates the pivotal role of open practices in fostering trust and collaboration in the scientific community.

## Conference Goals

- Engage with researchers to make their research rigorous, transparent and reproducible
- Promote RTR research practices
- Disseminate ways to improve research quality

## Opportunities and Exposure:

- Foster scientific exchange across all disciplines in Switzerland
- Provide the research community with a unique exposure to resources, expertise, and approaches in reproducible research

# Title: "Research Data Management in Practice - How hiring a data steward ..." (10 / 40 words)

# Abstract (342/350 words)

This talk will promote the RTR research practices we have applied to research in our group and the scientific community. We will highlight our approach to open data and code as individual research products separate from scientific articles derived from them. Using a real-world case study example, we will present this approach to reproducible research.

The R package development environment allows researchers to keep an audit trail from unprocessed raw data to analysis-ready data. Data are stored in a git repository on GitHub with the code for data processing and rich metadata and documentation, following FAIR data sharing principles. The repository contains a citation file format (.cff) file that records each contributor's ORCID ID and a permissive CC-BY license. The GitHub to Zenodo integration allows for the automated generation of a digital object identifier (DOI) and ensures long-term archiving, following internationally recommended best practices by funding agencies. Once published, the entry is imported to the ETH Research Collection via the DOI for increased discoverability and institutional archiving. For data communication purposes, the R package pkgdown is ideal. Without any web development experience, the package allows competent R practitioners to prepare a visually appealing website with R code snippets showing exploratory data analysis examples.

We invest in this process at the data collection point long before preparing a scientific article. The process actively promotes rigorous research data management practices to our BSc, MSc, and PhD students and senior research staff, who follow best practices for transparency and open scholarship without being fully aware. Researchers use the published R data package to prepare a scientific article citing the repository and complying with any journal's policies for data availability statements and external long-term archiving.

In our talk and the following discussion, we will argue that implementing these practices was only feasible by hiring a full-time data steward / open science specialist. We will discuss how invested financial resources will pay off as publishers of high-quality journals will increasingly reject article submissions that do not comply with data and code transparency, the foundation of computational reproducibility.

Loading

0 comments on commit 94819ed

Please sign in to comment.