First draft for text description of the data #69

Aastedet · 2024-03-22T13:57:24Z

Just want some feedback on the current format before I fill out the remaining sections describing the raw data with similar content.

I suspect @lwjohnst86 will want to automate the data source descriptions at some point. Then we might have to add a column with the English register names to the variable description CSV.

Aastedet · 2024-03-22T14:04:37Z

Relates to #35

signekb

Looks good! I have added some minor suggestions/questions.
Do you expect to fill in the todo items in this PR as well, or is your plan to create new PRs for those? :)

signekb · 2024-04-25T14:28:13Z

vignettes/algorithm_logic.Rmd

+in future revisions. Refer to the other vignettes for background
+information and a more general description of the algorithm.


You could add links to specific vignettes with relevant information here?

signekb · 2024-04-26T15:36:04Z

vignettes/algorithm_data.Rmd

+
+## Contents
+
+This document describes the structure of the data components processed


I have used the word "data sources". Do we prefer "data components"? Or "data objects" as in the title? I think it makes sense to align this, so the same words are used for the same things throughout the documentation :)

signekb · 2024-04-26T15:37:52Z

vignettes/algorithm_data.Rmd

+In a future revision, the algorithm can also utilise the Danish Medical
+Birth Register to extend the period of time of valid inclusions further
+back in time compared to what is possible using obstetric codes from the
+National Patient Register.


Move this to #78?

signekb · 2024-04-26T15:38:41Z

vignettes/algorithm_data.Rmd

+assumes that raw data is stored/structured in the most common format for
+raw data provided on Statistics Denmark's servers (from our experience).


Maybe add which specific formats you are talking about here? :)

signekb · 2024-04-26T15:44:37Z

vignettes/algorithm_data.Rmd

+    `d_inddto`/`dato_start`.
+
+    -   Named `lpr_adm` in the LPR2-formatted data prior to 2019, and
+        `kontakter` in contact-based LPR3-formatted data from 2019


Do you want to use the English abbreviation DNPR?

Maybe also add what the abbreviation stands for, i.e. Landspatientregisteret?
Or do we assume readers know this?

lwjohnst86 · 2024-05-02T13:31:48Z

@Aastedet and @signekb I've updated this vignette so the descriptions and fake data are included automatically, as tables, from the sources in data/

lwjohnst86 · 2024-05-02T13:33:56Z

R/as-markdown.R

+  rlang::check_installed("glue")
+  rlang::check_installed("knitr")


Whenever you use a function from a package that is listed under "Suggests" rather than as a hard dependency, you need to include a check that tells the user to install the package if it is missing. That's what these two lines do (when you run use_package("packagename", "suggests"), it tells you exactly what to add).
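
A minimal sketch of that pattern, for illustration only. The helper name `as_md_list()` is hypothetical and not part of osdc; the sketch assumes `glue` is listed under Suggests and that `rlang` is available.

```r
# Hypothetical helper (not part of osdc): format a character vector as
# Markdown bullet points.
as_md_list <- function(x) {
  # glue is assumed to be a suggested dependency, so check for it before
  # calling any glue function. If it is missing, the user gets an
  # informative message instead of an obscure "no package called" error.
  rlang::check_installed("glue")
  glue::glue("- {x}")
}

md_items <- as_md_list(c("lpr_adm", "kontakter"))
```

The check sits inside the function body so that it only runs when the function that actually needs the suggested package is called.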

lwjohnst86 · 2024-05-02T13:35:14Z

R/as-markdown.R

+
+  variable_description |>
+    dplyr::select(
+      .data$register_name,


I think I mentioned this already, but `.data$` is used to mask the variable so that the CRAN checks don't warn about "undeclared variables". This works because we already declare the placeholder variable `.data`.
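
A small sketch of the `.data` pronoun on a toy data frame (the rows below are made up for illustration and are not the package's real variable descriptions). One caveat worth keeping in mind: recent tidyselect releases (1.2.0 onwards) deprecate `.data$` inside selection verbs such as `select()`, where a quoted column name serves the same purpose, so the sketch uses `.data$` only in a data-masking verb.

```r
# Toy stand-in for the variable description data (made-up rows).
variable_description <- data.frame(
  register_name = c("lpr_adm", "lpr_adm", "kontakter"),
  variable_name = c("pnr", "recnum", "dw_ek_kontakt"),
  stringsAsFactors = FALSE
)

# Data-masking verbs such as filter() accept the `.data` pronoun, which
# makes it explicit that `register_name` is a column, not a global variable.
lpr_adm_rows <- variable_description |>
  dplyr::filter(.data$register_name == "lpr_adm")

# Selection verbs such as select() accept the column name as a string,
# which also avoids the "undeclared variable" note.
register_column <- variable_description |>
  dplyr::select("register_name")
```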

lwjohnst86 · 2024-05-02T13:36:41Z

vignettes/data-sources.Rmd

+the `dw_ek_kontakt` variable (LPR3 data).
+
+```{r}
+for (register in osdc:::get_register_abbrev()) {


While I almost always suggest not using for loops, this is one of those cases where you need to, since these functions are used to create Markdown text, and it doesn't really work in a "functional" way.

lwjohnst86 · 2024-05-02T13:37:09Z

vignettes/data-sources.Rmd

+    register,
+    caption = glue::glue("Variables and their descriptions within the `{register}` register.")
+  ) |>
+    print()


You need to call print() explicitly because nothing inside a for loop is printed automatically, and we want every one of these tables to appear in the output.
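
The printing behaviour can be checked with base R alone. This is only an illustration; the register names are used as plain strings here rather than as tables.

```r
# A value that is merely evaluated inside a for loop is not auto-printed,
# so nothing is captured here.
without_print <- capture.output(
  for (register in c("lpr_adm", "kontakter")) register
)

# Calling print() explicitly emits output on every iteration.
with_print <- capture.output(
  for (register in c("lpr_adm", "kontakter")) print(register)
)

length(without_print)
length(with_print)
```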

lwjohnst86 · 2024-05-02T13:37:36Z

vignettes/data-sources.Rmd

+```{r, include = FALSE}
+knitr::opts_chunk$set(
+  echo = FALSE,
+  results = "asis",


This tells knitr to write the chunk output into the document as-is, so it is rendered as Markdown text rather than shown as code output.
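
As a rough sketch of what such a chunk can emit: the hypothetical helper below (not part of osdc) writes plain Markdown with `cat()`. Inside a chunk with `results = "asis"` that text would be rendered as a heading and a bullet list. Without the option it would appear verbatim as code output.

```r
# Hypothetical helper (not part of osdc): write a Markdown heading and a
# bullet list of variable names to standard output.
emit_register_section <- function(name, variables) {
  cat("### ", name, "\n\n", sep = "")
  cat(paste0("- `", variables, "`"), sep = "\n")
  cat("\n")
}

# Capture the emitted lines to inspect the raw Markdown.
section_lines <- capture.output(
  emit_register_section("lpr_adm", c("pnr", "recnum"))
)
```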

Aastedet

AWESOME!

Aastedet · 2024-05-16T12:35:46Z

vignettes/data-sources.Rmd

+sources:
+
+```{r, results='asis'}
+osdc:::registers_as_md_table("Danish registers used in the OSDC algorithm.")


Are the triple colons there because this is an internal function/object, or what is going on here?
Also, a note to my future self: you need to build the package in order to access internal objects.

Haha, yes! Or use Ctrl-Shift-L to load the package. And yeah, `:::` accesses all internal objects in a package, neat trick!
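
The difference between `::` and `:::` can be tried on a package that ships with R. The sketch uses `stats` purely as an example: `t.test.default` lives in its namespace but is not exported.

```r
# `:::` reaches into the namespace, so the unexported method is found.
internal_fn <- stats:::t.test.default

# `::` only sees exported objects, so the same lookup fails. The error
# message is captured instead of stopping the script.
exported_attempt <- tryCatch(
  stats::t.test.default,
  error = function(e) conditionMessage(e)
)

is.function(internal_fn)
is.character(exported_attempt)
```

For a package under development, the internal objects are only reachable this way once the package has been built and installed, or loaded in the session.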

First draft. Still missing the actual algorithm logic (among other things).

69276a0

Aastedet requested review from lwjohnst86 and signekb as code owners March 22, 2024 13:57

github-actions bot assigned Aastedet Mar 22, 2024

added section on output (to-do)

f9ccbb9

signekb mentioned this pull request Apr 3, 2024

docs: ✨ Initial draft of functions to extract osdc population #71

Merged

Aastedet mentioned this pull request Apr 17, 2024

docs: ✨ initial draft of functions to classify diabetes type #75

Merged

Added description of changes from original validation and potential future improvements. Added description of raw data. National Patient Register only for now.

71d1e78

signekb added a commit that referenced this pull request Apr 25, 2024

docs: ⚡ add english translations of register names

5df493d

based on #69

signekb marked this pull request as draft April 26, 2024 07:22

Aastedet mentioned this pull request Apr 26, 2024

Overview of changes (current and potential) since original validation #78

Merged

Update algorithm_logic.Rmd

7d2dfd9

Moved changes section to #78

Aastedet changed the title ~~First draft for text description of the algorithm~~ First draft for text description of the data Apr 26, 2024

Anders Aasted Isaksen added 2 commits April 26, 2024 14:12

Renamed document/file to indicate the focus on data.

89f78d5

Added c_spec variable to lpr_adm table. Corrected recnum scheme: the same recnum value is repeated for all diagnoses for that specific contact.

c878066

Aastedet marked this pull request as ready for review April 26, 2024 13:03

Aastedet and others added 3 commits April 26, 2024 15:03

Merge branch 'main' into general-logic-description

e58f1fe

added note on c_spec values LPR2 vs LPR3

71eab8f

Merge branch 'general-logic-description' of https://github.com/steno-aarhus/osdc into general-logic-description

5a16c74

signekb reviewed Apr 26, 2024

lwjohnst86 and others added 7 commits May 2, 2024 13:35

docs: apply suggestions from review

66a232e

Co-authored-by: Signe Kirk Brødbæk <40836345+signekb@users.noreply.github.com>

Merged origin/main into general-logic-description

f2f49d4

chore: rename file to be a bit clearer

45850be

docs: use code to create the table listing the registers

1bc9931

feat: helper functions to insert data into Markdown vignettes

53108a9

build: include dependencies from the helper functions

cf30ea7

docs: updated vignette so variable and register data tables are created automatically

d069d3a

lwjohnst86 reviewed May 2, 2024

Aastedet commented May 16, 2024

lwjohnst86 approved these changes May 16, 2024

lwjohnst86 merged commit 7e2cf66 into main May 16, 2024
2 of 3 checks passed

lwjohnst86 deleted the general-logic-description branch May 16, 2024 14:27
