-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Documentation Changes #57
Changes from 5 commits
e56ff27
b7666b7
c9409bf
59e2332
765fffe
a009eaf
81907db
7509407
e4c80ec
2092759
bd96458
40c4ed0
2500db7
6f7a4d4
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
|
@@ -14,11 +14,10 @@ maxdepth: 2 | |||||
GitHub README <readme> | ||||||
Usage Guide <usage> | ||||||
Task Examples <notebooks/examples> | ||||||
Sample Data Tutorial <notebooks/tutorial> | ||||||
Predicates DataFrame <notebooks/predicates> | ||||||
Configuration Language <configuration> | ||||||
Algorithm & Terminology <terminology> | ||||||
Profiling <profiling> | ||||||
Sample Data Tutorial <notebooks/tutorial> | ||||||
Technical Details <technical> | ||||||
Computational Profile <profiling> | ||||||
Module API Reference <api/modules> | ||||||
License <license> | ||||||
``` | ||||||
|
@@ -29,29 +28,29 @@ ______________________________________________________________________ | |||||
|
||||||
If you have a dataset and want to leverage it for machine learning tasks, the ACES ecosystem offers a streamlined and user-friendly approach. Here's how you can easily transform, prepare, and utilize your dataset with MEDS and ACES for efficient and effective machine learning: | ||||||
|
||||||
### 1. Transform to MEDS | ||||||
### I. Transform to MEDS | ||||||
|
||||||
- Simplicity: Converting your dataset to the Medical Event Data Standard (MEDS) is straightforward and user-friendly compared to other Common Data Models (CDMs). | ||||||
- Minimal Bias: This conversion process ensures that your data remains as close to its raw form as possible, minimizing the introduction of biases. | ||||||
- [MEDS-ETL](https://github.com/Medical-Event-Data-Standard/meds_etl): Follow this link for detailed instructions and ETLs to transform your dataset into the MEDS format! | ||||||
|
||||||
### 2. Identify Predicates | ||||||
### II. Identify Predicates | ||||||
|
||||||
- Task-Specific Concepts: Identify the predicates (data concepts) required for your specific machine learning tasks. | ||||||
- Pre-Defined Criteria: Utilize our pre-defined criteria across various tasks and clinical areas to expedite this process. | ||||||
- [PIE-MD](https://github.com/mmcdermott/PIE_MD/tree/main/tasks/criteria): Access our repository of tasks to find relevant predicates! | ||||||
|
||||||
### 3. Set Dataset-Agnostic Criteria | ||||||
### III. Set Dataset-Agnostic Criteria | ||||||
|
||||||
- Standardization: Combine the identified predicates with standardized, dataset-agnostic criteria files. | ||||||
- Examples: Refer to the [MIMIC-IV](https://github.com/mmcdermott/PIE_MD/tree/main/tasks/MIMIC-IV) and [eICU](https://github.com/mmcdermott/PIE_MD/tree/main/tasks/eICU) examples for guidance on how to structure your criteria files for your private datasets! | ||||||
|
||||||
### 4. Run ACES | ||||||
### IV. Run ACES | ||||||
|
||||||
- Run the ACES Command-Line Interface tool (`aces-cli`) to extract cohorts based on your task - check out the [Usage Guide](https://eventstreamaces.readthedocs.io/en/latest/usage.html)! | ||||||
|
||||||
### 5. Run MEDS-Tab | ||||||
### V. Run MEDS-Tab | ||||||
|
||||||
- Painless Reproducibility: Use [MEDS-Tab](https://github.com/mmcdermott/MEDS_TAB_MIMIC_IV/tree/main/tasks) to obtain comparable, reproducible, and well-tuned XGBoost results tailored to your dataset-specific feature space! | ||||||
|
||||||
By following these steps, you can seamlessly transform your dataset, define necessary criteria, and leverage powerful machine learning tools within the ACES ecosystem. This approach not only simplifies the process but also ensures high-quality, reproducible results for your machine learning for health projects. It can reliably take no more than a week of full-time human effort to perform steps 1-5 on new datasets in reasonable raw formulations! | ||||||
By following these steps, you can seamlessly transform your dataset, define necessary criteria, and leverage powerful machine learning tools within the ACES ecosystem. This approach not only simplifies the process but also ensures high-quality, reproducible results for your machine learning for health projects. It can reliably take no more than a week of full-time human effort to perform Steps I-V on new datasets in reasonable raw formulations! | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Consider simplifying the sentence for better readability. - This approach not only simplifies the process but also ensures high-quality, reproducible results for your machine learning for health projects. It can reliably take no more than a week of full-time human effort to perform Steps I-V on new datasets in reasonable raw formulations!
+ This approach simplifies the process and ensures high-quality, reproducible results. Typically, Steps I-V can be completed within a week of full-time effort on new datasets. Committable suggestion
Suggested change
ToolsLanguageTool
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -5,5 +5,3 @@ | |
language: text | ||
--- | ||
``` | ||
|
||
______________________________________________________________________ |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,12 @@ | ||
# TODO - include the table from supplementary | ||
# Computational Profile | ||
|
||
| Task | # Patients | # Samples | Total Time (secs) | Max Mem (MB) | | ||
| ------------------------------------- | ---------- | --------- | ----------------- | ------------ | | ||
| First 24h in-hospital mortality | - | - | - | - | | ||
| First 48h in-hospital mortality | - | - | - | - | | ||
| First 24h in-ICU mortality | - | - | - | - | | ||
| First 48h in-ICU mortality | - | - | - | - | | ||
| 30d post-hospital-discharge mortality | - | - | - | - | | ||
| 30d re-admission | - | - | - | - | | ||
| Hospital length-of-stay | - | - | - | - | | ||
| ICU length-of-stay | - | - | - | - | |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
```{include} configuration.md | ||
``` | ||
|
||
```{include} terminology.md | ||
``` |
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
@@ -1,6 +1,4 @@ | ||||||
# Algorithm & Design | ||||||
|
||||||
## Introduction | ||||||
## Algorithm & Design | ||||||
|
||||||
We will assume that we are given a dataframe `df` which details events that have happened to subjects. Each | ||||||
row in the dataframe will have a `subject_id` column which identifies the subject, and a `timestamp` column | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Address grammatical issue in the introductory paragraph. - which details events that have happened to subjects.
+ detailing events that have occurred to subjects. Committable suggestion
Suggested change
ToolsLanguageTool
|
||||||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Consider removing the commented-out
preamble
settings if they are no longer needed to clean up the configuration file.