A library for developing foundation models using Electronic Health Records (EHR) data.
Read our recent EHRMamba paper.
Odyssey is a comprehensive library designed to facilitate the development, training, and deployment of foundation models for Electronic Health Records (EHR). Recently, we used this toolkit to develop EHRMamba, a cutting-edge EHR foundation model that leverages the Mamba architecture and Multitask Prompted Finetuning (MPF) to overcome the limitations of existing transformer-based models. EHRMamba excels in processing long temporal sequences, simultaneously learning multiple clinical tasks, and performing EHR forecasting, significantly advancing the state of the art in EHR modeling.
The toolkit is structured into four main modules to streamline the development process (illustrative sketches of each module's workflow follow the list below):
- **data**:
  - Gathers EHR datasets from HL7 FHIR resources.
  - Processes patient sequences for clinical tasks.
  - Tokenizes data and creates data splits for model training.
  - Provides a dataset class for model training.
- **models**:
  - Implements models including XGBoost, LSTM, CEHR-BERT, BigBird, MultiBird, and EHRMamba.
  - Offers various embedding classes necessary for the models.
- **evals**:
  - Includes tools for testing models on clinical prediction tasks and forecasting.
  - Provides evaluation metrics for thorough assessment of model performance.
- **interp**:
  - Contains methods for interpreting model decisions.
  - Features interactive visualization of attention matrices for Transformer-based models.
  - Includes novel interpretability techniques for EHRMamba and gradient attribution methods.
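To make the data module's role concrete, here is a minimal sketch of the kind of pipeline it covers: tokenizing patient event sequences and exposing them through a dataset class. Every name below (the toy vocabulary, `tokenize`, `PatientSequenceDataset`) is an illustrative assumption, not Odyssey's actual API.

```python
import torch
from torch.utils.data import Dataset

# Toy vocabulary mapping clinical event codes to integer ids; a real
# pipeline would build this from the corpus. Id 0 is reserved for padding.
VOCAB = {"<pad>": 0, "ICD10:E11.9": 1, "RX:metformin": 2, "LAB:hba1c": 3}

def tokenize(events, max_len=8):
    """Map a patient's event codes to ids and pad to a fixed length."""
    ids = [VOCAB.get(event, 0) for event in events][:max_len]
    return ids + [VOCAB["<pad>"]] * (max_len - len(ids))

class PatientSequenceDataset(Dataset):
    """Serves tokenized patient sequences and labels for model training."""

    def __init__(self, sequences, labels):
        self.sequences = [tokenize(seq) for seq in sequences]
        self.labels = labels

    def __len__(self):
        return len(self.labels)

    def __getitem__(self, idx):
        return (
            torch.tensor(self.sequences[idx], dtype=torch.long),
            torch.tensor(self.labels[idx], dtype=torch.float),
        )

# One diabetic patient with a positive label for some downstream task.
ds = PatientSequenceDataset(
    sequences=[["ICD10:E11.9", "RX:metformin", "LAB:hba1c"]],
    labels=[1.0],
)
tokens, label = ds[0]  # token ids padded to max_len, plus the label
```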
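In the same spirit, the simplest of the listed baselines can be sketched generically: an LSTM classifier that embeds token ids and predicts a single logit. This is plain PyTorch, not Odyssey's implementation.

```python
import torch
import torch.nn as nn

class LSTMClassifier(nn.Module):
    """Toy baseline: embed token ids, encode with an LSTM, predict one logit."""

    def __init__(self, vocab_size=4, embed_dim=16, hidden_dim=32):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, 1)

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)      # (batch, seq_len, embed_dim)
        _, (hidden, _) = self.lstm(embedded)      # hidden: (1, batch, hidden_dim)
        return self.head(hidden[-1]).squeeze(-1)  # (batch,) logits

batch = torch.randint(0, 4, (2, 8))  # two dummy patients, eight token ids each
logits = LSTMClassifier()(batch)
```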
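The evaluation tooling itself is Odyssey's own; purely as an illustration, the metrics usually reported for binary clinical prediction tasks (AUROC and AUPRC) can be computed with standard scikit-learn calls.

```python
import numpy as np
from sklearn.metrics import average_precision_score, roc_auc_score

y_true = np.array([0, 0, 1, 1])            # ground-truth labels
y_score = np.array([0.1, 0.4, 0.35, 0.8])  # predicted probabilities

print(f"AUROC: {roc_auc_score(y_true, y_score):.3f}")            # 0.750
print(f"AUPRC: {average_precision_score(y_true, y_score):.3f}")  # 0.833
```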
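Finally, the attention visualizations in the interp module can be approximated with a matplotlib heatmap; the random matrix below merely stands in for attention weights extracted from a trained Transformer, and the novel EHRMamba-specific techniques are not reproduced here.

```python
import matplotlib.pyplot as plt
import numpy as np

# Stand-in attention weights for one head over an 8-token patient sequence;
# a real matrix would come from a transformer's attention outputs.
rng = np.random.default_rng(seed=0)
attention = rng.random((8, 8))
attention /= attention.sum(axis=-1, keepdims=True)  # each row sums to 1

fig, ax = plt.subplots()
im = ax.imshow(attention, cmap="viridis")
ax.set_xlabel("Attended token")
ax.set_ylabel("Query token")
fig.colorbar(im, label="Attention weight")
plt.show()
```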
We welcome contributions from the community! If you would like to contribute, please open an issue to start the discussion.
If you use EHRMamba or Odyssey in your research, please cite our paper:
```bibtex
@misc{fallahpour2024ehrmamba,
  title={EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records},
  author={Adibvafa Fallahpour and Mahshid Alinoori and Arash Afkanpour and Amrit Krishnan},
  year={2024},
  eprint={2405.14567},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}
```