Skip to content
/ DICOM2OMOP Public
forked from paulnagy/DICOM2OMOP

This project looks at creating a controlled vocabulary for DICOM Pt 6 Data Dictionary with a focus on CS code strings.

License

Notifications You must be signed in to change notification settings

SMI/DICOM2OMOP

 
 

Repository files navigation

Dicom2OMOP

This project builds on a published work, "Development of Medical Imaging Data Standardization for Imaging-Based Observational Research: OMOP Common Data Model Extension". A copy of the full paper can also be found in the files folder. This project looks at creating a controlled vocabulary for the DICOM Pt 6 Data Dictionary and DICOM Pt 16, focusing on code strings in the OMOP vocabulary format.

  • Create a library that takes a dicom tag
    • (eg Part 6 0010,0040 Patient Sex)
    • (Link to Part 16 CID 7455 Sex)
    • (Ingest FHIR JSON)
    • (Create "maps to" from Source to Standard vocabulary for OMOP gender)
  • Identify current gaps in SNOMED and LOINC mapping from DICOM

References

The repository is structured in phases:

  1. Harvest standards: this folder includes codes to ingest DICOM Standard from JSON and XML files. You may skip this step and access to ingested files in the files folder. You can use this code scripts to ingest newer versions of the DICOM Standard.
  2. DICOM to OMOP: this folder contains codes to build the imaging extension tables to your existing OMOP CDM, trainsform ingested DICOM Standards into OMOP format, and upload the newly created DICOM concepts to the OMOP CDM with imaging extension.
  3. Demonstration: we extracted data from Alzheimer's Disease Neuroimaging Initiative Image and Data Archive (ADNI IDA). The demographics, clinical data are in flat file formats, and the images in the DICOM format. We pre-loaded ADNI 3 DICOM images to Johns Hopkins Azure instance, and accessed the imaging file from the Azure. This folder includes scripts to download and unpack DICOM metadata from the images and transforming imaging metadata and data from flat files. In addition to the imaging metadata, we ran a segmentation algorithm from OpenMAP, and transformed the results into OMOP CDM format. The code to run the algorithm is not part of this repo, but the transformation of results is in "transform_upload_imaging_algo_results.ipynb" file.
  4. Analysis: all the other exploratory analyses are stored in this folder.

About

This project looks at creating a controlled vocabulary for DICOM Pt 6 Data Dictionary with a focus on CS code strings.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.3%
  • Python 0.7%