This repository provides a PyTorch implementation of the following paper:
FIESTA: Fourier-Based Semantic Augmentation with Uncertainty Guidance for Enhanced Domain Generalizability in Medical Image Segmentation
Kwanseok Oh1, Eunjin Jeon2, Da-Woon Heo1, Yooseung Shin1, and Heung-Il Suk1, 2
(1Department of Artificial Intelligence, Korea University)
(2Department of Brain and Cognitive Engineering, Korea University)
Official Version: https://arxiv.org/abs/2406.14308Abstract: Single-source domain generalization (SDG) in medical image segmentation (MIS) aims to generalize a model using data from only one source domain to segment data from an unseen target domain. Despite substantial advances in SDG with data augmentation, existing methods often fail to fully consider the details and uncertain areas prevalent in MIS, leading to mis-segmentation. This paper proposes a Fourier-based semantic augmentation method called FIESTA using uncertainty guidance to enhance the fundamental goals of MIS in an SDG context by manipulating the amplitude and phase components in the frequency domain. The proposed Fourier augmentative transformer addresses semantic amplitude modulation based on meaningful angular points to induce pertinent variations and harnesses the phase spectrum to ensure structural coherence. Moreover, FIESTA employs epistemic uncertainty to fine-tune the augmentation process, improving the ability of the model to adapt to diverse augmented data and concentrate on areas with higher ambiguity. Extensive experiments across three cross-domain scenarios demonstrate that FIESTA surpasses recent state-of-the-art SDG approaches in segmentation performance and significantly contributes to boosting the applicability of the model in medical imaging modalities.
- To the best of our knowledge, this work is the first Fourier-based augmentation method that simultaneously manipulates amplitude and phase components using meaningful factors tailored to SDG for cross-domain MIS.
- We propose the FAT, providing an advanced augmentation strategy that combines masking and modulation techniques to transform the amplitude spectrum and applies filtering to refine the phase information to impose structural integrity.
- The FIESTA framework embraces an uncertainty-guided mutual augmentation strategy by applying UG to focus learning in the segmentation model on certain areas of high ambiguity or mis-segmented locations.
- Based on the quantitative and qualitative experimental results on various cross-domain scenarios (including cross-modality, cross-sequence, and cross-sites), we demonstrate the significant robustness and generalizability of FIESTA, which surpasses state-of-the-art SDG methods.
We conduct datasets preparation following CSDG
1) Abdominal MRI
-
Download Combined Healthy Abdominal Organ Segmentation dataset and put the
/MR
folder under./data/CHAOST2/
directory -
Converting downloaded data (T2 SPIR) to
nii
files in 3D for the ease of reading.
run ./data/abdominal/CHAOST2/s1_dcm_img_to_nii.sh
to convert dicom images to nifti files.
run ./data/abdominal/CHAOST2/png_gth_to_nii.ipynp
to convert ground truth with png
format to nifti.
- Pre-processing downloaded images
run ./data/abdominal/CHAOST2/s2_image_normalize.ipynb
run ./data/abdominal/CHAOST2/s3_resize_roi_reindex.ipynb
The processed dataset is stored in ./data/abdominal/CHAOST2/processed/
1) Abdominal CT
- Download Synapse Multi-atlas Abdominal Segmentation dataset and put the
/img
and/label
folders under./data/SABSCT/CT/
directory
1.Pre-processing downloaded images
run ./data/abdominal/SABS/s1_intensity_normalization.ipynb
to apply abdominal window.
run ./data/abdominal/SABS/s2_remove_excessive_boundary.ipynb
to remove excessive blank region.
run ./data/abdominal/SABS/s3_resample_and_roi.ipynb
to do resampling and roi extraction.
The details for cardiac datasets will be given later.
We also provide the processed datasets. Download and unzip the file where the folder structure should look this:
SLAug
├── ...
├── data
│ ├── abdominal
│ │ ├── CHAOST2
│ │ │ ├── processed
│ │ ├── SABSCT
│ │ │ ├── processed
│ ├── cardiac
│ │ ├── processed
│ │ │ ├── bSSFP
│ │ │ ├── LGE
├── ...
If you find this work useful for your research, please cite the following paper:
@article{oh2024fiesta,
title={FIESTA: Fourier-Based Semantic Augmentation with Uncertainty Guidance for Enhanced Domain Generalizability in Medical Image Segmentation},
author={Oh, Kwanseok and Jeon, Eunjin and Heo, Da-Woon and Shin, Yooseung and Suk, Heung-Il},
journal={arXiv preprint arXiv:2406.14308},
year={2024}
}
Our codes are built upon CSDG and SLAug, thanks for their contribution to the community and the development of researches!
This work was supported by the Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) No. 20220-00959 ((Part 2) Few-Shot Learning of Causal Inference in Vision and Language for Decision Making) and No. 20190-00079 (Department of Artificial Intelligence (Korea University)). This study was further supported by KBRI basic research program through Korea Brain Research Institute funded by the Ministry of Science and ICT (22-BR-03-05).