Siamese Augmentation Strategies (SiAug)

Paper | OpenReview | ArXiv

This repository contains the implementation code for our paper:
Exploring Image Augmentations for Siamese Representation Learning with Chest X-Rays

  • Authors: Rogier van der Sluijs*, Nandita Bhaskhar*, Daniel Rubin, Curtis Langlotz, Akshay Chaudhari
  • * denotes co-first authors
  • Published at Medical Imaging with Deep Learning (MIDL)

tl;dr

Tailored augmentation strategies for image-only Siamese representation learning can outperform supervised baselines on chest X-ray classification under zero-shot transfer, linear probing, and fine-tuning. We systematically assess the effect of various augmentations on the quality and robustness of the learned representations. We train and evaluate Siamese networks for abnormality detection on chest X-rays across three large datasets (MIMIC-CXR, CheXpert, and VinDr-CXR). We investigate the efficacy of the learned representations through experiments involving linear probing, fine-tuning, zero-shot transfer, and data efficiency. Finally, we identify a set of augmentations that yield robust representations that generalize well to both out-of-distribution data and diseases, while outperforming supervised baselines by up to 20% using only zero-shot transfer and linear probes.

Installation

To contribute to siaug, install the package in editable mode, along with its requirements and the pre-commit hooks:

pip install -e .
pip install -r requirements.txt
pre-commit install
pre-commit

Make sure to update the .env file according to the setup of your cluster and the placement of your project folder on disk. Also, run accelerate config to generate a config file, and copy it from ~/.cache/huggingface/accelerate/default_config.yaml to the project directory. Finally, create symlinks from the data/ folder to the datasets you want to train on.
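A minimal sketch of these setup steps, assuming the default Hugging Face cache location; the dataset symlink names and paths under data/ are placeholders, not paths prescribed by the repository:

# Generate the accelerate config interactively, then copy it to the project root
accelerate config
cp ~/.cache/huggingface/accelerate/default_config.yaml .

# Symlink your local dataset copies into data/ (illustrative names and paths)
mkdir -p data
ln -s /path/to/mimic-cxr data/mimic-cxr
ln -s /path/to/chexpert data/chexpert
ln -s /path/to/vindr-cxr data/vindr-cxr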

Training

Currently, we support two modes of training: pretraining and linear evaluation.

Representation learning

To learn a new representation, you can use the train_repr.py script.

# Train and log to WandB
accelerate launch siaug/train_repr.py experiment=experiment_name logger=wandb

# Resume from checkpoint
accelerate launch siaug/train_repr.py ... resume_from_ckpt=/path/to/accelerate/ckpt/dir

# Run a fast_dev_run
accelerate launch siaug/train_repr.py ... fast_dev_run=True max_epoch=10 log_every_n_steps=1 ckpt_every_n_epochs=1

Linear evaluation

To train a linear classifier on top of a frozen backbone, use the train_lcls.py script.

# Train a linear classifier on top of a frozen backbone
accelerate launch siaug/train_lcls.py experiment=experiment_name model.ckpt_path=/path/to/model/weights

# Train a linear classifier on top of a randomly initialized backbone
accelerate launch siaug/train_lcls.py model.ckpt_path=None

# Use ImageNet pretrained weights
accelerate launch siaug/train_lcls.py +model.pretrained=True

Zero-Shot Evaluation

To evaluate a model on a downstream task without fine-tuning, use the siaug/eval.py script.

python siaug/eval.py experiment=eval_chex_resnet +checkpoint_folder=/path/to/model/checkpoints/folder +save_path=/path/to/save/resulting/pickle/files

Contact Us

This repository is being developed at Stanford's MIMI Lab. Please reach out to sluijs [at] stanford [dot] edu and nanbhas [at] stanford [dot] edu if you would like to use or contribute to siaug.

Citation

If you find our paper and/or code useful, please use the following BibTeX entry for citation:

@article{sluijsnanbhas2023_siaug,
  title={Exploring Image Augmentations for Siamese Representation Learning with Chest X-Rays}, 
  author={Rogier van der Sluijs and Nandita Bhaskhar and Daniel Rubin and Curtis Langlotz and Akshay Chaudhari},
  year={2023},
  journal={Medical Imaging with Deep Learning (MIDL)},
}
