NeurIPS MedFM Challenge: 2nd Place - 2023

This repository contains the code to reproduce the 2nd place submission for the Foundation Model Prompting for Medical Image Classification Challenge 2023 (MedFM).

It is based on MMPreTrain, which which uses the backbones ViT-cls, ViT-eva02, ViT-dinov2, Swin-cls and ViT-clip. More details could be found in its document.

Installation

Install requirements by

$ conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=10.1 -c pytorch
$ pip install openmim scipy scikit-learn ftfy regex tqdm
$ mim install mmpretrain

Usage

Preparation

Prepare data following MMPreTrain. Download the dataset and unzip the '.zip' files into {$MedFM}/data/ as following:

MedFM (root)/
    ├── docker               # dockerfiles
    ├── configs              # all the configs
    │   ├── clip-b_vpt
    │   ├── swin-b_vpt
    │   ├── dinov2-b_vpt
    │   └── ...
    ├── data               
    │   ├── MedFMC_train      # unzip the download file
    │   ├── MedFMC_val
    │   └── ...
    ├── data_anns
    │   ├── MedFMC
    │   |    ├── chest
    │   |    ├── colon
    │   |    ├── endo
    │   ├── result             # sample example to submit 
    │   └── ...
    ├── medfmc                 # all source code
    ├── tools                  # train, test and other tools
    └── ...

click to show the detail of data_anns

Note that the `.txt` files includes data split information for fully supervised learning and few-shot learning tasks. The public dataset is split into `trainval.txt` and `test_WithLabel.txt`, and `trainval.txt` is also split into `train_20.txt` and `val_20.txt` where `20` means the training data makes up 20% of `trainval.txt`. `test_WithoutLabel.txt` of each dataset represents the validation set.

Corresponding .txt files are stored at ./data_anns/ folder, the few-shot learning data split files {dataset}_{N_shot}-shot_train/val_exp{N_exp}.txt could also be generated as below:

python tools/generate_few-shot_file.py

Where N_shot is 1,5 and 10, respectively, the shot is of patient(i.e., 1-shot means images of certain one patient are all counted as one), not number of images.

Training and Testing

For training and testing, you can directly use the commands listed below:

# you need to export path in terminal so the `custom_imports` in config would work
export PYTHONPATH=$PWD:$PYTHONPATH
# Training
python tools/train.py $CONFIG

# Evaluation
python tools/test.py $CONFIG $CHECKPOINT

Generating Submission results

Run

python tools/infer.py $CONFIF $WEIGHT $IMAGE_FOLDER --batch-size 4 --out $OUT_FILE_PATH

Using the MedFMC repository with Docker

click to show the detail

More details of Docker could be found in this tutorial.

Preparation of Docker

We provide a Dockerfile to build an image. Ensure that your docker version >=19.03.

# build an image with PyTorch 1.11, CUDA 11.3
# If you prefer other versions, just modified the Dockerfile
docker build -t medfmc docker/

Run it with

docker run --gpus all --shm-size=8g -it -v {DATA_DIR}:/medfmc/data medfmc

The submitted docker will be evaluated by the following command:

docker container run --gpus all --shm-size=8g -m 28G -it --name teamname --rm -v $PWD:/medfmc_exp -v $PWD/data:/medfmc_exp/data teamname:latest /bin/bash -c "sh /medfmc_exp/run.sh"

--gpus: specify the number of available GPUs during inference
-m: specify the maximum RAM
--name: container name
--rm: remove the container after running
-v $PWD:/medfmc_exp: map local codebase folder to Docker medfmc_exp folder.
-v $PWD/data:/medfmc_exp/data: map local codebase folder to Docker medfmc_exp/data folder.
teamname:latest: docker image name (should be teamname) and its version tag. The version tag should be the latest.
/bin/bash -c "sh run.sh": start the prediction command.

Assuming the team name is baseline, the Docker build command is

docker build -t baseline .

During the inference, please monitor the GPU memory consumption using watch nvidia-smi. The GPU memory consumption should be less than 10G. Otherwise, it will run into an OOM error on the official evaluation server.

3) Save Docker

docker save baseline | gzip -c > baseline.tar.gz

License

This project is released under the Apache 2.0 license.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
configs		configs
data_anns		data_anns
dataset_creation		dataset_creation
docker		docker
docs		docs
ensemble		ensemble
experiment-chest		experiment-chest
experiment-endo		experiment-endo
experiment_colon		experiment_colon
gridsearch		gridsearch
medfmc		medfmc
submissions		submissions
tools		tools
utility		utility
.flake8		.flake8
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
clear_runs_without_checkpoints		clear_runs_without_checkpoints
prep_infer_model_soup.py		prep_infer_model_soup.py
prep_infer_model_soup_tmp.py		prep_infer_model_soup_tmp.py
prep_infer_with_exp.py		prep_infer_with_exp.py
reproduce.py		reproduce.py
requirements.txt		requirements.txt
requirements_cuda10.txt		requirements_cuda10.txt
run.sh		run.sh
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NeurIPS MedFM Challenge: 2nd Place - 2023

Installation

Usage

Preparation

Training and Testing

Generating Submission results

Using the MedFMC repository with Docker

Preparation of Docker

3) Save Docker

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NeurIPS MedFM Challenge: 2nd Place - 2023

Installation

Usage

Preparation

Training and Testing

Generating Submission results

Using the MedFMC repository with Docker

Preparation of Docker

3) Save Docker

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages