Introduction

mil4wsi is an open-source framework that collects implementations of state-of-the-art Multiple Instance Learning (MIL) models for gigapixel whole slide images (WSIs). It is intended for researchers and developers who want to explore and apply MIL techniques.

Automatic Installation

conda create -n wsissl python=3.10
conda activate wsissl
conda env update --file environment.yml

Manual Installation

Create the environment:

conda create -n ENV_NAME python=3.10
conda activate ENV_NAME

1) Install torch; 2) Install pytorch_geometric; 3) Install the additional packages for visualization and logging:

pip install submitit joblib pandas wandb openslide-python==1.2.0 scikit-image wsiprocess scikit-learn matplotlib nystrom_attention

Example with torch==2.4.0; cuda==11.8

conda create -n ENV_NAME python=3.10 && conda activate ENV_NAME && \
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118 && \
pip install torch_geometric pyg_lib torch_scatter torch_sparse torch_cluster torch_spline_conv -f https://data.pyg.org/whl/torch-2.4.0+cu118.html && \
pip install submitit joblib pandas wandb openslide-python==1.2.0 scikit-image wsiprocess scikit-learn matplotlib nystrom_attention

Data Preprocessing

This work uses CLAM to filter out background patches. Once the .h5 coordinate files have been generated, apply the following steps:

H5-to-jpg: converts the .h5 coordinates into jpg patch images
Sort images: reorganizes the patches into hierarchical folders
DINO Training: given the patches, trains DINO with the vit_small option
Feature Extraction: extracts patch features and adjacency matrices
Geometric Dataset Conversion: converts the extracted data into a PyTorch Geometric dataset so that graph architectures can be used
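To illustrate what an adjacency matrix over patches can look like, here is a minimal sketch that links each patch to its 8 spatial neighbours based on the top-left coordinates. It is not the repository's implementation: the function name build_adjacency, the 8-neighbour rule, and the toy coordinates are assumptions made for illustration only.

```python
from itertools import product


def build_adjacency(coords, patch_size):
    """Return a sorted edge list linking each patch to its 8 spatial neighbours.

    coords: list of (x, y) top-left corners (hypothetical input layout).
    patch_size: distance in pixels between neighbouring patches (assumed
    equal to the patch side, i.e. non-overlapping patches).
    """
    index = {xy: i for i, xy in enumerate(coords)}
    edges = []
    for (x, y), i in index.items():
        for dx, dy in product((-1, 0, 1), repeat=2):
            if dx == 0 and dy == 0:
                continue  # skip the patch itself
            j = index.get((x + dx * patch_size, y + dy * patch_size))
            if j is not None:
                edges.append((i, j))
    return sorted(edges)


# Toy example: three touching patches and one isolated patch.
coords = [(0, 0), (256, 0), (0, 256), (1024, 1024)]
edges = build_adjacency(coords, patch_size=256)
print(edges)
```

The edge list is directed in both directions, which is the layout graph libraries such as PyTorch Geometric typically expect for an undirected graph.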

Available Models

MaxPooling
MeanPooling
ABMIL
DSMIL
DASMIL
BUFFERMIL
TRANSMIL
HIPT
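The simplest models in this list differ mainly in how they aggregate patch (instance) embeddings into one slide (bag) embedding. The sketch below shows max, mean, and attention-weighted pooling in plain Python. It is an illustration under simplifying assumptions, not the code in this repository: ABMIL scores instances with a small learned network, whereas here a single hypothetical vector w stands in for it, and the bag values are toy data.

```python
import math


def max_pooling(bag):
    """Element-wise maximum over the instance embeddings of a bag."""
    return [max(col) for col in zip(*bag)]


def mean_pooling(bag):
    """Element-wise mean over the instance embeddings of a bag."""
    return [sum(col) / len(bag) for col in zip(*bag)]


def attention_pooling(bag, w):
    """Softmax-weighted average of instances.

    Scores are dot products with w (a stand-in for a learned scorer).
    Returns the pooled embedding and the attention weights.
    """
    scores = [sum(h_k * w_k for h_k, w_k in zip(h, w)) for h in bag]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    attn = [e / total for e in exps]
    dim = len(bag[0])
    pooled = [sum(a * h[k] for a, h in zip(attn, bag)) for k in range(dim)]
    return pooled, attn


# Toy bag: three instances with two features each.
bag = [[1.0, 0.0], [0.0, 2.0], [3.0, 1.0]]
print(max_pooling(bag))  # [3.0, 2.0]
print(mean_pooling(bag))
pooled, attn = attention_pooling(bag, w=[0.0, 0.0])
print(pooled, attn)
```

With an all-zero w every instance receives the same weight, so attention pooling reduces to mean pooling; a trained scorer would instead concentrate weight on the most informative patches.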

DASMIL

@inproceedings{Bontempo2023_MICCAI,
    author={Bontempo, Gianpaolo and Porrello, Angelo and Bolelli, Federico and Calderara, Simone and Ficarra, Elisa},
    title={{DAS-MIL: Distilling Across Scales for MIL Classification of Histological WSIs}},
    booktitle={Medical Image Computing and Computer Assisted Intervention – MICCAI 2023},
    pages={248--258},
    year=2023,
    month={Oct},
    publisher={Springer},
    doi={10.1007/978-3-031-43907-0_24},
    isbn={978-3-031-43906-3}
}


@ARTICLE{Bontempo2024_TMI,
  author={Bontempo, Gianpaolo and Bolelli, Federico and Porrello, Angelo and Calderara, Simone and Ficarra, Elisa},
  journal={IEEE Transactions on Medical Imaging}, 
  title={A Graph-Based Multi-Scale Approach With Knowledge Distillation for WSI Classification}, 
  year={2024},
  volume={43},
  number={4},
  pages={1412-1421},
  keywords={Feature extraction;Proposals;Spatial resolution;Knowledge engineering;Graph neural networks;Transformers;Prediction algorithms;Whole slide images (WSIs);multiple instance learning (MIL);(self) knowledge distillation;weakly supervised learning},
  doi={10.1109/TMI.2023.3337549}}

Training

python main.py --datasetpath DATASETPATH --dataset [cam or lung]

Reproducibility

Pretrained models

DINO checkpoint	Camelyon16	LUNG
x5	~0.65GB	~0.65GB
x10	~0.65GB	~0.65GB
x20	~0.65GB	~0.65GB

DASMIL	Camelyon16	LUNG
Model size	~9MB	~15MB
ACC	0.945	0.92
AUC	0.967	0.966

PyTorch Geometric - Extracted Features

Extracted features	Camelyon16	LUNG
Dataset size	~4.25GB	~17.5GB

Eval

Set the checkpoint and dataset paths in utils/experiment.py, then run:

python eval.py --datasetpath DATASETPATH --checkpoint CHECKPOINTPATH --dataset [cam or lung]
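For readers unfamiliar with the ACC and AUC figures reported under Reproducibility, the following sketch shows how these two metrics are commonly computed from slide-level labels and predicted scores. It makes no claim about how eval.py computes them: the function names, the 0.5 threshold, and the toy labels and scores are assumptions for illustration.

```python
def accuracy(labels, scores, threshold=0.5):
    """Fraction of slides whose thresholded score matches the label."""
    preds = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


def roc_auc(labels, scores):
    """AUC as the probability that a positive slide outranks a negative one.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative slide")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy slide-level predictions (invented values, not results from this work).
labels = [0, 0, 1, 1]
scores = [0.1, 0.6, 0.4, 0.9]
acc = accuracy(labels, scores)
auc = roc_auc(labels, scores)
print(acc, auc)
```

Accuracy depends on the chosen threshold, while AUC only depends on how the scores rank the slides, which is why the two numbers can differ noticeably on the same predictions.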

Contributing

We encourage and welcome contributions from the community to help improve the MIL Models Framework and make it even more valuable for the entire machine-learning community.

