Published March 27, 2024 | Version v1
Journal article Open

Slideflow: Deep learning for digital histopathology with real-time whole-slide visualization

Description

Deep learning methods have emerged as powerful tools for analyzing histopathological images, but current methods are often specialized for specific domains and software environments, and few open-source options exist for deploying models in an interactive interface. Experimenting with different deep learning approaches typically requires switching software libraries and reprocessing data, reducing the feasibility and practicality of experimenting with new architectures. We developed a flexible deep learning library for histopathology called Slideflow, a package which supports a broad array of deep learning methods for digital pathology and includes a fast whole-slide interface for deploying trained models. Slideflow includes unique tools for whole-slide image data processing, efficient stain normalization and augmentation, weakly-supervised whole-slide classification, uncertainty quantification, feature generation, feature space analysis, and explainability. Whole-slide image processing is highly optimized, enabling whole-slide tile extraction at 40x magnification in 2.5 s per slide. The framework-agnostic data processing pipeline enables rapid experimentation with new methods built with either Tensorflow or PyTorch, and the graphical user interface supports real-time visualization of slides, predictions, heatmaps, and feature space characteristics on a variety of hardware devices, including ARM-based devices such as the Raspberry Pi.

Data availability

Slideflow and Slideflow Studio are available at https://github.com/jamesdolezal/slideflow, via the Python Package Index (PyPI), and Docker Hub (https://hub.docker.com/r/jamesdolezal/slideflow). Slideflow is licensed with GNU General Public License v3.0. The whole-slide image dataset from The Cancer Genome Atlas (TCGA) head and neck squamous cell carcinoma project are publicly available at https://portal.gdc.cancer.gov/projects/TCGA-HNSC. The training datasets from University of Chicago analyzed during the current study are not publicly available due to patient privacy obligations, but are available from the corresponding author on reasonable request. Data can only be shared for non-commercial academic purposes and will require institutional permission and a data use agreement.

Files

Slideflow-Deep-learning-for-digital-histopathology-with-real-time-whole-slide-visualization.pdf

Files (19.7 MB)

Additional details

Identifiers

DOI
10.1186/s12859-024-05758-x
Other
oai:uchicago.tind.io:11458

Funding

National Institutes of Health
K12-CA13916013
National Institutes of Health
R56-DE030958
Department of Defense
Breakthrough Cancer Research program
Wellcome Leap
Q4Bio
European Union
Horizon Programme
National Institutes of Health
UE5-EB035490
National Institutes of Health
R01- R01CA276652
DoE/NCI
Innovative Methodologies and New Data for Predictive Oncology Model Evaluation (IMPROVE) project IAA
SU2C
Maverick Award Grant
ECOG Research and Education Foundation

UChicago Information

Division(s)
Biological Sciences Division, Physical Sciences Division
Department(s)
Computer Science, Medicine