Send us Fan Mail
Are pathology foundation models actually ready for labs, or are they still stronger on paper than in practice?
In this episode of DigiPath Digest #49, I unpack a timely review on pathology foundation models and ask the question that matters most to me: not just what these models can do, but what has to be true before they are genuinely useful in real pathology workflows.
I walk through how pathology AI moved from narrow, task-specific models into the era of transformer-based foundation models. That shift matters because pathology is no longer only about looking at H&E in isolation. Today, pathologists are expected to integrate morphology, immunohistochemistry, molecular assays, genomics, and clinical context. That growing complexity is one reason foundation models are getting so much attention.
In this discussion, I explain how transformers entered pathology, why image patches are treated like tokens, and how shared embeddings can support classification, regression, segmentation, and multimodal retrieval. I also go through the major pathology foundation models mentioned in the paper, including Virchow/Virchow2, Mayo Clinic Atlas, UNI, CONCH, H-Optimus, GigaPath, and TITAN, and why scale alone is not the full story.
A big part of this episode is about the gap between benchmark performance and clinical readiness. I talk about the persistent limitations in training data diversity, the overuse of TCGA, and why public benchmarks can still miss what real pathology practice looks like. I also cover where foundation models still struggle, especially in cytopathology, hematopathology, and underrepresented disease areas, along with the real-world problems of artifacts, domain shift, concept drift, infrastructure burden, regulatory complexity, and workflow disruption.
For me, one of the most important themes is this: AI in pathology should augment, not replace, pathologists. The future is not about handing diagnosis to a model. It is about building tools that support pathologists better, fit real workflows, and can be validated in ways that deserve trust.
I also spend time on what comes next: explainable AI, counterfactual explanations, conversational interfaces, retrieval-augmented systems, multimodal fusion, and the need for deployment-centric validation rather than paper-only excitement.
If you are trying to understand where pathology foundation models really stand today, this episode will help you separate the promise from the practical barriers.
Episode Highlights
00:01 – Why I chose this paper, what is changing at Digital Pathology Place, and why foundation models are worth paying attention to now.
02:15 – The core questions: what pathology foundation models are, where they are, and how difficult they are to apply in pathology.
04:50 – Why pathology is becoming more cognitively demanding, and how multimodal complexity is driving interest in scalable AI.
07:02 – From narrow AI to transformers: how pathology moved beyond single-task CNN models.
10:16 – How transformers work in pathology: image patches as tokens, self-attention, embeddings, and downstream tasks.
14:16 – Why multimodality matters, and what kinds of data foundation models may eventually integrate.
15:27 – Timeline of key model developments, from “Attention Is All You Need” to gigapixel-scale pathology foundation models.
17:13 – The leading models and what scale really looks like: Virchow, Mayo Clinic Atlas, UNI, CONCH, H-Optimus, and GigaPath.
19:51 – Why dataset diversity matters more than sheer volume, and why TCGA is not enough.
23:17 – Where foundation models still struggle: cytopathology, hematopathology, rare disease, artifacts, scanner shifts, and pen marks.
28:06 – Explainability, counterfactual explanations, and why trust in pathology AI needs more than attention maps.
30:17 – The real deployment hurdles: regulation, infrastructure, workflow fit, and economics.
36:32 – Why AI should augment pathologists, not replace them, and which tedious tasks pathologists would gladly hand over.
38:36 – Retrieval-augmented and conversational AI in pathology: where interactive systems may actually help.
40:51 – Vision-language models and multimodal fusion with histology, radiology, genomics, and clinical notes.
42:16 – The path forward: deployment-centric design, prospective multi-site validation, and human-AI collaboration.
44:08 – Closing thoughts on AI literacy, community learning, and what needs to happen next.
Resources Mentioned
Main paper discussed:
Pathology Foundation Models: Evolution, Current Landscape, Challenges and Opportunities from a Technical and Clinical Perspective
https://doi.org/10.3390/bioengineering13050577
Review article / journal landing page:
https://doi.org/10.3390/bioengineering13050577
Benchmarks mentioned:PathoBench — discussed in the review paper; use the review link here for context until you want to swap in a canonical project page:
https://doi.org/10.3390/bioengineering13050577
PathBench — public benchmark paper:
https://arxiv.org/abs/2505.20202
MEDFAIR — benchmark paper:
https://arxiv.org/abs/2210.01725
MEDFAIR code repository:
https://github.com/ys-zong/MEDFAIR
Models mentioned:Model overview in the review (Virchow/Virchow2, UNI, CONCH, H-Optimus, GigaPath, TITAN, Mayo Clinic Atlas):
https://doi.org/10.3390/bioengineering13050577
Virchow:
https://arxiv.org/abs/2309.07778
UNI:
https://arxiv.org/abs/2308.15474
CONCH:
https://arxiv.org/abs/2307.12914
Mayo Clinic Atlas:
https://arxiv.org/abs/2501.05409
TITAN:
https://arxiv.org/abs/2411.19666
Dataset mentioned:
The Cancer Genome Atlas (TCGA)
https://portal.gdc.cancer.gov/
Book mentioned:
Digital Pathology 101: All You Need to Know to Start and Continue Your Digital Pathology Journey
https://digitalpathologyplace.com/
Platform:
Digital Pathology Place
https://digitalpathologyplace.com/
Support the show
Get the "Digital Pathology 101" FREE E-book and join us!