Pathology Datasets
Pathology datasets are collections of digitized tissue images and associated data, crucial for training and evaluating algorithms in computational pathology. Current research focuses on developing robust models for tasks like image segmentation, classification (including cancer subtyping and grading), and visual-language analysis, often employing transformer-based architectures, diffusion models, and self-supervised learning techniques to address data scarcity and heterogeneity. These advancements are significantly impacting healthcare by enabling more accurate and efficient disease diagnosis, prognosis, and treatment planning, ultimately improving patient care. The development of large-scale, publicly available datasets and standardized evaluation frameworks is also a key area of progress, fostering collaboration and reproducibility within the field.
Papers
Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images
Gabriele Campanella, Ricky Kwan, Eugene Fluder, Jennifer Zeng, Aryeh Stock, Brandon Veremis, Alexandros D. Polydorides, Cyrus Hedvat, Adam Schoenfeld, Chad Vanderbilt, Patricia Kovatch, Carlos Cordon-Cardo, Thomas J. Fuchs
Exploring adversarial attacks in federated learning for medical imaging
Erfan Darzi, Florian Dubost, N. M. Sijtsema, P. M. A van Ooijen