Clinical Proteomic Tumor Analysis Consortium (CPTAC)

Search and visualize data from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) collections to investigate cancer phenotypes that may correlate with corresponding proteomic, genomic, and clinical data.

The Cancer Genome Atlas (TCGA)

Search and visualize data from The Cancer Genome Atlas (TCGA) collections to investigate cancer phenotypes that may correlate with corresponding genomic and clinical data.

National Lung Screening Trial

This collection contains subjects from the National Lung Screening Trial (NLST), a randomized controlled clinical trial of screening tests for lung cancer conducted between August 2002 and April 2004.

Osteosarcoma Pathology

The osteosarcoma dataset is composed of digitized hematoxylin and eosin (H&E)-stained osteosarcoma histology images from adolescents. The data was collected by a team of clinical scientists at the University of Texas Southwestern Medical Center, Dallas.

Prostate Fused-MRI-Pathology

This collection comprises a total of 28 prostate MRI acquired at 3 Tesla (T1-weighted, T2-weighted, diffusion-weighted, and dynamic contrast-enhanced), along with accompanying digitized histopathology (H&E-stained) images of the corresponding radical prostatectomy specimens.

Prostate-MRI

The Prostate-MRI pathology dataset contains H&E-stained prostate images from the National Cancer Institute, generated between 2008 and 2010.

Osteosarcoma Tumor Assessment

The dataset is composed of hematoxylin and eosin (H&E)-stained osteosarcoma histology images. It consists of 1144 images of size 1024 × 1024 at 10X resolution, with the following distribution: 536 (47%) non-tumor images, 263 (23%) necrotic tumor images, and 345 (30%) viable tumor images.

Lung Fused-CT-Pathology

This is the first attempt at mapping the extent of invasive adenocarcinoma onto in vivo lung CT. The mappings constitute ground truth of disease and may be used to further investigate the imaging signatures of invasive adenocarcinoma in ground glass pulmonary nodules. Data collection and analysis were provided by Case Western Reserve University.

AML-Cytomorphology_LMU

The Munich AML Morphology Dataset contains 18,365 expert-labeled single-cell images taken from patients diagnosed with Acute Myeloid Leukemia at Munich University Hospital. The dataset has been used by the authors to train a convolutional neural network for single-cell morphology classification.

Post-NAT-BRCA

The Post-NAT-BRCA dataset is a collection of representative sections from breast resections in patients with residual invasive breast cancer following neoadjuvant therapy. Histologic sections were prepared and digitized to produce high-resolution microscopic images of treated breast cancer tumors.

SLN-Breast

The dataset consists of 130 de-identified whole slide images (WSIs) of H&E-stained axillary lymph node specimens from 78 patients. Metastatic breast carcinoma is present in 36 of the WSIs, from 27 patients.

C-NMC 2019

This data collection consists of 15,135 acute lymphoblastic leukemia (ALL) images which were split into 3 separate testing phases for the purpose of training a machine learning-based algorithm. The dataset was used for the IEEE ISBI 2019 conference challenge.

MiMM_SBILab

This data collection consists of 85 Jenner-Giemsa stained bone marrow aspirate slides from patients diagnosed with multiple myeloma. Images were captured in raw BMP format with a size of 2560×1920 pixels.

SN-AM

Microscopic images were captured from bone marrow aspirate slides of patients diagnosed with B-lineage Acute Lymphoid Leukemia (B-ALL) and Multiple Myeloma (MM) as per the standard guidelines. This dataset consists of 90 images of B-ALL and 100 images of MM.

IvyGAP

This data collection consists of MRI/CT scan data as well as clinical and genomic pathology data for the brain tumor patients who form the cohort for the Ivy Glioblastoma Atlas Project (Ivy GAP) resource. There are 390 studies for 39 patients, including pre-surgery, post-surgery, and follow-up scans.