Skip to main content


The Cancer Imaging Archive

SLN-Breast | Breast Metastases to Axillary Lymph Nodes

DOI: 10.7937/tcia.2019.3xbn2jcc | Data Citation Required | Image Collection

Location Species Subjects Data Types Cancer Types Size Status Updated
Breast Human 78 Histopathology Breast Cancer 53GB Public, Complete 2019/07/18


The detection of breast cancer metastases to lymph nodes is of great prognostic value for patient treatment. Using machine learning to detect metastatic breast cancer to lymph nodes can increase efficiency of pathologist diagnosis and ultimately ensure patients are accurately staged for prospective treatment. This dataset allows for the objective comparison of breast cancer metastases detection algorithms.

The dataset consists of 130 de-identified whole slide images of H&E stained axillary lymph node specimens from 78 patients. Metastatic breast carcinoma is present in 36 of the WSI from 27 patients. No patient inclusion/exclusion criteria were followed. No slide inclusion/exclusion criteria were followed. The slides were scanned at Memorial Sloan Kettering Cancer Center (MSKCC) with Leica Aperio AT2 scanners at 20x equivalent magnification (0.5 microns per pixel). Together with the slides, the class label of each slide, either positive or negative for breast carcinoma, is given. The slide class label was obtained from the pathology report of the respective case.

Data Access

Version 1: Updated

Title Data Type Format Access Points Subjects Studies Series Images License
Slide Images Histopathology SVS
Download requires IBM-Aspera-Connect plugin
78 130 CC BY 3.0
Supplemental Data CSV CC BY 3.0

Citations & Data Usage Policy

Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:

Data Citation

Campanella, G., Hanna, M. G., Brogi, E., & Fuchs, T. J. (2019). Breast Metastases to Axillary Lymph Nodes [Data set]. The Cancer Imaging Archive.

Detailed Description

Explanation of target.csv files

target.csv contains a binary label for each slide image in the dataset.

  • target=1 means that the image contains breast cancer metastases.
  • target=0 means that the image does not contain breast cancer metastases.

Other Publications Using this Data

TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you’d like to add please contact TCIA’s Helpdesk.

Publication Citation

Campanella, G., Hanna, M. G., Geneslaw, L., Miraflor, A., Werneck Krauss Silva, V., Busam, K. J., Brogi, E., Reuter, V. E., Klimstra, D. S., & Fuchs, T. J. (2019). Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nature Medicine (Vol. 25, Issue 8, pp. 1301–1309). Springer Science and Business Media LLC.

TCIA Citation

Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC.