Skip to main content

AML-CYTOMORPHOLOGY_MLL_HELMHOLTZ

The Cancer Imaging Archive

AML-Cytomorphology_MLL_Helmholtz | A morphological dataset of white blood cells from patients with four different genetic AML entities and non-malignant controls

DOI: 10.7937/6PPE-4020 | Data Citation Required | Image Collection

Location Species Subjects Data Types Cancer Types Size Supporting Data Status Updated
Blood Human 189 Histopathology Acute Myeloid Leukemia 13.3GB Clinical Public, Complete 2023/10/13

Summary

This dataset comprises four prevalent AML subtypes with defining genetic abnormalities and typical morphological features according to the WHO 2022 classification: (i) APL with PML::RARA fusion, (ii) AML with NPM1 mutation, (iii) AML with CBFB::MYH11 fusion (without NPM1 mutation), and (iv) AML with RUNX1::RUNX1T1 fusion, as well as a control group of healthy stem cell donors. 

A total of 189 peripheral blood smears from the Munich Leukemia Laboratory (MLL) database from the years 2009 to 2020 were digitized. First, all blood smears were scanned with 10x magnification and an overview image was created. Using the Metasystems Metafer platform, cell detection was performed automatically using a segmentation threshold and logarithmic color transformation. Further analysis regarding the quality of the region within the blood smear was performed automatically. Per patient, 99-500 white blood cells were then scanned in 40x magnification via oil immersion microscopy in .TIF format, corresponding to 24,9μm x 24,9μm (144x144 pixels). For this, a CMOS Color Camera from MetaSystems with a resolution of 4096x3000px and a pixel size of 3,45μm x 3,45μm was used. Four pixels were binned into one, leading to a size of 6.9μm x 6.9μm, and a resolution of 6.9μm / 40 (1px = 0,1725μm). Additional information about patient age, sex and blood counts are provided in a separate .csv file.

To our knowledge, this dataset covers the morphological complexity of acute myeloid leukemia in peripheral blood smears in unseen quality and quantity. 

Data Access

Version 1: Updated 2023/10/13

Title Data Type Format Access Points Subjects Studies Series Images License
Slide Images Histopathology TIFF
Download requires IBM-Aspera-Connect plugin
189 81,214 CC BY 4.0
Clinical metadata XLSX CC BY 4.0
Related Datasets
No related Analysis Results found: Submit your proposal! No related Collections found
Legend: Analysis Results| Collections

Citations & Data Usage Policy

Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:

Data Citation

Hehr, M., Sadafi, A., Matek, C., Lienemann, P., Pohlkamp, C., Haferlach, T., Spiekermann, K., & Marr, C. (2023). A morphological dataset of white blood cells from patients with four different genetic AML entities and non-malignant controls (AML-Cytomorphology_MLL_Helmholtz) (Version 1) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/6PPE-4020

Acknowledgements

  • All samples were collected, diagnosed and scanned at the Munich Leukemia Laboratory (MLL).
  • Carsten Marr has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement No. 866411).
  • Matthias Hehr acknowledges support from Deutsche José Carreras-Leukämie Stiftung.

Related Publications

Publications by the Dataset Authors

The authors recommended this paper as the best source of additional information about this dataset:

  • Hehr, M., Sadafi, A., Matek, C., Lienemann, P., Pohlkamp, C., Haferlach, T., Spiekermann, K., & Marr, C. (2023). Explainable AI identifies diagnostic cells of genetic AML subtypes. In H. Mattie (Ed.), PLOS Digital Health (Vol. 2, Issue 3, p. e0000187). Public Library of Science (PLoS). https://doi.org/10.1371/journal.pdig.0000187

No publications by dataset authors were found.

Publication Citation

Hehr, M., Sadafi, A., Matek, C., Lienemann, P., Pohlkamp, C., Haferlach, T., Spiekermann, K., & Marr, C. (2023). Explainable AI identifies diagnostic cells of genetic AML subtypes. In H. Mattie (Ed.), PLOS Digital Health (Vol. 2, Issue 3, p. e0000187). Public Library of Science (PLoS). https://doi.org/10.1371/journal.pdig.0000187

Research Community Publications

TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you’d like to add please contact the TCIA Helpdesk.

Other Publications Using this Data

TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you’d like to add please contact the TCIA Helpdesk.