Skip to main content

PRETREAT-METSTOBRAIN-MASKS

The Cancer Imaging Archive

Pretreat-MetsToBrain-Masks | A Large Open Access Dataset of Brain Metastasis 3D Segmentations on MRI with Clinical and Imaging Feature Information

DOI: 10.7937/6be1-r748 | Data Citation Required | Image Collection

Location Species Subjects Data Types Cancer Types Size Supporting Data Status Updated
Brain Human 200 MR, Segmentations Breast Cancer, Small Cell Lung Cancer, Non-small Cell Lung Cancer, Melanoma 1.7GB Clinical, Image Analyses Public, Complete 2023/12/19

Summary

Cohort Selection:

Patient images were collected from three sources: the Yale New Haven Health database (2013-2021), the Yale tumor board registry (2021), and the Yale gamma knife registry (2017-2021).

Inclusion/exclusion criteria:

Inclusion criteria included a pathologically proven diagnosis of brain metastasis and availability of a pretreatment scan with standard MRI sequences (T1 w, T1 post-gadolinium, T2w, and FLAIR). Exclusion criteria included lack of pretreatment scan or one of the standard MR sequences and significant motion artifact in any of the standard sequences.

Imaging acquisition/parameters:

Images were acquired using a 1.5T or 3T scanner. Most patients had T1 post-gadolinium sequences acquired using the MPRAGE protocol with a small subset having only spin echoes. Segmentations of core and necrotic tumor components were performed on T1 post-gadolinium sequences while whole tumor was segmented on FLAIR sequences.

Segmentation protocol:

All segmentations were performed manually on DICOM images in research PACS using a volumetric tool after transfer from clinical PACS.  All segmentations were approved by two neuroradiologists.

Image preprocessing:

The four standard sequences for each patient were exported from research PACS into NifTI format, co-registered to the SRl24 anatomical template, resampled to a uniform isotropic resolution (1 mm3), and skull stripped. Core, necrosis, and whole segmentation masks were exported individually from research PACS into NifTI format, combined into one mask while retaining their positions in native space, and registered to the SRl24 anatomical template before resampling to a uniform isotropic resolution. The sequence and segmentation NifTI files were manually checked and corrected by a neuroradiologist on the ITK-SNAP software.

Clinical and imaging data collection:

Demographic data, including sex, ethnicity, age at diagnosis, smoking pack-year history, and presence of extranodal metastasis were obtained using the electronic medical record (EMR). Survival was calculated by subtracting the date of diagnosis from the date of death or from the date of last EMR note for censored patients. Qualitative imaging features, including infratentorial involvement and intratumoral susceptibility in at least one lesion on SWI sequence, were obtained from visual assessment of images. Quantitative imaging features were extracted from the NifTI segmentation mask for each patient and included total enhancing tumor volume, total necrotic tumor volume, total peritumoral edema volume, ratio of necrotic to enhancing volume, ratio of peritumoral edema to enhancing volume, number of enhancing lesions, number of necrotic lesions, and number of lesions with peritumoral edema.  Finally, origin of metastasis was also obtained from the EMR from previous oncological and/or pathological reports. Dates of birth, diagnosis, death, and last note were all anonymized preserving duration between events. Clinical and imaging feature information are stored in an Excel file. 

Currently, many published brain metastasis segmentation algorithms have only been trained and validated on single institution datasets, leading to poor model generalizability. The availability of public datasets is important for algorithm generalizability and implementation into clinical practice. Our dataset is unique in its inclusion of many sub centimeter brain metastases, for which there are yet no robust algorithms. In addition, the dataset has many lesions with necrotic segmentations, which are historically not provided with dataset segmentations. We also include survival outcomes manually obtained from the EMR, allowing for future correlation of our imaging data with clinical outcomes. In addition, the heterogeneity of sources from which we obtained our patient data will allow for algorithms to be trained on a real-world hospital dataset that is not highly curated, thus enhancing model applicability to clinical practice. 

Data Access

Version 1: Updated

Title Data Type Format Access Points Subjects Studies Series Images License
Images and Segmentations MR, Segmentations NIFTI
Download requires IBM-Aspera-Connect plugin
200 1,000 CC BY 4.0
Clinical data and data dictionary XLSX 200 CC BY 4.0

Additional Resources for this Dataset

The following external resources have been made available by the data submitters.  These are not hosted or supported by TCIA, but may be useful to researchers utilizing this collection.

Citations & Data Usage Policy

Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:

Data Citation

Ramakrishnan, D., Jekel, L., Chadha, S., Janas, A., Moy, H., Maleki, N., Sala, M., Kaur, M., Petersen, G. C., Merkaj, S., von Reppert, M., Baid, U., Bakas, S., Kirsch, C., Davis, M., Bousabarah, K., Holler, W., Lin, M., Westerhoff, M., Aneja, S., Memon, F., Aboian, M. (2023). A Large Open Access Dataset of Brain Metastasis 3D Segmentations on MRI with Clinical and Imaging Feature Information (Version 1) [dataset]. The Cancer Imaging Archive. DOI: https://doi.org/10.7937/6be1-r748

Acknowledgements

We would like to acknowledge the individuals and institutions that have provided data for this collection:

  • Yale Department of Radiology and Biomedical Imaging

  • The SAIC Metadata Content Development Team identified existing or created new caDSR Common Data elements (CDEs) to describe the harmonized components of this dataset. The Metadata Content Development Team is supported by CBIIT under Task Order 140D0421F0008 from NCI.

Other Publications Using this Data

The following publications are recommended by the data submitters that may be useful to researchers utilizing this collection:

  • Ramakrishnan, D., Jekel, L., Chadha, S. et al. A large open access dataset of brain metastasis 3D segmentations on MRI with clinical and imaging information. Sci Data 11, 254 (2024). https://doi.org/10.1038/s41597-024-03021-9
  • Ramakrishnan, D., Jekel, L., Chadha, S., Janas, A., Moy, H., Maleki, N., Sala, M., Kaur, M., Petersen, G. C., Merkaj, S., von Reppert, M., Baid, U., Bakas, S., Kirsch, C., Davis, M., Bousabarah, K., Holler, W., Lin, M., Westerhoff, M., … Aboian, M. S. (2023). A Large Open Access Dataset of Brain Metastasis 3D Segmentations with Clinical and Imaging Feature Information (Version 2). arXiv. https://doi.org/10.48550/ARXIV.2309.05053

TCIA maintains a list of publications which leverage TCIA data.  If you have a manuscript you’d like to add please contact TCIA’s Helpdesk.

TCIA Citation

Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC. https://doi.org/10.1007/s10278-013-9622-7