Skip to main content


CALGB50303-Tumor-Annotations | Annotations for Rituximab and Combination Chemotherapy in Treating Patients With Diffuse Large B-Cell Non-Hodgkin's Lymphoma

DOI: 10.7937/9JER-G980 | Data Citation Required | Analysis Result

Cancer Types Location Subjects Related Collections Size Supporting Data Updated
Lymphoma Various 155 59.52MB Tumor segmentations, Seed points 2023/03/30


This dataset contains image annotations derived from the NCI Clinical Trial “Rituximab and Combination Chemotherapy in Treating Patients With Diffuse Large B-Cell Non-Hodgkin’s Lymphoma (CALGB50303)”.  This dataset was generated as part of an NCI project to augment TCIA datasets with annotations that will improve their value for cancer researchers and AI developers.

Annotation Protocol

For each patient, every DICOM Study and DICOM Series was reviewed to identify and annotate the clinically relevant time points and sequences. In a typical patient all available time points were annotated.

In a typical patient the following annotation rules were followed:

a.    PERCIST criteria was followed. Specifically, the lesions estimated to have the most elevated SUVmax were annotated.
b.    Lesions were annotated in the axial plane. If no axial plane were available, lesions were annotated in the coronal plane.
c.    Lesions were annotated on the attenuation corrected PET series as well as the diagnostic contrast-enhanced CT. If no diagnostic contrast-enhanced CT was available for that timepoint, then the non-contrast CT portion of the PET/CT was annotated.
d.    A maximum of 5 lesions were annotated per patient scan (timepoint); no more than 2 per organ. For the purposes of this project, the lymph nodes constitute 1 organ, while other lymphatic structures such as the spleen, salivary glands, and Waldeyer’s ring structures constitute separate organs.  The same 5 lesions were annotated at each time point.  RECIST 1.1 principles were followed. Specifically, lymph nodes were annotated if > 1.5 cm in short axis. Other lesions were annotated if > 1 cm.
e.    Lesions were labeled separately.
f.    Seed points were automatically generated and reviewed by a radiologist.

At each time point:

  1. A seed point (kernel) was created for each segmented structure. The seed points for each segmentation are provided in a separate DICOM RTSS file.
  2. SNOMED-CT “Anatomic Region Sequence” and “Segmented Property Category Code Sequence” and codes were inserted for all segmented structures.
  3. “Tracking ID” and “Tracking UID” tags were inserted for each segmented structure to enable longitudinal lesion tracking.
  4. Imaging time point codes were inserted to help identify each annotation in the context of the clinical trial assessment protocol.
    1. “Clinical Trial Time Point ID” was used to encode time point type using one of the following strings as applicable: “pre-dose” or “post-chemotherapy”
    2. Content Item in “Acquisition Context Sequence” was added containing “Time Point Type” using Concept Code Sequence (0040,A168) selected from:
      1. (255235001, SCT, “Pre-dose”)
      2. (262502001, SCT, “Post-chemotherapy”)

Important supplementary information and sample code

  1. A spreadsheet containing key details about the annotations is available in the Data Access section below.
  2. A Jupyter notebook demonstrating how to use the NBIA Data Retriever Command-Line Interface application and the REST API (with authentication) to access these data can be found in the Additional Resources section below.

Data Access

Version 1: Updated 2023/03/30

Title Data Type Format Access Points Subjects Studies Series Images License
CALGB50303 Annotations - Segmentations, Seed Points, and Negative Findings Assessments RTSTRUCT DICOM
Download requires NBIA Data Retriever
155 519 3,077 3,077
CALGB50303 Annotation Metadata CSV CC BY 4.0

Collections Used In This Analysis Result

Title Data Type Format Access Points Subjects Studies Series Images License
Original CALGB50303 Images used to create Segmentations and Seed Points CT, PT DICOM 152 448 661 204,738
Original CALGB50303 Images used to create Negative Assessment reports CT, PT DICOM 89 183 209 59,572

Additional Resources For This Dataset

Citations & Data Usage Policy

Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:

Data Citation

Rozenfeld, M., & Jordan, P. (2023). Annotations for Rituximab and Combination Chemotherapy in Treating Patients With Diffuse Large B-Cell Non-Hodgkin’s Lymphoma (CALGB50303-Tumor-Annotations) [Data set]. The Cancer Imaging Archive.

Publications Using This Data

TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you’d like to add please contact TCIA’s Helpdesk.

TCIA Citation

Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC.

Collections Used In This Analysis Result

Related Collections