Skip to main content

VAREPOP-APOLLO

The Cancer Imaging Archive

VAREPOP-APOLLO | VA Research Precision Oncology Program - APOLLO

DOI: 10.7937/ghkn-md15 | Data Citation Required | 490 Views | Image Collection

Location Species Subjects Data Types Cancer Types Size Status Updated
Esophagus, Lung, Pancreas, and Thymus Human 32 CT, MR, PT, Diagnosis Esophageal Carcinoma, Lung Squamous Cell Carcinoma, Lung Adenocarcinoma, Lung Other, Pancreatic adenocarcinoma, Thymoma 75.93GB Public, Ongoing 2024/12/06

Summary

The Research for Precision Oncology Program (RePOP) is a research activity that established a cohort of Veterans diagnosed with cancer and had genomic analyses performed on their tumor tissue as part of the standard of care. All data relevant to a patient’s cancer and cancer care were collected under RePOP, including patient demographics, comorbidities, genomic analysis, treatments, medications, lab values, imaging studies, and outcomes. All RePOP participants signed informed consent and signed HIPAA authorization to have their data stored and shared from RePOP’s Precision Oncology Program Data Repository (PODR).

Cancer Type

Modalities

Subjects

Esophageal Carcinoma

CT, PT

2

Lung Squamous Cell Carcinoma

CT, MR, PT

17

Lung Adenocarcinoma

CT, MR, PT

9

Lung Other

CT, PT

2

Pancreatic Adenocarcinoma

CT, MR, PT

1

Thymoma

CT, MR, PT

1

Data Access

Version 1: Updated 2024/12/06

Title Data Type Format Access Points Subjects Studies Series Images License
Radiology Images CT, MR, PT DICOM
Download requires NBIA Data Retriever
32 174 1,131 137,441 CC BY 4.0
VAREPOP-APOLLO Case Details Diagnosis CSV 32 174 1,131 CC BY 4.0
Related Datasets
No related Analysis Results found: Submit your proposal!
APOLLO-5
Legend: Analysis Results| Collections

Citations & Data Usage Policy

Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:

Data Citation

The Research for Precision Oncology Program and the Applied Proteogenomics Organizational Learning and Outcomes (APOLLO) Research Network. (2024). VA Research Precision Oncology Program – APOLLO (VAREPOP-APOLLO) (Version 1) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/GHKN-MD15

Acknowledgement

The VHA’s Research for Precision Oncology Program in collaboration with the APOLLO program requests that publications using data from this program include the following statement: “Data used in this publication were generated by the Veterans Health Administration’s Research for Precision Oncology Program and the Applied Proteogenomics Organizational Learning and Outcomes (APOLLO) Research Network.”

Detailed Description

 

De-identification of DICOM dates

The resulting DICOM dates are meaningless yet preserve the relative temporal distance between studies for a patient

De-identification of dates uses the DICOM standard “Retain Longitudinal With Modified Dates Option” which allows dates to be retained as long as they are modified from the original date. Date and Date-Time fields in TCIA DICOM image headers are de-identified by normalizing to a base date of January 1, 1975 and then shifted by the number of days between the original Study Date and an “anchor date”.  The anchor date for APOLLO is the Date of Diagnosis.   The choice of ‘1975’ was arbitrary, but it allows one to ensure that the dates in de-identified DICOM files have been properly de-identified as anything not around that year would be suspect.

TCIA Study Date = 01/01/1975 + (Original Study Date – Date of Diagnosis).

For example, if the original Study Date was 03/29/2018 and the Date of Diagnosis was 03/27/2018 then the Days from Diagnosis would be +2 and the TCIA Study Date would become 01/03/1975.

This technique de-identifies the dates while preserving the longitudinal relationship between dates.  Therefore, a researcher won’t know the precise date the scan occurred, but if a follow up scan was performed 120 days later, that same 120 day difference between scans of a subject will exist in the TCIA images.  Dates that occur in DICOM tags other than Date or Date-Time fields are removed. An example of this would be a date entered into the Series Description field.  If the date is associated with a library for Code Meaning then that date is preserved as the date would be required to look up the meaning in the correct version of the library.  To show that the dates have been modified, the term “MODIFIED” is written into DICOM tag (0028,0303) “LongitudinalTemporalInformationModified”.

Original dates will be first normalized to 01 January, 1975 and then offset relative to the date of diagnosis. The CTP code for shifting the StudyDate is shown below:

<e en="T" t="00080020" n="StudyDate"> @dateinterval(StudyDate,diagnosisdate,PatientID,@NORMDATE)</e>

Insertion of computed “Days from Diagnosis” value

The inserted “Days from Diagnosis” value can be compared with similar values in the APOLLO clinical data to understand the clinical context of the imaging study

The number of days the study occurred relative to the date of diagnosis is calculated by the CTP software (using the diagnosis date in the CTP lookup table at the submission site) and automatically stored in the DICOM tag (0012,0052) Longitudinal Temporal Offset from Event with the associated tag (0012,0053) Longitudinal Temporal Event Type set to “Days from Diagnosis”. The days from diagnosis links the imaging data to the clinical data for a given subject. The CTP code for this is:

<e en="T" t="00120052" n="LongitudinalTemporalOffsetfromEvent">@always()@dateinterval(StudyDate,ddate,PatientID)</e>

<e en="T" t="00120053" n="LongitudinalTemporalEventType">@always()@param(@LTET)</e> (where LTET is defined as DIAGNOSIS)

Insertion of “Diagnosis Year”

It is important for cancer researchers to know the timeframe for which the cancer was diagnosed to relate the prescribed cancer treatment or staging to what was available at that time.

In order to relate the treatments that were available at the time of the diagnosis, the year that the primary diagnosis was made is recorded in a CTP owned group 13 private tag as follows.

<e en="T" t="00131051" n="DiagnosisYear">@always()@lookup(PatientID,diagnosisdate)</e>

In a separate stage of the pipeline the diagnosisdate is truncated to be just the year that the diagnosis was made.

<e en="T" t="00131051" n="DiagnosisYear">@truncate(DiagnosisYear,-4)</e>

The approximate StudyYear can be calculated by adding the days from diagnosis in tag LongitudinalTemporalOffsetfromEvent to the DiagnosisYear.

In order to use a normalized date function the private tags must also be de-identified at the site using a CTP script that encapsulates the TCIA Safe Private Tag Knowledge Base. With this approach, only the Safe Private Tags contained within the TCIA Private Tag Knowledge Base and encoded into the CTP script at the time the CTP script was created will be retained. If there are Private Tags that are known to be important but not part of the current Safe tags of the TCIA Private Tag Knowledge Base, then it is up to the submitting site to submit a Private Tag Dictionary of those tags to TCIA for consideration.

The normalized date workflow described above requires that diagnosis date be present and this workflow does not handle the example where there no diagnosis date is present.

Acknowledgements

The Veterans Health Administration's Research for Precision Oncology Program and the Applied Proteogenomics Organizational Learning and Outcomes (APOLLO) Research Network.

Related Publications

Publications by the Dataset Authors

The authors recommended the following as the best source of additional information about this dataset:

Elbers, D. C., Fillmore, N. R., Sung, F. C., Ganas, S. S., Prokhorenkov, A., Meyer, C., Hall, R. B., Ajjarapu, S. J., Chen, D. C., Meng, F., Grossman, R. L., Brophy, M. T., & Do, N. V. (2020). The Veterans Affairs Precision Oncology Data Repository, a Clinical, Genomic, and Imaging Research DatabasePatterns (New York, N.Y.)1(6), 100083. https://doi.org/10.1016/j.patter.2020.100083

Research Community Publications

TCIA maintains a list of publications that leverage our data. At this time, we are not aware of any publications based on this data. If you have a publication you’d like to add, please contact TCIA’s Helpdesk.

TCIA maintains a list of publications that leveraged this dataset. If you have a manuscript you’d like to add please contact TCIA’s Helpdesk.

Additional Publications Related to this Work

Elbers, D. C., Fillmore, N. R., Sung, F. C., Ganas, S. S., Prokhorenkov, A., Meyer, C., Hall, R. B., Ajjarapu, S. J., Chen, D. C., Meng, F., Grossman, R. L., Brophy, M. T., & Do, N. V. (2020). The Veterans Affairs Precision Oncology Data Repository, a Clinical, Genomic, and Imaging Research DatabasePatterns (New York, N.Y.)1(6), 100083. https://doi.org/10.1016/j.patter.2020.100083

Other Publications Using this Data

TCIA maintains a list of publications that leverage our data. At this time, we are not aware of any publications based on this data. If you have a publication you’d like to add, please contact TCIA’s Helpdesk.