DI-Cubed-Reports | SDTM datasets of clinical data and measurements for selected cancer collections to TCIA
DOI: 10.7937/TCIA.2019.zfv154m9 | Data Citation Required | Analysis Result
Cancer Types | Location | Subjects | Description | Updated
---|---|---|---|---
Breast, Glioblastoma | Breast, Brain | 516 | Standardized (SDTM format) conversions of clinical and image analysis data | 2019/06/21
Summary
The Data Integration & Imaging Informatics (DI-Cubed) project explored the lack of standardized data capture at the point of data creation, as reflected in the non-image data accompanying four TCIA breast cancer collections and one brain cancer collection. The breast collections are Multi-center breast DCE-MRI data and segmentations from patients in the I-SPY 1/ACRIN 6657 trials (ISPY1); BREAST-DIAGNOSIS; Single site breast DCE-MRI data and segmentations from patients undergoing neoadjuvant chemotherapy (Breast-MRI-NACT-Pilot); and The Cancer Genome Atlas Breast Invasive Carcinoma Collection (TCGA-BRCA). The brain collection is the Ivy Glioblastoma Atlas Project (IvyGAP). The work addressed the desire for semantic interoperability between NCI initiatives by aligning on common clinical metadata elements and supporting use cases that connect clinical, imaging, and genomics data.

Accordingly, clinical and measurement data imported into I2B2 were cross-mapped to industry-standard concepts for names and values, including those derived from the BRIDG, CDISC SDTM, and DICOM Structured Reporting models, using NCI Thesaurus, SNOMED CT, and LOINC controlled terminology. A subset of the standardized data was then exported from I2B2 as SDTM-compliant SAS transport files. The SDTM data were derived from the curated TCIA spreadsheets and from tumor measurements and dates retrieved through the TCIA RESTful API. Because of the nature of the available data, not all SDTM conformance rules were applicable or adhered to.

These Study Data Tabulation Model (SDTM) datasets were validated using Pinnacle 21 CDISC validation software, which checks datasets for conformance to rules developed for FDA submissions of electronic data. Iterative refinements were made to the datasets based on group discussions and feedback from the validation tool. Export datasets for the following SDTM domains were generated:
Data Access
Version 1: Updated 2019/06/21
Title | Data Type | Format | License
---|---|---|---
SAS Transport Files | Other | XPT | CC BY 3.0
Image Analysis | Other | CSV | CC BY 3.0
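The two formats listed above (XPT and CSV) can both be read into plain Python records. The following is a minimal sketch, not a procedure documented by TCIA or the dataset authors. The helper name `read_records` is hypothetical, and the use of the third-party pandas library for SAS transport files is an assumption. CSV files are read with the standard library only.

```python
import csv
from pathlib import Path


def read_records(path):
    """Return the rows of a CSV or SAS transport (XPT) file as a list of dicts.

    Hypothetical helper for illustration; it is not part of any TCIA tooling.
    """
    path = Path(path)
    suffix = path.suffix.lower()

    if suffix == ".csv":
        # Standard-library CSV reader; every value is returned as a string.
        with path.open(newline="", encoding="utf-8") as handle:
            return list(csv.DictReader(handle))

    if suffix == ".xpt":
        # Assumption: pandas is installed. It is imported here so that
        # CSV-only use does not require it.
        import pandas as pd

        # format="xport" selects the SAS transport reader; encoding decodes
        # character columns from bytes to str.
        frame = pd.read_sas(path, format="xport", encoding="utf-8")
        return frame.to_dict(orient="records")

    raise ValueError(f"Unsupported file type: {path.suffix!r}")
```

A call such as `read_records("dm.xpt")` would then return one dictionary per row, keyed by variable name. The file name `dm.xpt` is a placeholder; the actual names of the downloaded files are not given on this page.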
Citations & Data Usage Policy
Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:
Data Citation
Hickman H., Ver Hoef W., Hastak S., Neville J., Clunie D., Wagner U., Helton E. (2019). SDTM datasets of clinical data and measurements for selected cancer collections to TCIA [Dataset]. The Cancer Imaging Archive. doi: 10.7937/TCIA.2019.zfv154m9
Detailed Description
TCIA breast cancer collections used:
- Multi-center breast DCE-MRI data and segmentations from patients in the I-SPY 1/ACRIN 6657 trials (ISPY1)
- BREAST-DIAGNOSIS
- Single site breast DCE-MRI data and segmentations from patients undergoing neoadjuvant chemotherapy (Breast-MRI-NACT-Pilot)
- The Cancer Genome Atlas Breast Invasive Carcinoma Collection (TCGA-BRCA)
TCIA brain cancer collection used:
- Ivy Glioblastoma Atlas Project (IvyGAP)
Related Publications
Publications by the Dataset Authors
The authors recommended the following as the best source of additional information about this dataset:
Publication Citation
Clunie, D., Hickman, H., Ver Hoef, W., Hastak, S., Evans, J., Neville, J., & Wagner, U. (2020). Observations from the Data Integration and Imaging Informatics (DI-Cubed) Project. MDPI AG. https://doi.org/10.20944/preprints202008.0474.v1
Research Community Publications
TCIA maintains a list of publications that leveraged this dataset. If you have a manuscript you’d like to add, please contact TCIA’s Helpdesk.