CPTAC-HNSCC | The Clinical Proteomic Tumor Analysis Consortium Head and Neck Squamous Cell Carcinoma Collection
DOI: 10.7937/K9/TCIA.2018.UW45NH81 | Data Citation Required | 532 Views | 1 Citations | Image Collection
Location | Species | Subjects | Data Types | Cancer Types | Size | Status | Updated | |
---|---|---|---|---|---|---|---|---|
Head-Neck | Human | 137 | CT, PT, MR, RTSTRUCT, Histopathology | Head and Neck Cancer | Clinical, Genomics, Proteomics, Microarray | Limited, Ongoing | 2024/11/18 |
Summary
This collection contains subjects from the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium Head-and-Neck cancer (CPTAC-HNSCC) cohort. CPTAC is a national effort to accelerate the understanding of the molecular basis of cancer through the application of large-scale proteome and genome analysis, or proteogenomics. Radiology and pathology images from CPTAC patients are being collected and made publicly available by The Cancer Imaging Archive to enable researchers to investigate cancer phenotypes which may correlate to corresponding proteomic, genomic and clinical data. Imaging from each cancer type will be contained in its own TCIA Collection, with the collection name "CPTAC-cancertype". Radiology imaging is collected from standard of care imaging performed on patients immediately before the pathological diagnosis, and from follow-up scans where available. For this reason the radiology image data sets are heterogeneous in terms of scanner modalities, manufacturers and acquisition protocols. Pathology imaging is collected as part of the CPTAC qualification workflow. All CPTAC cohorts are released as either a single combined cohort, or split into Discovery and Confirmatory where applicable. There are two main types of proteomic studies: discovery proteomics and targeted proteomics. The term "discovery proteomics" is in reference to "untargeted" identification and quantification of a maximal number of proteins in a biological or clinical sample. The term “targeted proteomics” refers to quantitative measurements on a defined subset of total proteins in a biological or clinical sample, often following the completion of discovery proteomics studies to confirm interesting targets selected. Commonly used proteomic technologies and platforms are different types of mass spectrometry and protein microarrays depending on the needs, throughput and sample input requirement of an analysis, with further development on nanotechnologies and automation in the pipeline in order to improve the detection of low abundance proteins, increase throughput, and selectively reach a target protein in vivo. Once the protein targets of interest are identified, high-throughput targeted assays are developed for confirmatory studies: tests to affirm that the initial tests were accurate. A summary of CPTAC imaging efforts can be found on the CPTAC Imaging Proteomics page. You can join the CPTAC Imaging Special Interest Group to be notified of webinars & data releases, collaborate on common data wrangling tasks and seek out partners to explore research hypotheses! Artifacts from previous webinars such as slide decks and video recordings can be found on the CPTAC SIG Webinars page.CPTAC Imaging Special Interest Group
Data Access
Some data in this collection contains images that could potentially be used to reconstruct a human face. To safeguard the privacy of participants, users must sign and submit a TCIA Restricted License Agreement to help@cancerimagingarchive.net before accessing the data.
Version 17: Updated 2024/11/18
4 radiology cases added
Title | Data Type | Format | Access Points | Subjects | License | |||
---|---|---|---|---|---|---|---|---|
Images | CT, PT, MR, RTSTRUCT | DICOM | Download requires NBIA Data Retriever |
137 | 389 | 3,348 | 385,015 | TCIA Restricted |
Tissue Slide Images | Histopathology | SVS | Download requires IBM-Aspera-Connect plugin |
112 | 390 | CC BY 3.0 |
Additional Resources for this Dataset
The NCI Cancer Research Data Commons (CRDC) provides access to additional data and a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and visualize cancer research data.
- Imaging Data Commons (IDC) (Imaging Data)
- Proteomic Data Commons (PDC) (Proteomic & Clinical Data)
- Genomic Data Commons (GDC) (Genomic & Clinical Data)
Citations & Data Usage Policy
Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:
Data Citation |
|
National Cancer Institute Clinical Proteomic Tumor Analysis Consortium (CPTAC). (2018). The Clinical Proteomic Tumor Analysis Consortium Head and Neck Squamous Cell Carcinoma Collection (CPTAC-HNSCC) (Version 17) [Dataset]. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2018.UW45NH81 |
Acknowledgement |
|
The CPTAC program requests that publications using data from this program include the following statement: “Data used in this publication were generated by the National Cancer Institute Clinical Proteomic Tumor Analysis Consortium (CPTAC).” |
Detailed Description
Accessing CPTAC publication cohorts
All CPTAC cohorts are released as either a single combined cohort, or split into Discovery and Confirmatory where applicable. In the case of CPTAC-HNSCC there was a “Discovery Cohort” release. Images associated with these cases can be downloaded using the following links:
Accessing the Proteomic & Genomic Clinical Data
To access/download the clinical data on the Proteomic Data Commons (PDC) and Genomic Data Commons (GDC), once you have identified the data of your interest, move to the ‘Clinical’ tab on the browse page. Select the checkbox to select a specific row, all rows on the page or all pages and click the export clinical manifest button in CSV or TSV format on the GDC, or TSV or JSON format on the PDC.
A Note about TCIA and CPTAC Subject Identifiers and Dates
Subject Identifiers:
A subject with radiology and pathology images stored in TCIA is identified with a de-identified project Patient ID that is identical to the Patient ID of the same subject with clinical, proteomic, and/or genomic data stored in other CPTAC databases and web sites.
Dates:
The radiology imaging data is in DICOM format. To provide temporal context information aligned with events in the clinical data set for each patient, TCIA has inserted information in DICOM tag (0012,0050) Clinical Trial Time Point ID. This DICOM tag contains the number of days from the date the patient was initially diagnosed pathologically with the disease to the date of the scan. E.g. a scan acquired 3 days before the diagnosis would contain the value -3. A follow up scan acquired 90 days after diagnosis would contain the value 90.
The DICOM date tags (be they birth dates, imaging study dates, etc.) are modified per TCIA’s standard process which offsets them by a random number of days. The offset is a number of days between 3 and 10 years prior to the real date that is consistent for each TCIA image-submitting site and collection, but that varies among sites and among collections from the same site. Thus, the number of days between a subject’s longitudinal imaging studies are accurately preserved when more than one study has been archived while still meeting HIPAA requirements.
Acknowledgements
We would like to acknowledge the individuals and institutions that have provided data for this collection:
- International Institute for Molecular Oncology, Poznań, Poland - Special thanks to Maciej Wiznerowicz MD, PhD and Jan Lubiński MD PhD; Heliodor Święcicki Clinical Hospital, Poznań, Poland; Witold Szyfter MD, PhD, Malgorzata Wierzbicka MD, PhD., Ewelina Kalinowicz MD and Joanna Napierała
- BioPartners, CA - Special thanks to Alexander Gasparian, PhD. from the Department of Drug Discovery and Biomedical Sciences, University of South Carolina College of Pharmacy, Kakhaber Zaalishvili, MD Medical Advisor and Staff Pathologist at BioPartners, LLC, Milla Gorodnia, President of BioPartners, Inc., Victoria Christensen, Global Business Development/Project Coordination Manager, Oksana Havryliuk, MD. Chief of Research department of radiodiagnostics of NCI (Ukraine), Marianna Gredil’, Director of BioPartners, LLC, and Anna Legenka Chief of the Data Department at BioPartners, LLC.
- Beaumont Health System, Royal Oak, MI - Special thanks to George D. Wilson, PhD from the Department of Radiation Oncology Research, Barbara Pruetz of the Biobank, Debra Kapczynski, MHSA, CIIP, RT(R)(CT) and Rachel Deyer from the Department of Diagnostic Radiology.
- Boston Medical Center, Boston, MA - Special thanks to Chris D. Andry M.Phil, PhD from the Department of Pathology and Laboratory Medicine, Margaret Lavoye, Artem Kaliaev, Wilson Chavez, Stephan Anderson, Jorge Soto, and Mitchell Horn from the Department of Radiology, Elizabeth Duffy, MA and Cheryl Spencer, MA of the Biobank.
- University of Calgary, Alberta, Canada - Special thanks to Oliver Bathe, MD, FRCS(C) from the Departments of Surgery/Oncology, Marina Salluzzi, PhD and Nicole Blenkin from the Department of Radiology, Calgary Image Processing and Analysis Centre (CIPAC), and Jennifer Koziak and Elleine Allapitan from the Department of Surgery.
- Washington University in St. Louis, Saint Louis, MO - Special thanks to Sidharth V Puram, MD, PhD, Fudong Wang, PhD, Thomas Barrett, MD, Salma Ramadan, MD, and Sophie Gerndt, MD from the Department of Otolaryngology-Head and Neck Surgery.
- University of Kansas Medical Center – Special thanks to Andrew K. Godwin, PhD and Rashna Madan, MBBS from the Department of Pathology and Laboratory Medicine and Monica Mays, Lauren DiMartino, and Alex Webster from the University of Kansas Cancer Center.
- University of Oklahoma– Special thanks to Kar-Ming Fung, MD, PhD, Director of Anatomic Pathology, David Brown, Assistant Director of Clinical Trials Office, Biospecimen Core Lab, Shawnton Long, Manager of Clinical Trials Office, Tissue Lab, Biospecimen Core Lab, and Myria Reed, RHIA, Project Manager of Clinical Trials Office, Biospecimen Core Lab.
- H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL - Special thanks to John Koomen, Ph.D. from the Department of Molecular Oncology, and Edward R. Seijo, MS and Olya Stringfield, PhD.
Related Publications
Publications by the Dataset Authors
The authors recommended the following as the best source of additional information about this dataset:
No other publications were recommended by dataset authors.
Research Community Publications
TCIA maintains a list of publications that leveraged this dataset. If you have a manuscript you’d like to add please contact TCIA’s Helpdesk.
Previous Versions
Version 16: Updated 2024/04/05
Added 11 caseID to radiology
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | CT, MR, PT, RTSTRUCT | DICOM | Download requires NBIA Data Retriever |
133 | 367 | 3,200 | 364,960 | TCIA Restricted |
Tissue Slide Images | Histopathology | SVS | Download requires IBM-Aspera-Connect plugin |
112 | 390 | CC BY 3.0 |
Version 15: Updated 2024/01/11
Added 9 ID to radiology download:C3L-05553, C3L-05511, C3L-05514, C3L-10001, C3L-10461, C3L-10463, C3L-10464, C3L-10466, C3L-10467.
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | CT, MR, PT | DICOM | Download requires NBIA Data Retriever |
122 | 341 | 2,293 | 330,721 | TCIA Restricted |
Tissue Slide Images | Histopathology | SVS | Download requires IBM-Aspera-Connect plugin |
112 | 390 | CC BY 3.0 |
Version 14: Updated 2023/10/05
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | RTSTRUCT, CT, MR, PT | DICOM | Download requires NBIA Data Retriever |
113 | 312 | 2,072 | 291,868 | TCIA Restricted |
Tissue Slide Images | Histopathology | SVS | Download requires IBM-Aspera-Connect plugin |
112 | 390 | CC BY 3.0 |
Version 13: Updated 2023/08/18
Added DICOM imaging.
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
TCIA Restricted | |||||
Tissue Slide Images | SVS | Download requires IBM-Aspera-Connect plugin |
CC BY 3.0 |
Version 12: Updated 2023/02/24
Radiology modality data cleanup to remove extraneous scans.
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | SVS | Download requires IBM-Aspera-Connect plugin |
Version 11: Updated 2023/02/16
added radiology cases from 4 sites
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | SVS | Download requires IBM-Aspera-Connect plugin |
Version 10: Updated 2020/09/03
Changed to new Aspera download link for histopathology slides.
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | SVS | |||||||
Proteomics | WEB |
Version 9: Updated 2020/06/23
Added imaging for 3 radiological imaging participants.
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | SVS | |||||||
Proteomics | WEB |
Version 8: Updated 2020/03/31
Added 8 new radiology subjects.
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | SVS | |||||||
Proteomics | WEB |
Version 7: Updated 2019/12/19
Added 20 new radiology subjects.
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | SVS | |||||||
Proteomics | WEB |
Version 6: Updated 2019/09/30
added new subjects.
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | SVS | |||||||
Proteomics | WEB |
Version 5: Updated 2019/06/30
Added Subjects
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | SVS | |||||||
Proteomics | WEB |
Version 4: Updated 2019/03/31
Added new subjects
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | WEB | |||||||
Proteomics | WEB |
Version 3: Updated 2018/10/24
Added new subjects
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | WEB | |||||||
Proteomics | WEB |
Version 2: Updated 2018/06/30
Added new subjects
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | WEB | |||||||
Proteomics | WEB |
Version 1: Updated 2018/04/25
Title | Data Type | Format | Access Points | License | ||||
---|---|---|---|---|---|---|---|---|
Images | DICOM | Download requires NBIA Data Retriever |
||||||
Tissue Slide Images | WEB | |||||||
Proteomics | WEB |