NLST | National Lung Screening Trial
DOI: 10.7937/TCIA.HMQ8-J677 | Data Citation Required | Image Collection
Location | Species | Subjects | Data Types | Cancer Types | Size | Status | Updated | |
---|---|---|---|---|---|---|---|---|
Chest | Human | 26,254 | CT, Other, Histopathology, Demographic, Diagnosis, Exposure, Measurement, Follow-Up | Lung Cancer | Clinical | Public, Complete | 2021/09/24 |
Summary
Background: The aggressive and heterogeneous nature of lung cancer has thwarted efforts to reduce mortality from this cancer through the use of screening. The advent of low-dose helical computed tomography (CT) altered the landscape of lung-cancer screening, with studies indicating that low-dose CT detects many tumors at early stages. The National Lung Screening Trial (NLST) was conducted to determine whether screening with low-dose CT could reduce mortality from lung cancer. Methods: From August 2002 through April 2004, we enrolled 53,454 persons at high risk for lung cancer at 33 U.S. medical centers. Participants were randomly assigned to undergo three annual screenings with either low-dose CT (26,722 participants) or single-view posteroanterior chest radiography (26,732). Data were collected on cases of lung cancer and deaths from lung cancer that occurred through December 31, 2009. This dataset includes the low-dose CT scans from 26,254 of these subjects, as well as digitized histopathology images from 451 subjects. Results: The rate of adherence to screening was more than 90%. The rate of positive screening tests was 24.2% with low-dose CT and 6.9% with radiography over all three rounds. A total of 96.4% of the positive screening results in the low-dose CT group and 94.5% in the radiography group were false positive results. The incidence of lung cancer was 645 cases per 100,000 person-years (1060 cancers) in the low-dose CT group, as compared with 572 cases per 100,000 person-years (941 cancers) in the radiography group (rate ratio, 1.13; 95% confidence interval [CI], 1.03 to 1.23). There were 247 deaths from lung cancer per 100,000 person-years in the low-dose CT group and 309 deaths per 100,000 person-years in the radiography group, representing a relative reduction in mortality from lung cancer with low-dose CT screening of 20.0% (95% CI, 6.8 to 26.7; P=0.004). The rate of death from any cause was reduced in the low-dose CT group, as compared with the radiography group, by 6.7% (95% CI, 1.2 to 13.6; P=0.02). Conclusions: Screening with the use of low-dose CT reduces mortality from lung cancer. (Funded by the National Cancer Institute; National Lung Screening Trial ClinicalTrials.gov number, NCT00047385). Data Availability: A summary of the National Lung Screening Trial and its available datasets are provided on the Cancer Data Access System (CDAS). CDAS is maintained by Information Management System (IMS), contracted by the National Cancer Institute (NCI) as keepers and statistical analyzers of the NLST trial data. The full clinical data set from NLST is available through CDAS. Users of TCIA can download without restriction a publicly distributable subset of that clinical data, along with the CT and Histopathology images collected during the trial. (These previously were restricted.)
Data Access
Version 3: Updated 2021/09/24
Data embargo of limited access is lifted September 2021, with the addition of downloadable pathology slide data and clinical data spreadsheet & dictionaries.
Title | Data Type | Format | Access Points | Subjects | License | |||
---|---|---|---|---|---|---|---|---|
Radiology CT Images | CT | DICOM | Download requires NBIA Data Retriever |
26,254 | 73,116 | 203,099 | 21,082,265 | CC BY 4.0 |
Tissue Slide Images - Primary Tumor | Histopathology | SVS | Download requires IBM-Aspera-Connect plugin |
451 | 1,225 | CC BY 4.0 | ||
Clinical data including data dictionaries | Demographic, Diagnosis, Exposure, Measurement, Follow-Up | SAS, ZIP, and DOC | CC BY 4.0 | |||||
Additional histopathology slide images Table 1 for which the participants have no Baseline Questionnaire data | Other | DOCX | 2 | CC BY 4.0 | ||||
Histopathology additional slide images for which the participants have no Baseline Questionnaire data | Histopathology | SVS | Download requires IBM-Aspera-Connect plugin |
2 | 4 | CC BY 4.0 | ||
Additional histopathology slide images Table 2 for participants with Second Primary Tumors as well as those included in the standard package | Other | DOCX | 10 | 23 | CC BY 4.0 | |||
Histopathology additional slide images for participants with Second Primary Tumors as well as those included in the standard package | Histopathology | SVS | Download requires IBM-Aspera-Connect plugin |
10 | 23 | CC BY 4.0 |
Additional Resources for this Dataset
The NCI Cancer Research Data Commons (CRDC) provides access to additional data and a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and visualize cancer research data.
- Imaging Data Commons (IDC) (Imaging Data)
- IDC Zenodo community datasets:
The following external resources have been made available by the data submitters. These are not hosted or supported by TCIA, but may be useful to the researchers utilizing this collection
- Clinical data – TCIA hosts a subset of the full clinical data. If you need the full clinical data, please visit the Cancer Data Access System (CDAS) system.
- Data Science Bowl 2017 – A significant amount of information (code and discussion threads) were generated about NLST in connection with this challenge competition.
- ClinicalTrials.gov entry about the Trial NCT00047385, “National Lung Screening Trial A Randomized Trial Comparing Low-dose Helical CT With Chest Xray for Lung Cancer”
Citations & Data Usage Policy
Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:
Data Citation |
|
National Lung Screening Trial Research Team. (2013). Data from the National Lung Screening Trial (NLST) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/TCIA.HMQ8-J677 |
Detailed Description
The full CT data (manifest-NLST_allCT.tcia) occupy 11.3 terabytes when downloaded. For convenience, you can use the “Search” feature to filter for subsets and download in chunks.
The pathology slide data:
- Primary Tumor slides (faspex) Primary Tumor slides (the standard package), 1225 files.
- Additional slides (faspex) Additional histopathology slide images for which the participants have no Baseline Questionnaire data (4 slides) Detail in Table 1.
- Second Primary-Tumor slides (faspex) Additional histopathology slide images for participants with Second Primary Tumors as well as those included in the “standard” package (23 slides) Detail in Table 2.
NLST Design & Process, Protocol Documents, and Results: https://cdas.cancer.gov/learn/nlst/main-findings/
- Overview, study design, recruitment methods, endpoint verification process (determining cause of death), quality control (LSS and ACRIN).
- LSS Manual of Operations (MOOP) and ACRIN Protocol.
- DSMB announcement of results (10/28/2010).
- Feasibility Study for the NLST, Psychosocial and Behavioral Issues, Technical Publications. https://cdas.cancer.gov/learn/nlst/main-findings/
- Browse Publications : https://cdas.cancer.gov/publications/?study=nlst
- Browse projects : https://cdas.cancer.gov/approved-projects/?data_types=nlst_data
NLST Data Collected: https://biometry.nci.nih.gov/cdas/learn/nlst/data-collected/
- Questionnaires, screening, diagnostic procedures, cancer diagnosis, treatment, progression, mortality, contamination.
- Further detail about the CT can be found here
Biospecimens Collected
Formalin-fixed paraffin embedded (FFPE) tissue specimens are available for a subset of the NLST participants who developed lung cancer during the trial. Donor blocks were obtained from local pathology laboratories and tissue cores (0.6mm) were extracted from them to construct tissue microarrays (TMA). Tissue cores were sampled from primary main invasive tumor histology, secondary tumor histology, carcinoma in situ, adjacent normal lung tissue, metastatic lesion from lymph node(s) and/or distant sites, benign (un-involved) lymph node, proximal and/or distal bronchi.
In total, tissue materials were collected from 438 lung cancer cases. All have cores arrayed across nine TMAs, one of which only contains tissue collected after neoadjuvant treatment. 434 of these also have loose cores available for nucleic acid extraction. On average, each TMA contains 504 cores from 48 subjects.
Applications for access to these specimens can be submitted under the PLCO Etiologic and Early Marker Studies Program (EEMS). The application review process opens twice a year, once in the winter and once in the summer. For more information about EEMS and to initiate an application visit the PLCO EEMS Application page. When filling out the application, specify “NLST Tissue” under the case definition.
Related Publications
Publications by the Dataset Authors
The authors recommended the following as the best source of additional information about this dataset:
Publication Citation |
|
National Lung Screening Trial Research Team; Aberle DR, Adams AM, Berg CD, Black WC, Clapp JD, Fagerstrom RM, Gareen IF, Gatsonis C, Marcus PM, Sicks JD (2011). Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening. New England Journal of Medicine, 365(5), 395–409. https://doi.org/10.1056/nejmoa1102873 |
No other publications were recommended by dataset authors.
Research Community Publications
TCIA maintains a list of publications that leveraged this dataset. If you have a manuscript you’d like to add please contact TCIA’s Helpdesk.
Previous Versions
Version 2: Updated 2015/12/14
Change: restoration of images that had become corrupted/missing during a storage transfer.