Key PointsQuestion
Can molecular profiling of immune-related genes be used to estimate prognosis in early-stage nonsquamous non–small cell lung cancer?
Findings
In this multiple-cohort study that analyzed frozen tumor tissue samples from 2414 patients, an immune signature of 25 gene pairs significantly stratified patients into low- vs high-risk groups for overall survival across and within subpopulations at various tumor stages (I, IA, IB, or II).
Meaning
Dysregulated immune contexture may contribute to the survival differences among patients with nonsquamous non–small cell lung cancer.
Importance
The prevalence of early-stage non–small cell lung cancer (NSCLC) is expected to increase with recent implementation of annual screening programs. Reliable prognostic biomarkers are needed to identify patients at a high risk for recurrence to guide adjuvant therapy.
Objective
To develop a robust, individualized immune signature that can estimate prognosis in patients with early-stage nonsquamous NSCLC.
Design, Setting, and Participants
This retrospective study analyzed the gene expression profiles of frozen tumor tissue samples from 19 public NSCLC cohorts, including 18 microarray data sets and 1 RNA-Seq data set for The Cancer Genome Atlas (TCGA) lung adenocarcinoma cohort. Only patients with nonsquamous NSCLC with clinical annotation were included. Samples were from 2414 patients with nonsquamous NSCLC, divided into a meta-training cohort (729 patients), meta-testing cohort (716 patients), and 3 independent validation cohorts (439, 323, and 207 patients). All patients underwent surgery with a negative surgical margin, received no adjuvant or neoadjuvant therapy, and had publicly available gene expression data and survival information. Data were collected from July 22 through September 8, 2016.
Main Outcomes and Measures
Overall survival.
Results
Of 2414 patients (1205 men [50%], 1111 women [46%], and 98 of unknown sex [4%]; median age [range], 64 [15-90] years), a prognostic immune signature of 25 gene pairs consisting of 40 unique genes was constructed using the meta-training data set. In the meta-testing and validation cohorts, the immune signature significantly stratified patients into high- vs low-risk groups in terms of overall survival across and within subpopulations with stage I, IA, IB, or II disease and remained as an independent prognostic factor in multivariate analyses (hazard ratio range, 1.72 [95% CI, 1.26-2.33; P < .001] to 2.36 [95% CI, 1.47-3.79; P < .001]) after adjusting for clinical and pathologic factors. Several biological processes, including chemotaxis, were enriched among genes in the immune signature. The percentage of neutrophil infiltration (5.6% vs 1.8%) and necrosis (4.6% vs 1.5%) was significantly higher in the high-risk immune group compared with the low-risk groups in TCGA data set (P < .003). The immune signature achieved a higher accuracy (mean concordance index [C-index], 0.64) than 2 commercialized multigene signatures (mean C-index, 0.53 and 0.61) for estimation of survival in comparable validation cohorts. When integrated with clinical characteristics such as age and stage, the composite clinical and immune signature showed improved prognostic accuracy in all validation data sets relative to molecular signatures alone (mean C-index, 0.70 vs 0.63) and another commercialized clinical-molecular signature (mean C-index, 0.68 vs 0.65).
Conclusions and Relevance
The proposed clinical-immune signature is a promising biomarker for estimating overall survival in nonsquamous NSCLC, including early-stage disease. Prospective studies are needed to test the clinical utility of the biomarker in individualized management of nonsquamous NSCLC.
Non–small cell lung cancer (NSCLC) accounts for approximately 85% of lung cancer, the leading cause of death from cancer in the United States and worldwide.1 Biomarkers that can reliably estimate disease prognosis and patient survival would have tremendous value in guiding the management of lung cancer.2 For example, most patients with stage I NSCLC currently do not receive adjuvant systemic treatment after local therapy because several large randomized studies3-5 have failed to show a survival benefit among unselected patients, for whom the toxic effects associated with chemotherapy may outweigh the potential benefit for many patients. Thus, identification of the subset of patients at highest risk for recurrence and death who have the greatest need for additional systemic therapy is needed.
A number of studies have proposed gene expression–based signatures for survival stratification in patients with NSCLC.6-10 Unfortunately, none has been incorporated into routine clinical practice owing to issues such as overfitting on small discovery data sets and lack of sufficient validation.11 The availability of public, large-scale gene expression data sets brings the opportunity to identify potentially more reliable lung cancer biomarkers.8,12,13 To use all this information effectively, however, the diversity of data also represents a daunting challenge. Traditional approaches using gene expression levels require appropriate normalization, which is a difficult task given the potential biological heterogeneity among data sets and technical biases across measurement platforms.14 Instead, methods based on the relative ranking of gene expression levels eliminate the requirement for data preprocessing, such as scaling and normalization, and have been shown to produce robust results in various applications, including cancer classification.15-17
Various components of the immune system have been shown to be a determining factor during cancer initiation and progression.18,19 Evading immune destruction has been recognized as an emerging hallmark of cancer.20-23 Recent immunotherapies targeting specific immune checkpoints such as programmed death 1 or programmed death ligand 1 have demonstrated a remarkable, durable response in NSCLC.24,25 Certain histopathologic patterns, such as intratumoral infiltration by cytotoxic lymphocytes, have also been associated with better prognoses in several cancer types, including NSCLC.26-30 However, the molecular characteristics describing tumor-immune interaction remain to be comprehensively explored regarding their prognostic potential in NSCLC.31-34
In this study, we combined multiple gene expression data sets to develop and validate an individualized prognostic signature for nonsquamous NSCLC based on immune-related gene pairs (IRGPs). To leverage the complementary value of molecular and clinical characteristics, we integrated the immune signature with clinical factors to build a composite prognostic index, which allowed improved estimation of nonsquamous NSCLC prognosis.
Study Design and Patients
We retrospectively analyzed the gene expression profiles of frozen tumor tissue samples from 19 public NSCLC cohorts, including 18 microarray data sets and 1 RNA-Seq data set for The Cancer Genome Atlas (TCGA) lung adenocarcinoma cohort. Only patients with nonsquamous NSCLC with clinical annotation were included.35,36 We excluded patients who had a positive surgical margin or had received neoadjuvant therapy, adjuvant chemotherapy, or other pharmaceutical therapy owing to immune-modulating effects of some therapeutics.37 Overlapped patients (n = 19) between the Director’s Challenge Consortium (DCC) and a data set in Gene Expression Omnibus (GSE14814) were removed from the DCC data set. Overall, we included 2414 patients in our study (eTable 1 in the Supplement). Details about sample preparation and RNA measurement can be found elsewhere.7,8,38-52 Preprocessing of gene expression profiles can be found in the eMethods in the Supplement. This study of deidentified data was approved by the institutional review board of Stanford University, Palo Alto, California.
Data were collected from July 22 to September 8, 2016. eFigure 1 in the Supplement shows the overall study design. We selected the 3 largest individual data sets for independent validation, namely, TCGA lung adenocarcinoma (TCGA), DCC, and GSE30219. The remaining 16 microarray data sets were merged into 1 meta–data set, which was randomly split approximately in half into meta-training and meta-testing data sets. Clinical and pathologic characteristics of patients in each data set are shown in eTable 2 in the Supplement.
Construction of an Individualized Prognostic Signature Based on IRGPs
We constructed a prognostic signature by focusing on immune-related genes (IRGs), which were downloaded from the ImmPort database (https://immport.niaid.nih.gov).53 A variety of IRGs were included, such as cytokines, cytokine receptors, and genes related to the T-cell receptor signaling pathway, B-cell antigen receptor signaling pathway, natural killer cell cytotoxicity, and antigen processing and presentation pathways. Immune-related genes measured by all platforms were selected. The gene expression level in a specific sample or profile underwent pairwise comparison to generate a score for each IRGP. An IRGP score of 1 was assigned if IRG 1 was less than IRG 2; otherwise the IRGP score was 0. This gene pair–based approach has an important advantage because the score is calculated based entirely on the gene expression profile of a tumor sample and can be used in an individualized manner without the need for normalization. Some IRGPs with constant values (0 or 1) in a particular platform or data set may be attributable to (1) platform-dependent preferential measurement, which can cause biases and may not be reproducible across platforms, and (2) biologically preferential transcription, which does not provide discriminative information about patient survivial.54 Therefore, we removed IRGPs with constant values in any individual data set of the meta-data set.
Prognostic IRGPs were selected using the log-rank test to assess the association between each IRGP and patients’ overall survival in the meta-training data set. Prognostic IRGPs with a familywise error rate less than 0.05 were candidates to build the IRGP index (IRGPI). To minimize the risk of overfitting, we applied a Cox proportional hazards regression model combined with the least absolute shrinkage and selection operator (glmnet, version 2.0-5).55 The penalty parameter was estimated by 10-fold cross-validation in the meta-training data set at 1 SE beyond the minimum partial likelihood deviance.55
To separate patients into low- or high-risk groups, the optimal IRGPI cutoff was determined by a time-dependent receiver operating characteristic (ROC) curve (survivalROC, version 1.0.3)56 at 5 years in the meta-training data set. We used the nearest neighbor estimation57 method to estimate the ROC curve. The IRGPI corresponding to the shortest distance between the ROC curve and point representing the 100% true-positive rate and 0% false-positive rate was used as the cutoff value.
The prognostic value of the IRGPI was assessed in patients with all stages of disease and in stage-specific groups (early stages [I/II], I, IA, IB, or II) in the meta-training, meta-testing, and independent validation cohorts in univariate analyses. We then combined IRGPI with available clinical and pathologic variables in multivariate analyses. Age, grade, and stage were treated as continuous variables. Grade was coded as well differentiated (0), moderately differentiated (1), or poorly differentiated (2). Stage IA was coded as 1; the range between IA and IB, as 1.5; IB, as 2; IIA, as 3; the range between IIA and IIB, as 3.5; IIB, as 4; IIIA, as 5; the range between IIIA and IIIB, as 5.5; IIIB, as 6; and IV, as 7. The prognostic accuracy of the biomarkers in continuous form was evaluated using the concordance index (C-index), which ranges from 0 to 1.0, with 0.5 indicating random estimation. We compared the prognostic accuracy of the IRGPI with 2 existing multigene signatures (eMethods in the Supplement) in terms of the C-index.58
Functional Annotation and Analysis
To gain biological understanding of the IRGPI, we conducted enrichment analysis of its component IRGs with DAVID (Database for Annotation, Visualization and Integrated Discovery) Bioinformatics Resources (version 6.8; https://david.ncifcrf.gov/).59 The background gene list consists of IRGs measured by all platforms. Biological processes of gene ontology with P < .10 were examined. In the TCGA project, snap-frozen tissue block of the tumor was divided into top, middle, and bottom tissue sections. The middle section was used to derive genomic data, and the top and bottom sections underwent histopathologic examination.60 Thus, in addition to RNA-Seq data, information about immune infiltration by lymphocytes, monocytes, and neutrophils and necrosis percentage is also available for tumor samples in the TCGA data set. We compared those pathologic characteristics between patients in different immune risk groups according to the IRGPI by using the Wilcoxon rank sum test.
Construction and Validation of a Composite Immune-Clinical Prognostic Index
Based on the results of multivariate analyses, we integrated age, stage, and IRGPI risk score to a composite immune-clinical prognostic index (ICPI) by applying Cox proportional hazards regression in the meta-training data set. The prognostic performance of continuous ICPI score was compared with that of the IRGPI in terms of C-index and revealed by the restricted mean survival (RMS) curve.61 RMS represents the life expectancy at 10 years for patients with different risk scores. Similar to the aforementioned method for defining the cutoff of IRGPI, the cutoff value for ICPI was estimated by time-dependent ROC curve in the meta-training data set. The performance of binary IRGPI and ICPI was assessed in terms of the RMS time ratio between low- and high-risk groups.62 A higher RMS time ratio corresponds to a larger prognostic difference.
All statistical analyses were performed using R (version 3.3.1; https://www.r-project.org/). Univariate analysis of the association of IRGPI and other clinical pathologic factors with overall survival was evaluated using log-rank test. For factors significantly associated with overall survival in univariate analyses, multivariate analysis was performed with the Cox proportional hazards regression model. The C-index was calculated with survcomp (version 1.22.0) and compared with compareC (version 1.3.1) packages.63 The RMS curve and RMS time ratio were estimated with survival (version 2.41-2) and survRM2 (version 1.0-2) packages.62 Statistical significance was defined as P < .05 unless specified otherwise.
Construction and Definition of the IRGPI
A total of 2414 patients with nonsquamous NSCLC (1205 men [50%], 1111 women [46%], and 98 of unknown sex [4%]; median age [range], 64 [15-90] years) were included in the analysis. Among 1443 IRGs from the ImmPort database, 524 IRGs were measured by all platforms and 137 026 IRGPs were constructed. We removed 126 615 IRGPs (92.4%) with constant ordering in any meta-data sets. The possible cause for constant ordering IRGPs was analyzed in the eResults in the Supplement. No statistically significant difference was observed between meta-training and meta-testing data sets in terms of clinical and pathologic factors (eTable 2 in the Supplement). The association of the 10 411 IRGPs with overall survival was assessed in the meta-training cohort, resulting in 281 prognostic IRGPs. We then constructed an IRGPI consisting of 25 IRGPs by using L1-penalized Cox proportional hazards regression on the meta-training data set. Robustness of the 25 IRGPs against 1000 randomizations of the meta cohort was assessed (eMethods in the Supplement). The 25 IRGPs of the IRGPI were selected at a significantly higher frequency (P < 1.7 × 10−14) than were those by different randomizations (eResults in the Supplement). The IRGPI consisted of 40 unique IRGs, of which 28 were cytokines or cytokine receptors (eTable 3 in the Supplement). On the basis of time-dependent ROC curve analysis, the optimal cutoff for the IRGPI to stratify patients into the high or the low immune risk group was determined to be 0.988 (eFigure 2 in the Supplement).
Validation of the IRGPI as a Stage-Independent Prognostic Factor
The IRGPI significantly stratified patients into low- vs high-risk groups in terms of overall survival (eTable 4 and eFigure 3 in the Supplement). It remained as an independent prognostic factor in multivariate analyses, after adjusting for clinical and pathologic factors such as age, sex, smoking status, tumor stage, and tumor grade (hazard ratio [HR] range, 1.72 [95% CI, 1.26-2.33; P < .001] to 2.36 [95% CI, 1.47-3.79; P < .001]) (eTable 4 in the Supplement). Furthermore, the IRGPI stratified patients with early-stage (I and II) nonsquamous NSCLC into significantly different prognostic groups (eFigure 4A-E in the Supplement). When considering patients with stage I disease only, the IRGPI remained highly prognostic for the meta-testing data set (HR, 2.89; 95% CI, 1.95-4.29; P = 3.03 × 10−8) and independent validation cohorts (HR, 2.04; 95% CI, 1.53-2.73; P = 8.87 × 10−7) (Figure 1A and B). When further restricted to patients with stage IA or IB disease, the IRGPI could stratify patients into subgroups with significantly different prognoses in the meta-testing and/or independent validation cohorts (eFigure 5 in the Supplement). Similarly, a higher IRGPI was correlated with significantly worse prognosis patients with stage II disease in meta-testing (HR, 1.73; 95% CI, 1.03-2.92; P = .037) and in independent validation cohorts (HR, 3.39; 95% CI, 1.80-6.37; P = 5.53 × 10-5) (Figure 1C and D) and patients with more advanced stage III to IV disease (HR, 1.56; 95% CI, 1.14-2.15; P = .005) (eFigure 4F in the Supplement). Overall, the IRGPI appears to estimate overall survival indepently of stage in nonsquamous NSCLC (Figure 2).
Functional Annotation of the IRGPI
Enrichment analysis of the 40 unique IRGs identified 6 overrepresented biological processes in gene ontology (eTable 5 in the Supplement). Most biological processes were chemotaxis. Chemotaxis of various immune infiltrates, such as neutrophils, monocytes, and macrophages, was observed. In the TCGA data set, we found that the percentages of necrosis and neutrophil infiltration were significantly different between IRGPI risk groups (Figure 3). The mean level of necrosis in the IRGPI high-risk group was 3-fold that in IRGPI low-risk group in the bottom (6.5% vs 2.1%; P = 2.39 × 10−6) and top (4.6% vs 1.5%; P = 6.29 × 10−13) sections. Patients with a higher IRGPI also had significantly higher neutrophil infiltration in their tumors in the bottom (4.8% vs 1.3%; P = .002) and top (4.3% vs 1.6%; P = .001) sections. However, no statistically significant difference in lymphocyte or monocyte infiltration was observed between the 2 IRGPI risk groups in the TCGA data set.
Comparison With Other Gene Expression Signatures
We compared the IRGPI with 2 clinical applicable and commercialized biomarkers, including a 14-gene biomarker for stages I to III nonsquamous NSCLC and a 31-cell cycle progression (CCP) gene biomarker for early-stage (I and II) NSCLC (eMethods in the Supplement).6,9,64 Only TCGA and GSE13213 included all 14 genes needed to construct the 14-gene biomarker. For both data sets, the IRGPI achieved a higher C-index compared with the 14-gene biomarker (eFigure 6A in the Supplement). Of note, for patients with stages I to III disease in the TCGA data set, the C-index of the IRGPI was significantly different from random estimation (C-index, 0.62; 95% CI, 0.55-0.69; P < .001), whereas the 14-gene biomarker was not (C-index, 0.50; 95% CI, 0.44-0.56; P = .93). Similarly, in 3 of 4 data sets, the IRGPI showed a higher C-index than the CCP biomarker (eFigure 6B in the Supplement). Overall, the mean C-index weighted by cohort size was 0.64 for IRGPI vs 0.53 for the 14-gene biomarker and 0.61 for the CCP biomarker in respective comparisons.
We used the same 1000 random splits of the meta-cohort to further assess the accuracy and variability of survival estimation for different biomarkers (eMethods in the Supplement). The IRGP-based models showed a higher median C-index of 0.65 and lower variation (SD, 0.015) in the meta-testing cohorts compared with CCP and the 14-gene biomarker, with median (SD) C-indexes of 0.57 (0.020) and 0.56 (0.025), respectively.
Integrated Prognostic Index by Combining the IRGPI With Clinical Factors
In multivariate analysis (eTable 4 in the Supplement), age, stage and IRGPI were independent prognostic factors in at least 3 data sets, suggesting their complementary value. To further improve accuracy, we combined age, stage, and IRGPI score to fit a Cox proportional hazards regression model using the meta-training data set and derived an ICPI as (0.0265 × age) + (0.267 × stage) + (1.917 × IRGPI score). An optimal cutoff of 0.116 for stratifying patients was determined based on time-dependent ROC curve analysis in the meta-training data set (eFigure 7A in the Supplement). Significantly improved estimation of survival was achieved by the continuous form of ICPI relative to IRGPI (mean C-index, 0.70 vs 0.63 in the validation data set; P < .005) (eFigure 7B in the Supplement and Figure 4). Similar results were observed in binary form of the ICPI relative to the IRGPI (eFigure 8 and eTable 6 in the Supplement).
We also compared our ICPI with a commercialized clinical-molecular composite biomarker mPS, which integrates stage with CCP score, in patients with early-stage disease (eMethods in the Supplement). Our ICPI achieved a higher accuracy of survival estimation in all validation data sets (mean C-index, 0.68 for ICPI vs 0.65 for mPS; eFigure 9 in the Supplement).
Patients with early-stage NSCLC are at substantial risk for recurrence and death, even after complete surgical resection. The use of adjuvant therapy in early-stage, particularly stage I, NSCLC remains controversial, because previous randomized trials have not demonstrated a consistent survival benefit. Reliable prognostic biomarkers are critically needed to select patients who are at highest risk for recurrence and who might benefit from additional systemic therapy. Significant research on gene expression–based prognostic signatures6-10 has led to recent commercialization of 2 biomarkers in lung adenocarcinoma,6,9 but their accuracy of survival estimation remains limited. In this study, we developed a prognostic signature based on 25 immune-related gene pairs for nonsquamous NSCLC and validated it in multiple independent data sets across different platforms. Our prognostic immune signature can further stratify clinically defined groups of patients (eg, early-stage [I/II] and stages I, IA, IB, and II nonsquamous NSCLC) into subgroups with different survival outcomes. In benchmark comparisons, our signature achieved higher accuracy than 2 commercialized molecular biomarkers. We further leveraged the complementary value of molecular and clinical characteristics and showed that combining both could provide a more accurate estimation of overall survival in nonsquamous NSCLC.
To identify reliable biomarkers of nonsquamous NSCLC prognoses, we combined gene expression profiles from multiple data sets and used methods that are specifically designed to perform robustly given technical biases inherent across different platforms with microarray or RNA-Seq technologies.65,66 Our prognostic signature is based on the relative ranking of gene expression values and only involves pairwise comparison within the gene expression profile of a sample, thus eliminating the need for data normalization. As such, our prognostic signature can serve as an individualized, single-sample estimate of survival of NSCLC and may be readily translated to clinical practice.
Prognostic or predictive biomarkers related to the tumor immune microenvironment may hold great promise for identifying novel molecular targets and improving patient management in the era of immunotherapy.2 Suzuki et al67 discovered that stromal rather than tumor nest FoxP3:CD3 ratio and tumor expression level of certain cytokines (interleukin 12 receptor β2 and interleukin 7 receptor) were correlated with recurrence in stage I lung adenocarcinoma. Because gene expression profiles used in the present study were derived from a core sample of tumor tissue, we did not observe significant differences in lymphocyte infiltration between low- and high-risk IRGPI groups, which is consistent with the previous findings. Similarly, most genes contained in our immune signature were also cytokines and cytokine receptors, which play key roles in chemotaxis, angiogenesis, and inflammatory processes.68 An increased inflammatory microenvironment has been shown to be a consistent component of neoplastic process and tumor progression.68,69 Different from apoptosis, which is immunologically and inflammatorily silent,70 necrosis can release proinflammatory intracellular content into the tumor microenvironment and induce an inflammatory response involving a diverse set of immune cells such as neutrophils and macrophages.71 In addition, tumor-associated neutrophils have been shown to be associated with poor prognoses in a variety of cancer types.19,72-75 We found significantly increased level of necrosis and infiltration of neutrophils consistently in the high immune–risk group in the TCGA data set. On the basis of the aforementioned findings, dysregulated immune contexture might be the reason for the survival differences observed between patient groups as defined by our signature.
Limitations of our study include its retrospective nature, although we tried to include as many data sets as possible for more rigorous validation of our biomarker. Similar to other studies,76,77 gene expression signatures are subject to sampling bias caused by intratumor genetic heterogeneity. Although we excluded IRGPs with constant ordering to reduce certain cross-study batch effects, their complex nature implies that not all batch effects can be addressed, and some may remain.14 Future studies will integrate diverse biological processes, which could provide a more complete molecular picture of the tumor.
The proposed immune-related gene pair–based signature is a promising prognostic biomarker in nonsquamous NSCLC, including early-stage disease. Prospective studies are needed to further validate its analytical accuracy for estimating prognoses and to test its clinical utility in individualized management of nonsquamous NSCLC.
Corresponding Author: Ruijiang Li, PhD, Department of Radiation Oncology, Stanford University School of Medicine, 1070 Arastradero Rd, Palo Alto, CA 94304 (rli2@stanford.edu).
Accepted for Publication: April 19, 2017.
Published Online: July 6, 2017. doi:10.1001/jamaoncol.2017.1609
Author Contributions: Dr R. Li had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: All authors.
Acquisition, analysis, or interpretation of data: All authors.
Drafting of the manuscript: B. Li, R. Li.
Critical revision of the manuscript for important intellectual content: All authors.
Statistical analysis: B. Li, Cui, Diehn.
Obtained funding: R. Li.
Administrative, technical, or material support: B. Li, Cui, R. Li.
Study supervision: Diehn, R. Li.
Conflict of Interest Disclosures: None reported.
Funding/Support: This study was supported in part by grant R01CA193730 from the National Institutes of Health.
Role of the Funder/Sponsor: The sponsor had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
2.Vargas
AJ, Harris
CC. Biomarker development in the precision medicine era: lung cancer as a case study.
Nat Rev Cancer. 2016;16(8):525-537.
PubMedGoogle ScholarCrossref 3.Pignon
J-P, Tribodet
H, Scagliotti
GV,
et al; LACE Collaborative Group. Lung adjuvant cisplatin evaluation: a pooled analysis by the LACE Collaborative Group.
J Clin Oncol. 2008;26(21):3552-3559.
PubMedGoogle ScholarCrossref 4.Sangha
R, Price
J, Butts
CA. Adjuvant therapy in non–small cell lung cancer: current and future directions.
Oncologist. 2010;15(8):862-872.
PubMedGoogle ScholarCrossref 5.Strauss
GM, Herndon
JE
II, Maddaus
MA,
et al. Adjuvant paclitaxel plus carboplatin compared with observation in stage IB non–small-cell lung cancer: CALGB 9633 with the Cancer and Leukemia Group B, Radiation Therapy Oncology Group, and North Central Cancer Treatment Group Study groups.
J Clin Oncol. 2008;26(31):5043-5051.
PubMedGoogle ScholarCrossref 6.Kratz
JR, He
J, Van Den Eeden
SK,
et al. A practical molecular assay to predict survival in resected non-squamous, non–small-cell lung cancer: development and international validation studies.
Lancet. 2012;379(9818):823-832.
PubMedGoogle ScholarCrossref 7.Beer
DG, Kardia
SL, Huang
CC,
et al. Gene-expression profiles predict survival of patients with lung adenocarcinoma.
Nat Med. 2002;8(8):816-824.
PubMedGoogle Scholar 8.Shedden
K, Taylor
JM, Enkemann
SA,
et al; Director’s Challenge Consortium for the Molecular Classification of Lung Adenocarcinoma. Gene expression–based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study.
Nat Med. 2008;14(8):822-827.
PubMedGoogle ScholarCrossref 9.Wistuba
II, Behrens
C, Lombardi
F,
et al. Validation of a proliferation-based expression signature as prognostic marker in early stage lung adenocarcinoma.
Clin Cancer Res. 2013;19(22):6261-6271.
PubMedGoogle ScholarCrossref 10.Chen
H-Y, Yu
S-L, Chen
C-H,
et al. A five-gene signature and clinical outcome in non-small-cell lung cancer.
N Engl J Med. 2007;356(1):11-20.
PubMedGoogle ScholarCrossref 11.Subramanian
J, Simon
R. Gene expression–based prognostic signatures in lung cancer: ready for clinical use?
J Natl Cancer Inst. 2010;102(7):464-474.
PubMedGoogle ScholarCrossref 13.Guo
NL, Wan
Y-W, Tosun
K,
et al. Confirmation of gene expression–based prediction of survival in non–small cell lung cancer.
Clin Cancer Res. 2008;14(24):8213-8220.
PubMedGoogle ScholarCrossref 14.Leek
JT, Scharpf
RB, Bravo
HC,
et al. Tackling the widespread and critical impact of batch effects in high-throughput data.
Nat Rev Genet. 2010;11(10):733-739.
PubMedGoogle ScholarCrossref 15.Popovici
V, Budinska
E, Tejpar
S,
et al. Identification of a poor-prognosis
BRAF-mutant–like population of patients with colon cancer.
J Clin Oncol. 2012;30(12):1288-1295.
PubMedGoogle ScholarCrossref 16.Zak
DE, Penn-Nicholson
A, Scriba
TJ,
et al; ACS and GC6-74 cohort study groups. A blood RNA signature for tuberculosis disease risk: a prospective cohort study.
Lancet. 2016;387(10035):2312-2322.
PubMedGoogle ScholarCrossref 17.Heinäniemi
M, Nykter
M, Kramer
R,
et al. Gene-pair expression signatures reveal lineage control.
Nat Methods. 2013;10(6):577-583.
PubMedGoogle ScholarCrossref 18.Angell
H, Galon
J. From the immune contexture to the Immunoscore: the role of prognostic and predictive immune markers in cancer.
Curr Opin Immunol. 2013;25(2):261-267.
PubMedGoogle ScholarCrossref 19.Gentles
AJ, Newman
AM, Liu
CL,
et al. The prognostic landscape of genes and infiltrating immune cells across human cancers.
Nat Med. 2015;21(8):938-945.
PubMedGoogle ScholarCrossref 23.Gentles
AJ, Bratman
SV, Lee
LJ,
et al. Integrating tumor and stromal gene expression signatures with clinical indices for survival stratification of early-stage non–small cell lung cancer.
J Natl Cancer Inst. 2015;107(10):djv211.
PubMedGoogle ScholarCrossref 24.Topalian
SL, Hodi
FS, Brahmer
JR,
et al. Safety, activity, and immune correlates of anti–PD-1 antibody in cancer.
N Engl J Med. 2012;366(26):2443-2454.
PubMedGoogle ScholarCrossref 25.Garon
E, Gandhi
L, Rizvi
N,
et al. LBA43: antitumor activity of pembrolizumab (PEMBRO; MK-3475) and correlation with programmed death ligand 1 (PD-L1) expression in a pooled analysis of patients (pts) with advanced non–small cell lung carcinoma (NSCLC).
Ann Oncol. 2014;25(suppl 4):1-41.
Google ScholarCrossref 26.Galon
J, Costes
A, Sanchez-Cabo
F,
et al. Type, density, and location of immune cells within human colorectal tumors predict clinical outcome.
Science. 2006;313(5795):1960-1964.
PubMedGoogle ScholarCrossref 27.Al-Shibli
KI, Donnem
T, Al-Saad
S, Persson
M, Bremnes
RM, Busund
L-T. Prognostic effect of epithelial and stromal lymphocyte infiltration in non–small cell lung cancer.
Clin Cancer Res. 2008;14(16):5220-5227.
PubMedGoogle ScholarCrossref 28.Bates
GJ, Fox
SB, Han
C,
et al. Quantification of regulatory T cells enables the identification of high-risk breast cancer patients and those at risk of late relapse.
J Clin Oncol. 2006;24(34):5373-5380.
PubMedGoogle ScholarCrossref 29.Mlecnik
B, Tosolini
M, Kirilovsky
A,
et al. Histopathologic-based prognostic factors of colorectal cancers are associated with the state of the local immune reaction.
J Clin Oncol. 2011;29(6):610-618.
PubMedGoogle ScholarCrossref 30.Galon
J, Pagès
F, Marincola
FM,
et al. Cancer classification using the Immunoscore: a worldwide task force.
J Transl Med. 2012;10(1):205.
PubMedGoogle ScholarCrossref 31.Bindea
G, Mlecnik
B, Tosolini
M,
et al. Spatiotemporal dynamics of intratumoral immune cells reveal the immune landscape in human cancer.
Immunity. 2013;39(4):782-795.
PubMedGoogle ScholarCrossref 32.Rooney
MS, Shukla
SA, Wu
CJ, Getz
G, Hacohen
N. Molecular and genetic properties of tumors associated with local immune cytolytic activity.
Cell. 2015;160(1-2):48-61.
PubMedGoogle ScholarCrossref 33.Angelova
M, Charoentong
P, Hackl
H,
et al. Characterization of the immunophenotypes and antigenomes of colorectal cancers reveals distinct tumor escape mechanisms and novel targets for immunotherapy.
Genome Biol. 2015;16(1):64.
PubMedGoogle ScholarCrossref 34.Suzuki
K, Kachala
SS, Kadota
K,
et al. Prognostic immune markers in non–small cell lung cancer [published correction appears in
Nature. 2014;514(7521):262].
Clin Cancer Res. 2011;17(16):5247-5256.
PubMedGoogle ScholarCrossref 35.Cancer Genome Atlas Research Network. Comprehensive molecular profiling of lung adenocarcinoma.
Nature. 2014;511(7511):543-550.
PubMedGoogle ScholarCrossref 36.Cancer Genome Atlas Research Network. Comprehensive genomic characterization of squamous cell lung cancers [published correction appears in
Nature. 2012;491(7423):288].
Nature. 2012;489(7417):519-525.
PubMedGoogle ScholarCrossref 37.Zitvogel
L, Apetoh
L, Ghiringhelli
F, Kroemer
G. Immunological aspects of cancer chemotherapy.
Nat Rev Immunol. 2008;8(1):59-73.
PubMedGoogle ScholarCrossref 38.Rousseaux
S, Debernardi
A, Jacquiau
B,
et al. Ectopic activation of germline and placental genes identifies aggressive metastasis-prone lung cancers.
Sci Transl Med. 2013;5(186):186ra66.
PubMedGoogle ScholarCrossref 39.Okayama
H, Kohno
T, Ishii
Y,
et al. Identification of genes upregulated in
ALK-positive and
EGFR/KRAS/ALK-negative lung adenocarcinomas.
Cancer Res. 2012;72(1):100-111.
PubMedGoogle ScholarCrossref 40.Sato
M, Larsen
JE, Lee
W,
et al. Human lung epithelial cells progressed to malignancy through specific oncogenic manipulations.
Mol Cancer Res. 2013;11(6):638-650.
PubMedGoogle ScholarCrossref 41.Der
SD, Sykes
J, Pintilie
M,
et al. Validation of a histology-independent prognostic gene signature for early-stage, non-small-cell lung cancer including stage IA patients.
J Thorac Oncol. 2014;9(1):59-64.
PubMedGoogle ScholarCrossref 42.Tomida
S, Takeuchi
T, Shimada
Y,
et al. Relapse-related molecular signature in lung adenocarcinomas identifies patients with dismal prognosis.
J Clin Oncol. 2009;27(17):2793-2799.
PubMedGoogle ScholarCrossref 43.Wilkerson
MD, Yin
X, Walter
V,
et al. Differential pathogenesis of lung adenocarcinoma subtypes involving sequence mutations, copy number, chromosomal instability, and methylation.
PLoS One. 2012;7(5):e36530.
PubMedGoogle ScholarCrossref 44.Bhattacharjee
A, Richards
WG, Staunton
J,
et al. Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.
Proc Natl Acad Sci U S A. 2001;98(24):13790-13795.
PubMedGoogle ScholarCrossref 45.Takeuchi
T, Tomida
S, Yatabe
Y,
et al. Expression profile–defined classification of lung adenocarcinoma shows close relationship with underlying major genetic changes and clinicopathologic behaviors.
J Clin Oncol. 2006;24(11):1679-1688.
PubMedGoogle ScholarCrossref 46.Tang
H, Xiao
G, Behrens
C,
et al. A 12-gene set predicts survival benefits from adjuvant chemotherapy in non–small cell lung cancer patients.
Clin Cancer Res. 2013;19(6):1577-1586.
PubMedGoogle ScholarCrossref 47.Hou
J, Aerts
J, den Hamer
B,
et al. Gene expression-based classification of non–small cell lung carcinomas and survival prediction.
PLoS One. 2010;5(4):e10312.
PubMedGoogle ScholarCrossref 48.Bild
AH, Yao
G, Chang
JT,
et al. Oncogenic pathway signatures in human cancers as a guide to targeted therapies.
Nature. 2006;439(7074):353-357.
PubMedGoogle ScholarCrossref 49.Botling
J, Edlund
K, Lohr
M,
et al. Biomarker discovery in non–small cell lung cancer: integrating gene expression profiling, meta-analysis, and tissue microarray validation.
Clin Cancer Res. 2013;19(1):194-204.
PubMedGoogle ScholarCrossref 50.Kuner
R, Muley
T, Meister
M,
et al. Global gene expression analysis reveals specific patterns of cell junctions in non–small cell lung cancer subtypes.
Lung Cancer. 2009;63(1):32-38.
PubMedGoogle ScholarCrossref 51.Zhu
CQ, Ding
K, Strumpf
D,
et al. Prognostic and predictive gene signature for adjuvant chemotherapy in resected non–small-cell lung cancer.
J Clin Oncol. 2010;28(29):4417-4424.
PubMedGoogle ScholarCrossref 52.Xie
Y, Xiao
G, Coombes
KR,
et al. Robust gene expression signature from formalin-fixed paraffin-embedded samples predicts prognosis of non–small-cell lung cancer patients.
Clin Cancer Res. 2011;17(17):5705-5714.
PubMedGoogle ScholarCrossref 53.Bhattacharya
S, Andorf
S, Gomes
L,
et al. ImmPort: disseminating data to the public for the future of immunology.
Immunol Res. 2014;58(2-3):234-239.
PubMedGoogle ScholarCrossref 54.Kim
S, Lin
CW, Tseng
GC. MetaKTSP: a meta-analytic top scoring pair method for robust cross-study validation of omics prediction analysis.
Bioinformatics. 2016;32(13):1966-1973.
PubMedGoogle ScholarCrossref 55.Simon
N, Friedman
J, Hastie
T, Tibshirani
R. Regularization paths for Cox’s proportional hazards model via coordinate descent.
J Stat Softw. 2011;39(5):1-13.
PubMedGoogle ScholarCrossref 56.Heagerty
PJ, Lumley
T, Pepe
MS. Time-dependent ROC curves for censored survival data and a diagnostic marker.
Biometrics. 2000;56(2):337-344.
PubMedGoogle ScholarCrossref 57.Akritas
MG. Nearest neighbor estimation of a bivariate distribution under random censoring.
Ann Stat. 1994;22(3):1299-1327.
Google ScholarCrossref 58.Cheng
WY, Ou Yang
TH, Anastassiou
D. Development of a prognostic model for breast cancer survival in an open challenge environment.
Sci Transl Med. 2013;5(181):181ra50.
PubMedGoogle ScholarCrossref 59.Huang
W, Sherman
BT, Lempicki
RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.
Nat Protoc. 2009;4(1):44-57.
PubMedGoogle ScholarCrossref 60.Cooper
LA, Kong
J, Gutman
DA, Dunn
WD, Nalisnik
M, Brat
DJ. Novel genotype-phenotype associations in human cancers enabled by advanced molecular platforms and computational analysis of whole slide images.
Lab Invest. 2015;95(4):366-376.
PubMedGoogle ScholarCrossref 61.Eng
KH, Schiller
E, Morrell
K. On representing the prognostic value of continuous gene expression biomarkers with the restricted mean survival curve.
Oncotarget. 2015;6(34):36308-36318.
PubMedGoogle Scholar 62.Uno
H, Claggett
B, Tian
L,
et al. Moving beyond the hazard ratio in quantifying the between-group difference in survival analysis.
J Clin Oncol. 2014;32(22):2380-2385.
PubMedGoogle ScholarCrossref 63.Kang
L, Chen
W, Petrick
NA, Gallas
BD. Comparing two correlated C indices with right-censored survival outcome: a one-shot nonparametric approach.
Stat Med. 2015;34(4):685-703.
PubMedGoogle ScholarCrossref 64.Zheng
Y, Bueno
R. Commercially available prognostic molecular models in early-stage lung cancer: a review of the Pervenio Lung RS and Myriad myPlan Lung Cancer tests.
Expert Rev Mol Diagn. 2015;15(5):589-596.
PubMedGoogle ScholarCrossref 65.Tan
AC, Naiman
DQ, Xu
L, Winslow
RL, Geman
D. Simple decision rules for classifying human cancers from gene expression profiles.
Bioinformatics. 2005;21(20):3896-3904.
PubMedGoogle ScholarCrossref 66.Eddy
JA, Sung
J, Geman
D, Price
ND. Relative expression analysis for molecular cancer diagnosis and prognosis.
Technol Cancer Res Treat. 2010;9(2):149-159.
PubMedGoogle ScholarCrossref 67.Suzuki
K, Kadota
K, Sima
CS,
et al. Clinical impact of immune microenvironment in stage I lung adenocarcinoma: tumor interleukin-12 receptor β2 (IL-12Rβ2), IL-7R, and stromal FoxP3/CD3 ratio are independent predictors of recurrence.
J Clin Oncol. 2013;31(4):490-498.
PubMedGoogle ScholarCrossref 68.Lin
WW, Karin
M. A cytokine-mediated link between innate immunity, inflammation, and cancer.
J Clin Invest. 2007;117(5):1175-1183.
PubMedGoogle ScholarCrossref 70.Cvetanovic
M, Ucker
DS. Innate immune discrimination of apoptotic cells: repression of proinflammatory macrophage transcription is coupled directly to specific recognition.
J Immunol. 2004;172(2):880-889.
PubMedGoogle ScholarCrossref 71.Festjens
N, Vanden Berghe
T, Vandenabeele
P. Necrosis, a well-orchestrated form of cell demise: signalling cascades, important mediators and concomitant immune response.
Biochim Biophys Acta. 2006;1757(9-10):1371-1387.
PubMedGoogle ScholarCrossref 72.Donskov
F. Immunomonitoring and prognostic relevance of neutrophils in clinical trials.
Semin Cancer Biol. 2013;23(3):200-207.
PubMedGoogle ScholarCrossref 73.Jensen
HK, Donskov
F, Marcussen
N, Nordsmark
M, Lundbeck
F, von der Maase
H. Presence of intratumoral neutrophils is an independent prognostic factor in localized renal cell carcinoma.
J Clin Oncol. 2009;27(28):4709-4717.
PubMedGoogle ScholarCrossref 74.Fossati
G, Ricevuti
G, Edwards
SW, Walker
C, Dalton
A, Rossi
ML. Neutrophil infiltration into human gliomas.
Acta Neuropathol. 1999;98(4):349-354.
PubMedGoogle ScholarCrossref 75.Jensen
TO, Schmidt
H, Møller
HJ,
et al. Intratumoral neutrophils and plasmacytoid dendritic cells indicate poor prognosis and are associated with pSTAT3 expression in AJCC stage I/II melanoma.
Cancer. 2012;118(9):2476-2485.
PubMedGoogle ScholarCrossref 76.Gerlinger
M, Rowan
AJ, Horswell
S,
et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing.
N Engl J Med. 2012;366(10):883-892.
PubMedGoogle ScholarCrossref 77.Zhang
J, Fujimoto
J, Zhang
J,
et al. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing.
Science. 2014;346(6206):256-259.
PubMedGoogle ScholarCrossref