Models that combine transcriptomic with spatial protein information exceed the predictive value for either single modality

Vathiotis, Ioannis A.; Yang, Zhi; Reeves, Jason; Toki, Maria; Aung, Thazin Nwe; Wong, Pok Fai; Kluger, Harriet; Syrigos, Konstantinos N.; Warren, Sarah; Rimm, David L.

doi:10.1038/s41698-021-00184-1

Download PDF

Brief Communication
Open access
Published: 28 May 2021

Models that combine transcriptomic with spatial protein information exceed the predictive value for either single modality

npj Precision Oncology volume 5, Article number: 45 (2021) Cite this article

4662 Accesses
10 Citations
4 Altmetric
Metrics details

Subjects

Immunotherapy has reshaped the field of cancer therapeutics but the population that benefits are small in many tumor types, warranting a companion diagnostic test. While immunohistochemistry (IHC) for programmed death-ligand 1 (PD-L1) or mismatch repair (MMR) and polymerase chain reaction (PCR) for microsatellite instability (MSI) are the only approved companion diagnostics others are under consideration. An optimal companion diagnostic test might combine the spatial information of IHC with the quantitative information from RNA expression profiling. Here, we show proof of concept for combination of spatially resolved protein information acquired by the NanoString GeoMx® Digital Spatial Profiler (DSP) with transcriptomic information from bulk mRNA gene expression acquired using NanoString nCounter® PanCancer IO 360™ panel on the same cohort of immunotherapy treated melanoma patients to create predictive models associated with clinical outcomes. We show that the combination of mRNA and spatially defined protein information can predict clinical outcomes more accurately (AUC 0.97) than either of these factors alone.

Multimodal single-cell and whole-genome sequencing of small, frozen clinical specimens

Article 09 January 2023

Deep spatial-omics analysis of Head & Neck carcinomas provides alternative therapeutic targets and rationale for treatment failure

Article Open access 13 September 2023

Digital quantitative assessment of PD-L1 using digital spatial profiling

Article 06 April 2020

Combination immunotherapy targeting both cytotoxic T-lymphocyte associated protein 4 (CTLA-4) and programmed cell death 1 (PD-1) immune checkpoints has resulted in a median progression-free survival (PFS) of 11.5 months and 5-year survival rates of up to 52% in previously untreated patients with advanced melanoma.^1,2,3 However, objective response to immune checkpoint inhibitors (ICI) is limited to 40% and evidence of clinical activity is present in up to 65% of patients.⁴

A number of methods have been tested for their predictive value for ICI therapy.⁵ PD-L1 expression by IHC on formalin-fixed, paraffin-embedded (FFPE) tissue has demonstrated limited predictive ability in patients with metastatic melanoma. PD-L1 IHC maintains suboptimal accuracy and reproducibility and offers limited information about the tumor microenvironment (TME).^6,7,8 High tumor mutational burden (TMB) has also been correlated with response to ICI. However, TMB provides indirect and equivocal information about the immune response and has not been standardized yet.^9,10,11 Recent studies have focused on generating gene expression profiles (GEP) to address all different cell types and phenotypes that comprise the complex TME and describe the crosstalk among different immune-regulatory pathways.^12,13,14 GEPs have indeed proved more accurate in predicting response to ICI, however, transcriptomic assays lack spatial information that may provide context about the source of the transcript within the tumor microenvironment.¹⁵

To assess the relative power of both spatially informed protein combined with GEPs, we utilized a cohort of 59 retrospectively collected melanoma patients that received treatment with anti-PD-1 (34/59; nivolumab, pembrolizumab) or combination (25/59; ipilimumab plus nivolumab) immunotherapy in the metastatic setting at Yale Cancer Center (Supplementary Table 1).¹⁶ Unsupervised hierarchical clustering on the 770 mRNA and 132 protein variables (44 DSP targets, measured in three different compartments) revealed that DSP data mainly clustered away from bulk mRNA gene expression data, suggesting that RNA and protein bear discrete, mostly nonoverlapping pieces of biological information (Fig. 1a). Next, we compared normalized bulk mRNA counts to normalized protein counts in three different compartments (the melanocyte [s100/HMB45] compartment, the leukocyte [CD45] compartment and the macrophage [CD68] compartment) and the sum of all three compartments. We saw that mRNAs and protein products were best correlated in the melanocyte compartment, possibly reflecting both the abundance of tumor tissue after FFPE sample macrodissection as well as its transcriptional overactivity driving gene expression. Most protein derivatives exhibited a positive correlation with their corresponding mRNAs. We also found a particular set of proteins that showed weak correlation or anti-correlation with their mRNAs (CD276, MLH1, MYC, BCL2, MSH2, MKI67, PMS2, CTNNB1, and STAT3) (Fig. 1b).

**Fig. 1: Correlation between mRNA and protein.**

Although currently challenging to assess on a single platform, combined modality (mRNA and protein) models may provide more detailed and comprehensive biological information, incorporating data related to immune regulation and other aspects of the tumor–stroma interaction and thus, prove superior to the existing models in predicting response to ICI. To explore this hypothesis, we extracted 527 variables that were modestly associated with the best overall response (BOR) (p < 0.10) by unadjusted univariate analysis (Fig. 2a). After removing moderately correlated predictors (R² > 0.70), we used Elastic Net Regularization for feature selection and optimization for inclusion in different predictive models. We generated three models: a bulk mRNA gene expression model (n = 770 variables; PD/SD vs PR/CR, -0.23; p < 0.0001), a DSP model (n = 117 variables; PD/SD vs PR/CR, -0.04; p = 0.002) and a combined modality model (n = 44 variables, including 10 protein and 34 mRNA; PD/SD vs PR/CR, -0.68; p < 0.0001) (Fig. 2b). All proteins included in the DSP model were quantified in either the s100/HMB45 or the CD68 compartment. Although both PD-L1 mRNA and protein in the CD68 compartment were significantly associated with BOR, neither was selected in the combined modality model.

**Fig. 2: Combination of mRNA and protein improves best overall response (BOR) classification.**

We observed improvement in the classification of BOR for the 44-variable combined modality model (Area under the curve [AUC], 0.97; 95 confidence intervals [CI], 0.92 to 1.00; sensitivity, 0.96; specificity, 0.93; positive predictive value [PPV], 0.91; negative predictive value [NPV], 0.96). This exceeded the AUC for both the 770-variable transcriptomic model (AUC, 0.93; 95% CI, 0.87–1.00; sensitivity, 0.93; specificity, 0.87; PPV, 0.85; NPV, 0.94) and the 117-variable DSP model (AUC, 0.87; 95% CI, 0.80–0.94; sensitivity, 0.79; specificity, 0.88; PPV, 0.84; NPV, 0.84). Model improvement was more prominent over DSP rather than bulk mRNA, possibly because of the increased number (~ 5-fold) of features derived from the RNA dataset that were introduced in the analysis. The features, or variables, while fractionally increasing the AUC, can make the application of the model impractical or non-reproducible.

Toward the goal of generating a clinical test, we extended the analysis of the data to find the minimal subset of variables that need to be included in the model without causing substantial decline in model performance, representing the optimal trade-off between efficacy and simplicity. To accomplish this, we ranked the top ten sets of variables that demonstrated the highest predictive ability, based on AUC, for any given number of variables included in the model (K). Then, we constructed new sets that were composed of the most frequently appearing variables for each K value. Finally, we calculated AUC, sensitivity, specificity, PPV, and NPV to compare these sets for K values between 4 and 13 (Fig. 3a and Supplementary Fig. 1a–d). The first peak in all five curves was observed when the number of variables was equal to eight (K = 8). Hence, we selected the 8-variable (CCNO, ID4, IER3, IL2RB, MGMT, NRDE2, TNFAIP6, and MSH2 in s100/HMB45) (Fig. 3b) hereafter referred to as the Yale Mixed Modality Model (YMMM) (Supplementary Table 2). We then tested the YMMM for the prediction of BOR to ICI in patients with advanced melanoma (AUC, 0.88; 95% CI, 0.78–0.95; sensitivity, 0.85; specificity, 0.83; PPV, 0.79; NPV, 0.88) (Fig. 4a, b). It was apparent that YMMM incorporated three distinct components; a component pertaining to cell cycle regulation and oncogenesis (CCNO, ID4, and IER3), a component related to unrepaired DNA damage, accumulation of mutations, and microsatellite instability (MGMT, NRDE2, and MSH2 in S100/HMB45) and a component directly linked with the immune response towards the primary tumor (IL2RB and TNFAIP6).^{17,18,19,20,21}

**Fig. 3: Generation of Yale Mixed Modality Model (YMMM) for the prediction of best overall response to immunotherapy in patients with advanced melanoma.**

**Fig. 4: Yale Combined Modality Model (YMMM) predicts response to immunotherapy in patients with advanced melanoma.**

YMMM performance for the prediction of progression-free survival (PFS) and overall survival (OS) needs to be considered in the context for which it will be used. For this analysis, we calculated the optimal, based on the Youden’s index, cutpoint for the prediction of BOR and used it to create high and low risk subgroups. Patients with high score according to YMMM performed significantly better in terms of both PFS (HR, 0.20; 95% CI, 0.10–0.41; log rank p < 0.0001) and OS (HR, 0.16; 95% CI, 0.06–0.43; log rank p < 0.0001) in comparison with patients with low YMMM score (Fig. 4c, d). Previous studies have demonstrated that conventional biomarkers carry suboptimal predictive ability for melanoma patients treated with ICI, as they are only able to illuminate one or limited aspects of the tumor-TME interaction and are designed to implement binary patient stratification (positive/negative), failing to incorporate the dynamic range of responses to this particular type of therapy.²² YMMM represents a multimodality approach that selects and encompasses essential information about multiple elements related to response to ICI. Furthermore, it functions as a continuous score, rather than a binary variable, enabling precise as well as dynamic benefit stratification to optimize clinical decision making.

But in practice, especially in the metastatic setting, many predictive assays do not use the optimal area under the curve since it is critical to provide patients the greatest opportunity to benefit by maximizing sensitivity. An example of this is ERBB2 in breast cancer. The current assay combination of IHC, then FISH has high sensitivity (as high as 95%) but relatively low specificity^23,24 for predictive response to HER2 targeting therapy. In fact, even in the adjuvant setting where 65–70% of patients showed long-term survival with placebo,²⁵ the same assay is used in effort to leave no patient behind, although many patients will not benefit from the drug (low specificity). Similarly, as we build the YMMM assay with a limited, accessible and highly reproducible biomarker set, we need to design the assay for high sensitivity, even at the expense of specificity. Our model suggests that for prediction of response with 95% sensitivity the assay would have a specificity between 0.63 and 0.94.

In summary, this is a proof of concept study and further analyses are required to construct and validate the YMMM. As such, a limitation of this work is the absence of validation using cohorts in the literature since no previous cohort has collected both spatial protein and transcriptomic information. Another limitation is the relatively small size of the cohort and the fact that it is comprised of patients that received either single-agent or combination immunotherapy. Future studies are planned to validate the 8-variable YMMM, including retrospective collections of patients with melanoma, as well as other tumor types, treated with ICI and ultimately, prospective clinical trials. In addition, YMMM or similar models should be correlated with other predictive assays, including PD-L1 IHC score and TMB. Finally, in an era where immunotherapy indications are relentlessly expanding, YMMM or similarly constructed mixed modality models could be used to develop predictors for single agents or therapeutic combinations that may have distinct, compartment-specific mechanisms of action.

Methods

Tissue microarray and patient cohorts

Tissue specimens were prepared in a tissue microarray (TMA) format as described previously.²⁶ After review by a board-certified pathologist, representative 0.6 mm cores from areas with high tumor content were obtained from FFPE specimens and arrayed in a recipient block. FFPE normal tissue was used as a control. All specimens were collected from the Yale Pathology archives. The study cohort (YTMA376) is a retrospective collection of 59 pretreatment melanoma tumor specimens resected between 2011 and 2016. Uveal melanoma was excluded. The corresponding patients were treated with anti-PD-1 (nivolumab, pembrolizumab) or combination (ipilimumab plus nivolumab) immunotherapy in the metastatic setting at Yale Cancer Center. Clinicopathological data were collected from clinical records and pathology reports; the data cut-off date was September 1, 2017.²⁷ Response Evaluation Criteria in Solid Tumors (RECIST) 1.1 were used to determine best overall response (BOR) as complete response (CR), partial response (PR), stable disease (SD), or progressive disease (PD), and objective response rate (ORR; CR/PR), durable clinical benefit rate (DCBR; CR/PR/SD ≥ 6 months), disease control rate (DCR; CR/PR/SD).²⁸ All patients provided written informed consent or waiver of consent. The study was approved by the Yale Human Investigation Committee protocol #9505008219 and conducted in accordance with the Declaration of Helsinki.

mRNA gene expression

For the gene expression analysis, pretreatment FFPE whole tissue sections from the 59 melanoma patients included in YTMA 376 were employed. Two slides from each patient were macrodissected and RNA was extracted. The mRNA transcripts were hybridized to 4-color, 6-spot optical barcodes, exclusive for each of the targets included in the 770-plex PanCancer IO360 panel. Barcodes were then measured by a fluorescence microscope on the nCounter platform. Finally, RNA counts were normalized for technical efficiency by the geometric mean of internal control probes, and then, to account for sample-specific RNA content, against the geometric mean housekeeping genes present on the panel. For analysis, normalized counts were log₂ transformed.

Digital spatial profiling

The NanoString DSP is a novel platform that allows spatially-resolved, high-plex quantitative measurement of target proteins on a single FFPE slide. In this study, TMA slides were incubated with cocktails of 44 unique, previously validated, oligonucleotide-conjugated antibodies (Extended Data Table 3). Each TMA spot was represented by a unique region of interest (ROI). We hypothesized that immune markers, including immune checkpoints, have differential expression patterns among immune cell populations that comprise the tumor microenvironment and carry different predictive significance with respect to the cell type that they are expressed. So, on each ROI, different compartments, called areas of interest (AOI), were created based on fluorescent staining with antibodies targeting s100 with HMB45 for melanocytes, CD45 for tumor-infiltrating leukocytes, and CD68 for tumor-infiltrating macrophages (Extended Data Fig. 2). Oligos from each AOI were then released upon exposure to UV light. Photocleaved oligos were collected via microcapillary tube inspiration by sequential assignment of the CD68 + , CD45 + , and finally s100/HMB45 + AOI and transferred into a microwell plate with a spatial resolution of approximately 10 mm. Photocleaved oligos were then hybridized to 4-color, 6-spot optical barcodes producing uniquely labeled tags per AOI for each of the 44 antibodies included in the original mix. Digital counts from barcodes corresponding to protein probes were first normalized with internal positive and negative controls to account for system variation, and then normalized to the area of their compartment.

Statistical analysis and predictive model generation

After excluding five controls from the analysis including Histone H3, Mouse IgG1, Mouse IgG2a, Rabbit IgG, and S6, a total of 887 targets (770 + (44 – 5) × 3) remained to build an elastic net regularized regression model for predicting BOR. A more predictive subset of variables (n = 527) was formed with p-value less than 0.10 in univariate logistic regression models. To minimize the multicollinearity issue among 527 predictors, an iterative pruning procedure was performed by ranking predictors in descending order of its univariate R-squared in predicting BOR and only keeping those with the highest AUC by removing other moderately correlated predictors (correlation coefficient > 0.7). Therefore, only 72 predictors with pair-wise correlation coefficients less than 0.7 remained to enter the next phase of modeling training to tune two important parameters in regularization models, the elastic net mixing parameter α and the regularization parameter λ. Melanoma tumor specimens were split into 80% training set and 20% testing set stratified by BOR. Models were built on the training set in which α and λ were tuned simultaneously in four-fold cross-validation.²⁹ AUC values were used to evaluate model performance and to select the optimal parameters. The process was performed by looping across levels of α ranging from 0 to 1 in steps of 0.05 in which λ was selected at the highest value of AUC for a given α. To stabilize the tuning process for the parameter determination, the previous looping step was performed for 40 replicates to obtain the maximized averaged AUC values for the best combination of parameters (α = 0.15, λ = 0.642). To further reduce overfitting on training a small dataset, instead of fitting the entire data with the tuned parameters, an optimal subset of predictors was constructed by those most frequently selected predictors, with non-zero coefficients, from fitting the elastic net models on bootstrapped data over 1000 replicates, which returned a model size of 59 at its median value. Then, α = 0.15, λ = 0.642 were applied to the data consisting of the top 59 most frequently selected predictors in which 44 of 59 returned non-zero coefficients. Results of utilizing both proteins and bulk mRNA were compared to two other scenarios where either only proteins or bulk mRNA were used in model building. To further assess the predictive performance of different combinations within these 44 targets, the top ten highest AUC with corresponding targets were recorded to determine the most predictive subset over a range of the number of desirable predictors, K, from 4 to 13. It is noted that the following results are based on the coefficient derived from the final model without refitting any new models. When K is greater than five, 2,000,000 unique combinations were created using the Monte Carlo method instead of an exhaustive search of all combinations of all predictors. Among all possible combinations of a given size of K, predictors were ranked by its frequency based on the results from the top ten highest AUC value in which K number of predictors were selected. To calculate the 95% confidence intervals of AUC, sensitivity, specificity, positive predictive value, and negative predictive value the smoothed bootstrap from the kernel boot package was applied to draw samples with replacement from the empirical distribution for 1,000 times which estimates the uncertainty of each measurement.³⁰ The best subset of eight predictors (CCNO, ID4, IER3, IL2RB, MGMT, NRDE2, TNFAIP6, and MSH2 in s100/HMB45) was selected which had the largest improvement on the AUC value. The variable importance was calculated based on the decrease in AUC after 1,000 replicates of permutation in each predictor.³¹ Signature scores of these eight predictors, the sum of the product of expression level and coefficients, were used to estimate the AUC value and 95% CI in predicting BOR. Kaplan-Meier analyses were performed on overall survival and progression-free survival data between high-score groups and low-score groups, which the cutoff of scores was determined at the highest Youden’s index in predicting BOR.³² The entire analysis was performed using R 3.6.3.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data generated and analyzed during this study are described in the following data record: https://doi.org/10.6084/m9.figshare.14345894.³³ The data are housed in Yale AQUAmine in the files ‘376_2_1_Nanostring_IO360_panel_IxV.txt’, ‘376_1_3_Nanostring_2nd_run_immune_panel_mxt.txt’ and ‘376_3_2_Nanostring_immune_panel_mxt.txt’. These files are not publicly available as they contain information that could compromise research participant privacy. However, the data can be made available upon reasonable request to the corresponding author Dr David L Rimm.

Code availability

The data were processed and analyzed using R version 3.6.3 which is tested on both Linux and Windows systems. The R packages and versions used are kernelboot(0.1.7), caret(6.0-86), lattice(0.20-41), OptimalCutpoints(1.1-4), glmnetUtils(1.1.5), patchwork(1.0.0), survminer(0.4.6), ggpubr(0.2.5), magrittr(1.5), pROC(1.16.2), DT(0.13), glmnet(3.0-2), Matrix(1.2-18), survival(3.1-8), pheatmap(1.0.12), ggrepel(0.8.2), ggplot2(3.3.0), readxl(1.3.1), and rsq(1.1). The datasets generated and/or analyzed during the current study are available in the data folder of the GitHub repository, https://github.com/Nanostring-Biostats/TSCOL_0137-Vathiotis_Rimm.

References

Larkin, J. et al. Combined Nivolumab and Ipilimumab or monotherapy in untreated melanoma. N. Engl. J. Med. 373, 23–34 (2015).
Article Google Scholar
Larkin, J. et al. Five-year survival with combined nivolumab and ipilimumab in advanced melanoma. N. Engl. J. Med. 381, 1535–1546 (2019).
Article CAS Google Scholar
Tawbi, H. A. et al. Combined Nivolumab and Ipilimumab in melanoma metastatic to the brain. N. Engl. J. Med. 379, 722–730 (2018).
Article CAS Google Scholar
Wolchok, J. D. et al. Nivolumab plus ipilimumab in advanced melanoma. N. Engl. J. Med. 369, 122–133 (2013).
Article CAS Google Scholar
Lu, S. et al. Comparison of biomarker modalities for predicting response to PD-1/PD-L1 checkpoint blockade: a systematic review and meta-analysis. JAMA Oncol. 5, 1195–1204 (2019).
Article Google Scholar
Daud, A. I. et al. Programmed death-ligand 1 expression and response to the anti-programmed death 1 antibody pembrolizumab in melanoma. J. Clin. Oncol. 34, 4102–4109 (2016).
Article CAS Google Scholar
Hirsch, F. R. et al. PD-L1 immunohistochemistry assays for lung cancer: results from phase 1 of the blueprint PD-L1 IHC assay comparison project. J. Thorac. Oncol. 12, 208–222 (2017).
Article Google Scholar
Topalian, S. L. et al. Safety, activity, and immune correlates of anti-PD-1 antibody in cancer. N. Engl. J. Med. 366, 2443–2454 (2012).
Article CAS Google Scholar
Conroy, J. M. et al. Analytical validation of a next-generation sequencing assay to monitor immune responses in solid tumors. J. Mol. Diagn. 20, 95–109 (2018).
Article CAS Google Scholar
Gubin, M. M. et al. Checkpoint blockade cancer immunotherapy targets tumour-specific mutant antigens. Nature 515, 577–581 (2014).
Article CAS Google Scholar
Rizvi, N. A. et al. Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science 348, 124–128 (2015).
Article CAS Google Scholar
Ayers, M. et al. IFN-γ-related mRNA profile predicts clinical response to PD-1 blockade. J. Clin. Invest. 127, 2930–2940 (2017).
Article Google Scholar
Jiang, P. et al. Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat. Med. 24, 1550–1558 (2018).
Article CAS Google Scholar
Morrison, C. et al. Predicting response to checkpoint inhibitors in melanoma beyond PD-L1 and mutational burden. J. Immunother. Cancer 6, 32 (2018).
Article Google Scholar
Cristescu, R., et al. Pan-tumor genomic biomarkers for PD-1 checkpoint blockade-based immunotherapy. Science 362, 3593 (2018)
Toki, M. I. et al. High-plex predictive marker discovery for melanoma immunotherapy-treated patients using digital spatial profiling. Clin. Cancer Res. 25, 5503–5512 (2019).
Article CAS Google Scholar
Patel, D. et al. Inhibitor of differentiation 4 (ID4): From development to cancer. Biochim. Biophys. Acta 1855, 92–103 (2015).
CAS PubMed Google Scholar
Garcia, M. N. et al. IER3 supports KRASG12D-dependent pancreatic cancer development by sustaining ERK1/2 phosphorylation. J. Clin. Invest. 124, 4709–4722 (2014).
Article CAS Google Scholar
Guang, S. et al. Small regulatory RNAs inhibit RNA polymerase II during the elongation phase of transcription. Nature 465, 1097–1101 (2010).
Article CAS Google Scholar
Yang, M. et al. NK cell development requires Tsc1-dependent negative regulation of IL-15-triggered mTORC1 activation. Nat. Commun. 7, 12730 (2016).
Article Google Scholar
Wisniewski, H. G. & Vilcek, J. TSG-6: an IL-1/TNF-inducible protein with anti-inflammatory activity. Cytokine Growth Factor Rev. 8, 143–156 (1997).
Article CAS Google Scholar
Chen, D. S. & Mellman, I. Elements of cancer immunity and the cancer-immune set point. Nature 541, 321–330 (2017).
Article CAS Google Scholar
Lebeau, A. et al. Her-2/neu analysis in archival tissue samples of human breast cancer: comparison of immunohistochemistry and fluorescence in situ hybridization. J. Clin. Oncol. 19, 354–363 (2001).
Article CAS Google Scholar
Wolff, A. C. et al. Recommendations for human epidermal growth factor receptor 2 testing in breast cancer: American Society of Clinical Oncology/College of American pathologists clinical practice guideline update. J. Clin. Oncol. 31, 3997–4013 (2013).
Article Google Scholar
Cameron, D. et al. 11 years’ follow-up of trastuzumab after adjuvant chemotherapy in HER2-positive early breast cancer: final analysis of the HERceptin Adjuvant (HERA) trial. Lancet 389, 1195–1205 (2017).
Article CAS Google Scholar
Camp, R. L., Charette, L. A. & Rimm, D. L. Validation of tissue microarray technology in breast carcinoma. Lab Invest 80, 1943–1949 (2000).
Article CAS Google Scholar
Wong, P. F. et al. Multiplex Quantitative Analysis of Tumor-Infiltrating Lymphocytes and Immunotherapy Outcome in Metastatic Melanoma. Clin. Cancer Res 25, 2442–2449 (2019).
Article CAS Google Scholar
Eisenhauer, E. A. et al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur. J. Cancer 45, 228–247 (2009).
Article CAS Google Scholar
Polansky, A. M. & Schucany, W. R. Kernel smoothing to improve bootstrap confidence intervals. J. R. Stat. Soc. 59, 821–838 (1997).
Article Google Scholar
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Article Google Scholar
Janitza, S., Strobl, C. & Boulesteix, A. L. An AUC-based permutation variable importance measure for random forests. BMC Bioinforma. 14, 119 (2013).
Article Google Scholar
Therneau, T. M., Grambsch, Patricia M., Modeling Survival Data: Extending the Cox Model. 2000: Springer.
Vathiotis, I. A. et al. Metadata record for the manuscript: Models that Combine Transcriptomic with Spatial Protein Information Exceed the Predictive Value for Either Single Modality. figshare https://doi.org/10.6084/m9.figshare.14345894 (2021).

Download references

Acknowledgements

This work was supported by funds from Eli Lilly and Company and Yale Specialized Programs of Research Excellence in Lung Cancer. Dr. Vathiotis was supported by a scholarship from the Hellenic Society of Medical Oncologists (HESMO). The authors thank Lori A. Charette and the staff of Yale Pathology tissue services for expert histology services.

Author information

Authors and Affiliations

Department of Pathology, Yale School of Medicine, New Haven, CT, USA
Ioannis A. Vathiotis, Maria Toki, Thazin Nwe Aung, Pok Fai Wong & David L. Rimm
Yale Cancer Center, Yale School of Medicine, New Haven, CT, USA
Ioannis A. Vathiotis, Maria Toki, Thazin Nwe Aung, Pok Fai Wong, Harriet Kluger & David L. Rimm
NanoString Technologies, Seattle, WA, USA
Zhi Yang, Jason Reeves & Sarah Warren
Section of Medical Oncology, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
Harriet Kluger
Department of Medicine, National and Kapodistrian University of Athens School of Medicine, Athens, Greece
Konstantinos N. Syrigos

Authors

Ioannis A. Vathiotis
View author publications
You can also search for this author in PubMed Google Scholar
Zhi Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jason Reeves
View author publications
You can also search for this author in PubMed Google Scholar
Maria Toki
View author publications
You can also search for this author in PubMed Google Scholar
Thazin Nwe Aung
View author publications
You can also search for this author in PubMed Google Scholar
Pok Fai Wong
View author publications
You can also search for this author in PubMed Google Scholar
Harriet Kluger
View author publications
You can also search for this author in PubMed Google Scholar
Konstantinos N. Syrigos
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Warren
View author publications
You can also search for this author in PubMed Google Scholar
David L. Rimm
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study concept design: D.L.R. Acquisition, analysis, or interpretation of data: I.A.V., M.T., P.F.W., D.L.R. Statistical analysis: Z.Y., J.R., T.N.A. Critical revision of the manuscript for important intellectual content: all authors. All authors read and approved the final manuscript.

Corresponding author

Correspondence to David L. Rimm.

Ethics declarations

Competing interests

Z.Y., J.R., S.W. are employees of Nanostring. D.L.R. reports grants from Eli Lilly Co, during the conduct of the study; personal fees from Amgen, grants and personal fees from Astra Zeneca, personal fees from Biocept, personal fees from BMS, personal fees from Cell Signaling Technology, grants and personal fees from Cepheid, personal fees from Daiichi Sankyo, personal fees from GSK, grants and personal fees from Konica Minolta, personal fees from Merck, personal fees and non-financial support from Nanostring, grants and personal fees from NextCure, personal fees from Odonate, personal fees from PAIGE.AI, personal fees from Roche, personal fees from Sanofi, personal fees from Ventana, grants and personal fees from Ultivue, outside the submitted work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vathiotis, I.A., Yang, Z., Reeves, J. et al. Models that combine transcriptomic with spatial protein information exceed the predictive value for either single modality. npj Precis. Onc. 5, 45 (2021). https://doi.org/10.1038/s41698-021-00184-1

Download citation

Received: 31 August 2020
Accepted: 16 April 2021
Published: 28 May 2021
DOI: https://doi.org/10.1038/s41698-021-00184-1

This article is cited by

Baseline gene expression profiling determines long-term benefit to programmed cell death protein 1 axis blockade
- Ioannis A. Vathiotis
- Leonidas Salichos
- David L. Rimm
npj Precision Oncology (2022)
MCM3 is a novel proliferation marker associated with longer survival for patients with tubo-ovarian high-grade serous carcinoma
- Eun Young Kang
- Joshua Millstein
- Martin Köbel
Virchows Archiv (2022)