Skip to main content

Translation of the 27-gene immuno-oncology test (IO score) to predict outcomes in immune checkpoint inhibitor treated metastatic urothelial cancer patients



The IO Score is a 27-gene immuno-oncology (IO) classifier that has previously predicted benefit to immune checkpoint inhibitor (ICI) therapy in triple negative breast cancer (TNBC) and non-small cell lung cancer (NSCLC). It generates both a continuous score and a binary result using a defined threshold that is conserved between breast and lung. Herein, we aimed to evaluate the IO Score’s binary threshold in ICI-naïve TCGA bladder cancer patients (TCGA-BLCA) and assess its clinical utility in metastatic urothelial cancer (mUC) using the IMvigor210 clinical trial treated with the ICI, atezolizumab.


We identified a list of tumor immune microenvironment (TIME) related genes expressed across the TCGA breast, lung squamous and lung adenocarcinoma cohorts (TCGA-BRCA, TCGA-LUSQ, and TCGA-LUAD, 939 genes total) and then examined the expression of these 939 genes in TCGA-BLCA, to identify patients as having high inflammatory gene expression. Using this as a test of classification, we assessed the previously established threshold of IO Score. We then evaluated the IO Score with this threshold in the IMvigor210 cohort for its association with overall survival (OS).


In TCGA-BLCA, IO Score positive patients had a strong concordance with high inflammatory gene expression (p < 0.0001). Given this concordance, we applied the IO Score to the ICI treated IMvigor210 patients. IO Score positive patients (40%) had a significant Cox proportional hazard ratio (HR) of 0.59 (95% CI 0.45–0.78 p < 0.001) for OS and improved median OS (15.6 versus 7.5 months) compared to IO Score negative patients. The IO Score remained significant in bivariate models combined with all other clinical factors and biomarkers, including PD-L1 protein expression and tumor mutational burden.


The IMvigor210 results demonstrate the potential for the IO Score as a clinically useful biomarker in mUC. As this is the third tumor type assessed using the same algorithm and threshold, the IO Score may be a promising candidate as a tissue agnostic marker of ICI clinical benefit. The concordance between IO Score and inflammatory gene expression suggests that the classifier is capturing common features of the TIME across cancer types.


Immune checkpoint inhibitors targeting programmed cell death 1 or its ligand, programmed cell death 1 ligand 1, (PD-1 or PD-L1, respectively) are active in many different tumor types and may modulate the immune response by impacting a common immune regulatory pathway. To date, observed benefit across tumor types has been modest despite co-development of candidate biomarkers for identifying responders to ICI therapy, including immunohistochemistry (IHC) for PD-L1 and tumor mutation burden (TMB). These biomarkers are limited in their value as predictive markers due to inconsistent scoring methods and differing thresholds for positivity, often specifically tailored to tumor types or even clinical studies [1,2,3]. Additionally, ICIs are a systemic therapy which are impacted by the TIME [4,5,6]. Therefore, a biomarker that provides a phenotypic classification of the TIME may be useful to better identify tumors poised to respond to ICI therapeutics.

Atezolizumab was the first FDA approved ICI in mUC based on the initial results of IMvigor210, a non-randomized Phase II clinical trial involving two different cohorts of mUC. Cohort I (NCT02951767) was comprised of 119 patients who were ineligible to receive cisplatin [7] and Cohort II (NCT02108652) was comprised of 310 patients whose disease had progressed on or after platinum therapy [8]. Based on this study, atezolizumab was approved in May of 2016 under an accelerated approval pathway for platinum refractory bladder cancers (Cohort II) independent of PD-L1 status even though the overall response rates were more favorable in those patients who had PD-L1 staining in  ≥ 5% in immune cells (IC2/3). Unfortunately, a Phase 3 confirmatory study, IMvigor211 (NCT02302807), using the primary endpoint of OS where patients with IC2/3 status were tested first in a hierarchical sequence, failed to reach significance [9]. In 2018, Mariathasan and colleagues published the gene expression data for 348 of IMvigor210 patients, representing a mix of both cohorts, along with exploratory analyses for the association with outcome using candidate genomic biomarker signatures and clinical features in a search for biomarkers that might inform response [10].

The IO Score has been validated as a biomarker that identifies patients likely to benefit from ICIs in TNBC and NSCLC [11,12,13,14,15]. These studies were conducted across tumor types using the same threshold of positivity for IO Score. The IO Score was derived from an unsupervised classification of triple negative breast cancer (TNBCtype) specimens [14,16,, 16, 18]. Two subtypes originally thought to describe intrinsic features of the tumor (the immunomodulatory or IM and the mesenchymal stem-like or MSL, respectively) were later recognized as classifiers that distinguish tumor infiltrating lymphocytes and stromal features of a particular tumor’s TIME, respectively. A third subtype, the mesenchymal (M), likely reflects tumors that have undergone some level of epithelial-to-mesenchymal transition (EMT) [17, 19], which in turn is associated with tumor resistance to the immune system [20,21,22,23,24]. Together, the unique interaction of these components defines the IO Score.

While the IO Score was developed in TNBC, its three components are not unique to TNBC but collectively measure ubiquitous features across tumors of epithelial origin. Given that urothelial carcinoma is likewise of epithelial origin, we sought to test the clinical utility of the IO Score to identify patients who benefit from the administration of ICIs in bladder cancer. We first evaluated the predefined IO Score threshold using the IM, MSL, and M subtypes to classify ICI naïve samples from TCGA-BLCA. Using the predefined threshold for IO Score we then tested its association with treatment outcome in patients from the IMvigor210 study. Finally, we compared these results to the previously published extensive exploratory analysis of candidate biomarkers, biomarker signatures, and clinical factors [10].


Data access, software and statistical calculations

R version 4.0.2 (2020-06-22) and R-Studio Version 1.0.143 were used for all data manipulation and calculations [25, 26]. Heatmaps were generated using the ComplexHeatmap package in R [27]. TCGA datasets containing the whole transcriptome RNA-seq gene expression data for TCGA-BRCA, TCGA-LUAD, TCGA-LUSC, and TCGA-BLCA were downloaded using the GenomicDataCommons and TCGAbiolinks packages in R [2831]. The expression data was filtered for identifiable HUGO Gene Nomenclature Committee (HGNC) gene symbols, non-redundancy, and variation across tumor samples. If a single gene had multiple transcripts which met the above criteria the expression across samples were averaged and this, depending upon the dataset, resulted in approximately 34,000 uniquely differentiated members. The four data sets were scaled and log transformed.

The IMvigor210 data including gene expression, clinical data, and alternative biomarker signatures, were made available by the IMvigor210 investigators [10] who created a customized analysis package in R IMvigor210 CoreBiologies [32], which was utilized to download the entire dataset and normalize the gene expression. The R packages survival [33], prodlim [34], and MASS [35] were used for all survival analyses, median follow-up and ordinal regression, respectively. Chi-square testing was used to determine significance testing when comparing response groups and other categorical variables. Bivariate testing comparing the IO Score with clinical factors or genomic signatures was performed by adding both factors as independent variables to a Cox proportional hazard equation. When performing bivariate analysis for comparison between IO Score and the genomic signatures explored by [10], and a binary threshold was not defined for the latter, the median value was used.

Kaplan–Meier estimates were used to calculate median OS and percentage alive at end of study (24.5 months), and Cox proportional hazards was used to calculate hazards with the Efron method used for handling ties.

Assessment of the established IO score threshold in bladder cancer TIME classification

Expression data from TCGA—TCGA-BRCA, TCGA-LUSQ, and TCGA-LUAD—were log transformed and then scaled by patient and gene axes. The 101-gene centroids for the IM, M, and MSL subtypes were used to generate patient specific scores in TCGA-BRCA, TCGA-LUAD, TCGA-LUSQ [18] for each classification. The expression pattern of each of the 34,000 protein coding and non-coding transcripts across each tissue type dataset were then compared to the IM, MSL and M patient phenotypes (Pearson correlation coefficient). The top 1000 correlated genes for each subtype in each of the three tissues were selected. All selected genes were statistically significant but no correction for multiple hypotheses was performed. The three gene sets were compared and the 939 that were common between all three selected gene sets was selected as the expanded TIME classifier gene set (Fig. 1).

Fig. 1
figure 1

Isolating a Tumor Immune Microenvironment (TIME) Gene Set A Downloaded the full dataset of gene expression data in Breast Cancer from the TCGA Database; B Compared against an established 101-gene molecular classifier for TNBC [18] to correspond with three known signatures: IM, M, and MSL, and selected the top 1,000 genes correlated with each of the IM, M, and MSL signatures; C Repeated with TCGA gene expression data sets from Lung Adenocarcinoma and Lung Squamous Cell Carcinoma; D Identified top 1,000 genes for each signature in each additional cancer dataset; E Intersection of top 1,000 genes in each subtype between three cancers yielded a total of 939 genes

The TCGA-BLCA bladder cancer gene expression was filtered to these 939 genes (Fig. 2A). Each of the 406 bladder cancer tumor samples were assigned a phenotypic score for each of IM, MSL, and M using the 101 gene centroids. K-means clustering was performed for the 939 genes with k = 3, using supplied centers created with these tumor phenotypic scores. This assigned each of the 939 genes to one of the IM, M, or the MSL clusters. K-means was performed on the x-axis (tumor samples), with k = 3 and the centers derived by traditional k-means methodology (Fig. 2B). Hierarchical clustering was performed within the IM, M and MSL k-means gene clusters on the y-axis and the three k-means tumor clusters on the x-axis. (Fig. 2C).

Fig. 2
figure 2

Process of Clustering TCGA Bladder Cancer Data by TIME Phenotype A Unclustered TCGA bladder cancer RNA-Seq data set of the previously identified 939 TIME genes; B Patients are put into three groups on both the x- and y-axes via k-means clustering; C Hierarchical clustering is performed within each of the three groups on the x- and y-axes, respectively; D Each patient’s IO Score is overlaid above the heatmap, both as a continuous variable (top bar) and as a binary score with a threshold of 0.09

The gene expression data for the 27 genes used to generate the IO Score were extracted and the IO algorithm was applied to the 406 TCGA-BLCA tumor samples, giving each sample a continuous score (range of − 1 to 1) then, a binary score (IO Score + /IO Score-) if this continuous score was equal or greater than the previously established threshold of 0.09 (Fig. 2D). Thresholds were evaluated by determining the value where the difference between sensitivity and specificity was minimized [36]. Mathematically, this can be expressed as the minimum value of the output of the following equation:

$$\sum_{1}^{Total \,Samples}absolute\, value(sensitivity\left(x\right) - specificity(x))$$

where x represents applying every possible threshold to the 406 IO Scores from the TCGA-BLCA tumor samples. We then calculated the accuracy with both the existing threshold and the calculated threshold and tested their difference by a statistical test for proportions.

Analysis of the IMvigor210 data

The whole transcriptome gene expression, and clinical data, including response and survival data, from the IMvigor210 study have been released for 76 of the platinum contra-indicated Cohort I patients and 272 of the platinum refractory Cohort II patients for a total of 348 patients [10]. The 27 gene IO Score was calculated as previously described [14] using the pre-established threshold of positivity to classify patients as IO Score positive or negative. Kaplan–Meier curves were plotted to estimate OS for IO Score and Cox proportional hazard analysis was used to calculate the HR and 95% CIs. The mean of the continuous variable of the IO Score was compared to four categories of objective response—complete response, partial response, stable disease, and progressive disease (CR, PR, SD, and PD, respectively) using either a t-test of means between groups or ordinal regression for a trend in all four groups. Finally, the IO Score was tested in a series of bivariate models with reported clinical factor and relevant biomarker classification and the Cox proportional hazard method was used to calculate HRs using the log-rank method for significance.


The TCGA-BLCA tumor samples were classified into IM, MSL, and M subtypes using the expanded TIME classifier. We then compared these classifications against the IO continuous and binary scores using the IM subtype (high inflammatory gene expression) as the surrogate classifier for IO Score positive (Fig. 2D). In total, 125 of 406 TCGA-BLCA tumor samples clustered with the high IM expressing genes. The MSL and M subtypes were surrogates for IO Score negative. In total, 151 patients clustered with the MSL genes, and 130 patients clustered with the M genes. The distribution of the IO Score positive patients within each of these clusters was 113 (90.4%), 22 (14.6%), and 25 (19.2%) for IM, MSL, and M respectively, showing a strong concordance between IO Score positive patients and the IM subtype (Fig. 3A, p < 0.0001, chi-squared test).

Fig. 3
figure 3

IO Score Positivity Among TIME Phenotype and Confirming Binary IO Score Threshold A Percentage of IO Score positive cases in each TIME phenotype (IM, MSL, and M); B Calculation of a new theoretical threshold for positivity of the IO Score. Positive IO Score in the IM phenotype and negative scores in the M and MSL phenotypes are considered true positives and true negatives, respectively. Using the intersection of sensitivity and specificity where both the actual threshold and the new theoretical threshold are then used to calculate overall accuracy

In order to confirm the predefined IO Score threshold we generated a threshold specific to the TCGA-BLCA cohort by comparing the minimal distance between sensitivity and specificity for classification of IO Score positive patients (IM) versus IO Score negative patients (M or MSL) and obtained a theoretical, optimal binary classification threshold of 0.142 (Fig. 3B) [36]. The overall accuracy for this new threshold for classification of IO Score positive patients into the IM subtype was 87%. There was no statistical difference between the calculated threshold and the previously established threshold of 0.09 (p = 0.42, t-test of proportions). Based on these data, we continued to implement the pre-defined threshold of 0.09 previously validated in TNBC and NSCLC to analyze the clinical response data from the IMvigor210 clinical trial cohort.

The IMvigor210 combined platinum resistant and platinum contraindicated patients for a total of 429 subjects of which 348 patients were successfully RNA sequenced [4]. For these 348 patients, the median follow-up time was 20.6 months (95% CI 18.0–21.7) and the median survival was 8.8 months (95% CI 7.4–10.6). OS at the end of study (24.5 months) was 28.6% (95% CI 23.8–34.5%). In total, 40% of the genomic cohort had positive IO Scores. Median OS was 15.6 months (95% CI 10.0-NR) for IO Score positive patients versus 7.5 months (95% CI 6.0–9.2) for IO Score negative patients (Fig. 4A). IC2/3 positive subjects comprised 34% of the cohort. IC2/3 positive patients had a median survival of 12.8 months (95% CI 9.9-NA) versus 7.7 (95% CI 5.9–9.2) for IC2/3 negative subjects (Fig. 4B). The HRs for IO Score and IC2/3 positive subjects were similar as was the percentage of patients alive at end of study: HR for OS at the end of the study for IO Score and IC2/3 were 0.59 (95% CI 0.45–0.78, p < 0.001) and 0.62 (95% CI 0.47–0.82), respectively. At the end of the study, 41.5% (95% CI 33.5–51.4%) of IO positive subjects and 20% (95% CI 14.6–27.5%) of IO Score negative subjects were still alive. Similarly, 41.4% (33.2–51.6%) of IC2/3 positive subjects and 21% (15.4–28.7%) of IC2/3 negative subjects were alive at end of study.

Fig. 4
figure 4

Kaplan Meier Curves for IO Score, PD-L1 IC2/3, and Combined Dual-Positive Cases A Kaplan Meier curves for IO Score with HR and OS results; B Kaplan Meier curves for IC2/3 with HR and OS results; C Kaplan Meier curves for combined analysis of IO Score-positive AND IC2/3-postive cases with HR and OS results. Dual positive patients had a median OS of 17. 8 months compared to 12.8 (difference of 4.9 months) and an HR of 0.48 compared to 0.62. D Summary statistics for IO for the three Kaplan–Meier estimates

Given the failure of the IMvigor211 to reach the primary endpoint of improved OS in IC2/3 patients, we performed an exploratory analysis to determine if IC2/3 positive patients who were also IO Score positive has better OS (Fig. 4C). These “double positive” patients were 24% of the total cohort, had a median OS of 17.8 months (12.8-NA) with an HR for OS of 0.48 (95% CI 0.34–0.68) and an end of study survival of 48% (95% CI 38.2–60.5%). “Double negative” patients were 50% of the population with median OS of 7.5 month (95% CI 5.7–9.6) and end of study survival of 18.7% (95% CI 12.9–27.2%).

In order to test whether IO Score was quantitatively related to objective response, the continuous IO Score was plotted against the four objective response categories (Additional file 1: Figure S1). The scores were significantly different when comparing the average IO continuous score for patients with a CR to the average of the continuous score patients with a PR, SD, and PD (p < 0.005), when response was compared to non-response (CR/PR versus SD/PD, p < 0.005), or when disease control was compared to progressive disease (CR/PR/SD to PD, p < 0.01). There was also a significant ordinal trend for IO Score across the objective response categories when ordered by decreasing response (p < 0.001).

Mariathasan et al. explored 28 different published clinical factors and biomarker signatures in the IMvigor210 cohort [10], including nineteen genomic signatures. IO Score was tested in a series of bivariate equations with each of these factors and maintained significance with every explored biomarker (Fig. 5A and B and Additional file 2: Figure S2A, Additional file 3: Figure S2B, Additional file 4: Figure S2C) and five of these biomarkers maintained significance with the IO Score. Of the standard clinical factors, ECOG status and regional versus distant metastasis maintained significance with the IO Score. Of the twenty-one genomic signatures, two maintained significance with the IO Score—a previously established, prognostic bladder cancer classifier (Lund classification [37, 38] HR = 1.08 p < 0.025 classification) and a cell cycle regulatory signature (HR = 1.34 p < 0.05). IC2/3 narrowly missed significance for OS in this cohort (HR = 0.53 p = 0.06). TMB was independent of IO Score when using either the previously FDA pan-cancer established threshold of  ≥ 10 mut/MB (TMB-high) (HR = 0.65 p < 0.01) or the study specific top quartile threshold fit to the IMvigor210 dataset (HR = 0.53 p < 0.005) (Additional file 5: Figure S3A). Of note, in the IO Score/TMB-high multivariate analysis, neither HR was significantly different from its univariate HR (univariate HRs 0.59/0.57 for IO Score/TMB-high, respectively; multivariate HRs 0.57/0.65) leading to the exploratory analysis of a decision tree model, where a patient was deemed “positive” if they were IO Score + or TMB-high and “negative” if they were neither (Additional file 5: Figure S3B). The HR for the decision tree model (n = 272) was similar, equaling 0.579 (95% CI 0.429–0.782, p < 0.001) with the key difference being the percentage of patients identified as positive: 61% by the model versus 40% and 41% for the IO Score and TMB-high respectively [Additional file 5: Figure S3C], with 22% of patients being positive for both IO Score and TMB-high.

Fig. 5
figure 5

IO Score Independence with Various Clinical Factors and Genomic Biomarkers A Series of bivariate Cox proportional hazards with various Clinical Factors. (Note ECOG Status (0, 1, and 2) and Tobacco Use (Never Smokers versus Previous Smokers versus Current Smokers) were treated as increasing risk factors. Metastasis was categorized as lymph node involvement versus either liver or visceral metastasis.); B Series of bivariate Cox proportional hazards with various Genomic Biomarkers. IC1 and IC2/3 is PD-L1 staining in immune cells of  > 1% and  > 5% of cells, respectively. TMB-high and TMB-low refer TMB  ≥ 10 mutations and  < 10 mutations per megabase, respectively. The IO Score maintained its significance in every analysis


An ideal biomarker for predicting ICI benefit would perform well across many tumor types and treatment indications, would be independent of clinicopathologic features, and would improve on currently established biomarkers. The IO Score has been previously shown to be associated with improved response to ICI therapy in NSCLC and TNBC [11,12,13,14,15]. Consistent with the hypothesis that the IO Score classifies three components of the TIME that are common to tumors of epithelial origin, IM, MSL (linked with the expression of cancer associated fibroblasts) and M, we tested whether the IO Score would be applicable to classification of bladder cancer.

We first confirmed the TIME classification function, using our previously established threshold for binary classification, to distinguish those strongly expressing the IM signature from those which were more enriched for MSL or M using TCGA-BLCA gene expression data in ICI-naïve patients. We then tested this validated algorithm and threshold for an association with clinical response to ICI therapy in the IMvigor210 bladder cancer clinical trial cohort. The strong association of the IO Score with OS was independent of other explored biomarkers, including both TMB and PD-L1.

The IMvigor210 study resulted in an accelerated, non-biomarker contingent approval for atezolizumab in platinum-refractory patients but was later withdrawn based on a negative confirmatory trial in the IC2/3 biomarker selected patient population (IMvigor211) [9]. This withdrawal highlights the need for careful selection of a biomarker that demonstrates consistent biology when being used for patient selection in a trial. We hypothesize that using a biomarker such as IO Score to enrich for likely responders would minimize the risk of clinical trial failures. Interestingly, one potential use of the IO Score is to combine it with other biomarkers. In the case of IC2/3, the combination of “double positives” increased median survival by 4.9 months, decreased the HR by 0.12, and increased the percentage of patients alive at end of study by 7% over IC2/3 alone. In the case of TMB and IO Score, where the bivariate analysis showed both markers were independent with little change in HR, the analysis was the inverse; a patient was deemed “positive” if he or she were positive for at least one of the biomarkers. This decision tree model led to an increase in the number of patients deemed positive overall (61% versus the 40% and 41% for IO Score and TMB-high, respectively) with little loss in clinical performance. Mariathasan et al. [10] attempted to identify biomarkers from the IMvigor210 trial to enrich for likely responders and yielded several potential candidates. While several of these signatures were significant in univariate models, one striking observation from our study is that of the 21 signatures tested, only two signatures, the Lund signature, and a cell-cycle regulator signature, maintained independence with IO Score (Fig. 5 and Additional file 2: Figure S2).

The demonstrated clinical utility of the IO Score may be due to the unsupervised clustering performed to first identify robust classifiers before proceeding to building the algorithm. This approach to biomarker development is similar to the approach taken by Perou and colleagues who demonstrated the use of hierarchical clustering of gene expression data to identify likely responders to endocrine therapy within hormone positive breast cancer. Their work led to the development of a targeted biomarker panel, PAM50 [39], now an FDA 510 (k) cleared assay to help inform use of adjuvant chemotherapy in addition to endocrine therapy. Partially because PAM50 was built on a robust classifier based on shared components between cancer types, it was confirmed and further defined as an informative biomarker in hormone-sensitive prostate cancer [40, 41].

Given our prior approach and published data in TNBC and NSCLC, we believe that by assessing the IO Score as a classifier of the TIME in ICI naïve patients, we can better inform clinical decision making for mUC patients due to the IO Score’s strong association with response to ICI therapy in multiple tumor types. Future studies in both bladder cancers and additional tumor types using randomized clinical trial samples are warranted.

Availability of data and materials

The gene expression from TCGA (HT-Seq) is publicly available at the Genomic Data Commons Data Portal. The data used here was downloaded on April 20, 2020 (TCGA-BRCA), May 2, 2020 (TCGA-LUSQ and TCGA-LUAD) and December 20, 2020 (TCGA-BLCA), all raw sequencing data required for the IMVigor210 RNA-seq analyses were previously deposited by Mariathasan et al. to the European Genome-Phenome Archive under accession number EGAS00001002556. The source code and processed data used for all analyses presented here have been made available in IMvigor210CoreBiologies (see Methods).



Confidence interval


Complete response


Epithelial-to-mesenchymal transition


Food and drug administration


HUGO Gene Nomenclature Committee


Hazard ratio


Human Genome Organisation


1% or higher of immune cells are positive PD-L1 IHC staining


5% or higher of immune cells are positive PD-L1 IHC staining


Immune checkpoint inhibitor










Mesenchymal stem-like


Metastatic urothelial cancer


Non-small cell lung cancer


Overall survival


Progressive disease


Program cell death-1


Program cell death ligand-1


Partial response


Stable disease


The Cancer Genome Atlas


TCGA bladder carcinoma gene expression dataset


TCGA breast carcinoma gene expression dataset


TCGA lung adenocarcinoma gene expression dataset


TCGA lung squamous cell carcinoma gene expression dataset


Tumor immune microenvironment


Tumor mutation burden


Tumor mutation burden greater than or equal to 10 mutations per megabase


Tumor mutation burden less than 9 mutations per megabase


Triple negative breast cancer


  1. Doroshow DB, Bhalla S, Beasley MB, Sholl LM, Kerr KM, Gnjatic S, et al. PD-L1 as a biomarker of response to immune-checkpoint inhibitors. Nat Rev Clin Oncol. 2021;18(6):345–62.

    Article  CAS  PubMed  Google Scholar 

  2. Lantuejoul S, Sound-Tsao M, Cooper WA, Girard N, Hirsch FR, Roden AC, et al. PD-L1 testing for lung cancer in 2019: perspective from the IASLC pathology committee. J Thorac Oncol. 2020;15(4):499–519.

    Article  CAS  PubMed  Google Scholar 

  3. McGrail DJ, Pilié PG, Rashid NU, Voorwerk L, Slagter M, Kok M, et al. High tumor mutation burden fails to predict immune checkpoint blockade response across all cancer types. Ann Oncol. 2021;32(5):661–72.

    Article  CAS  PubMed  Google Scholar 

  4. Petitprez F, Meylan M, de Reyniès A, Sautès-Fridman C, Fridman WH. The tumor microenvironment in the response to immune checkpoint blockade therapies. Front Immunol. 2020;11:784.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Zhou C, Liu Q, Xiang Y, Gou X, Li W. Role of the tumor immune microenvironment in tumor immunotherapy (Review). Oncol Lett. 2022;23(2):53.

    Article  CAS  PubMed  Google Scholar 

  6. Tang T, Huang X, Zhang G, Hong Z, Bai X, Liang T. Advantages of targeting the tumor immune microenvironment over blocking immune checkpoint in cancer immunotherapy. Signal Transduct Target Ther. 2021;6(1):72.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Balar AV, Galsky MD, Rosenberg JE, Powles T, Petrylak DP, Bellmunt J, et al. Atezolizumab as first-line treatment in cisplatin-ineligible patients with locally advanced and metastatic urothelial carcinoma: a single-arm, multicentre, phase 2 trial. Lancet. 2017;389(10064):67–76.

    Article  CAS  PubMed  Google Scholar 

  8. Rosenberg JE, Hoffman-Censits J, Powles T, van der Heijden MS, Balar AV, Necchi A, et al. Atezolizumab in patients with locally advanced and metastatic urothelial carcinoma who have progressed following treatment with platinum-based chemotherapy: a single-arm, multicentre, phase 2 trial. Lancet. 2016;387(10031):1909–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Powles T, Durán I, van der Heijden MS, Loriot Y, Vogelzang NJ, De Giorgi U, et al. Atezolizumab versus chemotherapy in patients with platinum-treated locally advanced or metastatic urothelial carcinoma (IMvigor211): a multicentre, open-label, phase 3 randomised controlled trial. Lancet. 2018;391(10122):748–57.

    Article  CAS  PubMed  Google Scholar 

  10. Mariathasan S, Turley SJ, Nickles D, Castiglioni A, Yuen K, Wang Y, et al. TGFβ attenuates tumour response to PD-L1 blockade by contributing to exclusion of T cells. Nature. 2018;554(7693):544–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Bianchini GD, Dugo M, Huang C, Egle D, Bermejo B, Seitz RS, Nielsen TJJ, Zamagni C, Thill M, Anton A, Russo S, Ciruelos EM, Schweitzer BL, Greil R, Semiglazov V, Gyorffy B, Valagussa P, Viale G, Callari M, Gianni L. Predictive value of gene-expression profiles (GEPs) and their dynamics during therapy in the NeoTRIPaPDL1 trial. Ann Oncol. 2021;32:S1283–346.

    Article  Google Scholar 

  12. Dugo MH, Chiun-Sheng; Egle, Daniel; Berñejo, Bego a; Zamagni, Claudio; Seitz , Robert S.; Nielsen, Tyler J.; Thill, Marc; Anton, Antonio; Russo, Stefania; Ciruelos, Eva Maria; Schweitzer, Brock L.; Ross, Douglas T.; Galbardi, Barbara; Greil, Richard; Semiglazov, Vladimir; Gyorffy, Balazs; Colleoni, Marco; Kelly, Catherine; Mariani, Gabriella; Lucia Del Mastro; Valagussa, Pinuccia; Viale, Giuseppe; Callari, Maurizio; Gianni, Luca; Bianchini, Giampaolo. editor Predictive value of RT-qPCR 27-gene IO score and comparison with RNA-Seq IO score in the NeoTRIPaPDL1 trial. San Antonio Breast Cancer Symposium. San Antonio, Texas: AACR; 2021.

  13. Iwase T, Blenman KRM, Li X, Reisenbichler E, Seitz R, Hout D, et al. A novel immunomodulatory 27-gene signature to predict response to neoadjuvant immunochemotherapy for primary triple-negative breast cancer. Cancers (Basel). 2021;13(19):4839.

    Article  CAS  Google Scholar 

  14. Nielsen TJ, Ring BZ, Seitz RS, Hout DR, Schweitzer BL. A novel immuno-oncology algorithm measuring tumor microenvironment to predict response to immunotherapies. Heliyon. 2021;7(3):e06438.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Ranganath H, Jain AL, Smith JR, Ryder J, Chaudry A, Miller E, et al. Association of a novel 27-gene immuno-oncology assay with efficacy of immune checkpoint inhibitors in advanced non-small cell lung cancer. BMC Cancer. 2022;22(1):407.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Lehmann BD, Bauer JA, Chen X, Sanders ME, Chakravarthy AB, Shyr Y, et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J Clin Invest. 2011;121(7):2750–67.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Lehmann BD, Jovanović B, Chen X, Estrada MV, Johnson KN, Shyr Y, et al. Refinement of triple-negative breast cancer molecular subtypes: implications for neoadjuvant chemotherapy selection. PLoS ONE. 2016;11(6):e0157368.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Ring BZ, Hout DR, Morris SW, Lawrence K, Schweitzer BL, Bailey DB, et al. Generation of an algorithm based on minimal gene sets to clinically subtype triple negative breast cancer patients. BMC Cancer. 2016;16:143.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Masuda H, Harano K, Miura S, Wang Y, Hirota Y, Harada O, et al. Changes in triple-negative breast cancer molecular subtypes in patients without pathologic complete response after neoadjuvant systemic chemotherapy. JCO Precis Oncol. 2022;6:e2000368.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Seliger B, Massa C. Immune therapy resistance and immune escape of tumors. Cancers. 2021;13(3):551.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Kannan A, Hertweck KL, Philley JV, Wells RB, Dasgupta S. Genetic mutation and exosome signature of human papilloma virus associated oropharyngeal cancer. Sci Rep. 2017;7:46102.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Romeo E, Caserta CA, Rumio C, Marcucci F. The vicious cross-talk between tumor cells with an EMT phenotype and cells of the immune system. Cells. 2019;8(5):460.

    Article  CAS  PubMed Central  Google Scholar 

  23. Xiao Y, Yu D. Tumor microenvironment as a therapeutic target in cancer. Pharmacol Ther. 2021;221:107753.

    Article  CAS  PubMed  Google Scholar 

  24. Chen XH, Liu ZC, Zhang G, Wei W, Wang XX, Wang H, et al. TGF-β and EGF induced HLA-I downregulation is associated with epithelial-mesenchymal transition (EMT) through upregulation of snail in prostate cancer cells. Mol Immunol. 2015;65(1):34–42.

    Article  CAS  PubMed  Google Scholar 

  25. Team RC. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2020.

    Google Scholar 

  26. Team R. RStudio: integrated development environment for R. Boston: RStudio, PBC; 2021.

    Google Scholar 

  27. Gu Z. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Morgan M, Davis Sean. Genomic Data Commons: NIH/NCI Genomic Data Commons Access. 2020.

  29. Colaprico A, Silva TC, Olsen C, Garofano L, Cava C, Garolini D, et al. TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 2016;44(8):e71.

    Article  PubMed  Google Scholar 

  30. Mounir M, Lucchetta M, Silva TC, Olsen C, Bontempi G, Chen X, et al. New functionalities in the TCGAbiolinks package for the study and integration of cancer data from GDC and GTEx. PLoS Comput Biol. 2019;15(3):e1006701.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Silva TC, Colaprico A, Olsen C, D’Angelo F, Bontempi G, Ceccarelli M, et al. TCGA workflow: analyze cancer genomics and epigenomics data using Bioconductor packages. F1000Res. 2016;5:1542.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Nickles DB, Richard. IMvigor210corebiologies: data processing and analysis code for the manuscript Mariathasan et al, TGF-b attenuates tumor response to PD-L1 blockade by contributing to exclusion of T cells. R package version 0.1.13. 2018.

  33. T T. A package for survival analysis in R_. 3.2–11 ed. 2021.

  34. Gerds TA. Prodlim: product-limit estimation for censored event history analysis. R package version. 2019.

  35. Venables WN, Ripley BD. Modern applied statistics with S. 4th ed. New York: Springer; 2002.

    Book  Google Scholar 

  36. Habibzadeh F, Habibzadeh P, Yadollahie M. On determining the most appropriate test cut-off value: the case of tests with continuous results. Biochem Med (Zagreb). 2016;26(3):297–307.

    Article  Google Scholar 

  37. Sjödahl G, Eriksson P, Liedberg F, Höglund M. Molecular classification of urothelial carcinoma: global mRNA classification versus tumour-cell phenotype classification. J Pathol. 2017;242(1):113–25.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Sjödahl G, Lauss M, Lövgren K, Chebil G, Gudjonsson S, Veerla S, et al. A molecular taxonomy for urothelial carcinoma. Clin Cancer Res. 2012;18(12):3377–86.

    Article  PubMed  Google Scholar 

  39. Parker JS, Mullins M, Cheang MC, Leung S, Voduc D, Vickery T, et al. Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol. 2009;27(8):1160–7.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Yoon J, Kim M, Posadas EM, Freedland SJ, Liu Y, Davicioni E, et al. A comparative study of PCS and PAM50 prostate cancer classification schemes. Prostate Cancer Prostatic Dis. 2021;24(3):733–42.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Zhao SG, Chang SL, Erho N, Yu M, Lehrer J, Alshalalfa M, et al. Associations of luminal and basal subtyping of prostate cancer with prognosis and response to androgen deprivation therapy. JAMA Oncol. 2017;3(12):1663–72.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


The results published here are in whole or part based upon data generated by the TCGA Research Network:


Oncocyte funded the study.

Author information

Authors and Affiliations



RSS DTR, KM, TJN, BZR, MH, BLS, and DBB conceived and designed the analysis. RSS, BZR, CFM collected the data. RSS, BZR, CFM, MGV, and TJN performed analysis. RSS, MH, BZR, TJN, DBB, KM, CFM, MGV, BLS, and DTR wrote the paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Robert S. Seitz.

Ethics declarations

Ethics approval and consent to participate

This study was performed in accordance with the declaration of Helsinki.

Consent for publication

Not Applicable.

Competing interests

RSS, BZR, TJN, DBB, KM, CFM, MGV, BLS, and DTR are employed by Oncocyte Corporation, the commercial entity that markets the 27-gene IO Score as DetermaIO. MEH serves on advisory boards for Bristol Myers Squibb, Nektar Therapeutics, Janssen Pharmaceuticals, Exelixis, and CRISPR Therapeutics and receives research funding from Achilles, Apexigen, Astellas, AstraZeneca, Bayer, Bristol Myer Squibb, Clovis, Corvus, Eli Lilly, Endocyte, Genentech, Genmab, GSK, Innocrin, Iovance, KSQ Therapeutics, MedImmune, Merck, Nektar Therapeutics, Novartis, Pfizer, Progenics, Roche Laboratories, Sanofi Aventis, SQZ Biotech, Seattle Genetics. His spouse is employed by Arvinas.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Average score for the continuous IO Score by response The IO Score was significant by trend (ordinal logistic regression), complete response, objective response, and disease control rate (horizontal lines from top to bottom). CR = complete response, PR = partial response, SD = stable disease, and PD = progressive disease. Objective Response is CR or PR versus SD or PD and Disease Control is CR, PR, or SD versus PD. **p≤0.01, ***p≤0.001.

Additional file 2: Figure S2A.

. IO Score Independence with Additional Clinical Factors and Genomic Biomarkers Demonstrating IO Score independence with various genomic signatures in a series of bivariate Cox Proportional Hazards. In all cases the median of the signature was used as a threshold for positive or negative. A more complete description of each of these signatures can be found in the work of Mariathasan and colleagues [10].

Additional file 3. Figure S2B.

IO Score Independence with Additional Clinical Factors and Genomic Biomarkers Demonstrating IO Score independence with various genomic signatures in a series of bivariate Cox Proportional Hazards. In all cases the median of the signature was used as a threshold for positive or negative. A more complete description of each of these signatures can be found in the work of Mariathasan and colleagues [10].

Additional file 4. Figure S2C.

IO Score Independence with Additional Clinical Factors and Genomic Biomarkers Demonstrating IO Score independence with various genomic signatures in a series of bivariate Cox Proportional Hazards. In all cases the median of the signature was used as a threshold for positive or negative. A more complete description of each of these signatures can be found in the work of Mariathasan and colleagues [10].

Additional file 5: Figure S3A.

Identifying Additional Eligible Patients by Combining IO Score and TMB (A) Kaplan Meier curves showing individual OS results for TMB-high. (B) Kaplan Meier curves showing the combined results considering patients that were either IO Score+ or TMB -high versus those that were negative for both. (C) Percentage of patients positive for TMB-high/IO Score, and a combined population of either TMB-high or IO Score+. (D) Median and 2-year OS rates for each individual marker and combined population of either IO Score+ or TMB-high.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Seitz, R.S., Hurwitz, M.E., Nielsen, T.J. et al. Translation of the 27-gene immuno-oncology test (IO score) to predict outcomes in immune checkpoint inhibitor treated metastatic urothelial cancer patients. J Transl Med 20, 370 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: