Skip to main content

Identification of molecular subtypes and a novel prognostic model of diffuse large B-cell lymphoma based on a metabolism-associated gene signature

Abstract

Background

Diffuse large B cell lymphoma (DLBCL) is the most common lymphoma in adults. Metabolic reprogramming in tumors is closely related to the immune microenvironment. This study aimed to explore the interactions between metabolism-associated genes (MAGs) and DLBCL prognosis and their potential associations with the immune microenvironment.

Methods

Gene expression and clinical data on DLBCL patients were obtained from the GEO database. Metabolism-associated molecular subtypes were identified by consensus clustering. A prognostic risk model containing 14 MAGs was established using Lasso-Cox regression in the GEO training cohort. It was then validated in the GEO internal testing cohort and TCGA external validation cohort. GO, KEGG and GSVA were used to explore the differences in enriched pathways between high- and low-risk groups. ESTIMATE, CIBERSORT, and ssGSEA analyses were used to assess the immune microenvironment. Finally, WGCNA analysis was used to identify two hub genes among the 14 model MAGs, and they were preliminarily verified in our tissue microarray (TMA) using multiple fluorescence immunohistochemistry (mIHC).

Results

Consensus clustering divided DLBCL patients into two metabolic subtypes with significant differences in prognosis and the immune microenvironment. Poor prognosis was associated with an immunosuppressive microenvironment. A prognostic risk model was constructed based on 14 MAGs and it was used to classify the patients into two risk groups; the high-risk group had poorer prognosis and an immunosuppressive microenvironment characterized by low immune score, low immune status, high abundance of immunosuppressive cells, and high expression of immune checkpoints. Cox regression, ROC curve analysis, and a nomogram indicated that the risk model was an independent prognostic factor and had a better prognostic value than the International Prognostic Index (IPI) score. The risk model underwent multiple validations and the verification of the two hub genes in TMA indicated consistent results with the bioinformatics analyses.

Conclusions

The molecular subtypes and a risk model based on MAGs proposed in our study are both promising prognostic classifications in DLBCL, which may provide novel insights for developing accurate targeted cancer therapies.

Background

Diffuse large B-cell lymphoma (DLBCL) is the most common type of non-Hodgkin's lymphoma. About 60% of DLBCL patients experience effective remission after rituximab plus cyclophosphamide, doxorubicin, vincristine, and prednisone (R-CHOP) regimens. However, approximately 30–40% of patients eventually relapse and 10% are primary refractory cases [1]. The International Prognostic Index (IPI), which is widely used to evaluate the prognosis of DLBCL, mainly depends on five traditional clinicopathological features: age, the Eastern Cooperative Oncology Group (ECOG) performance, Ann Arbor stage, lactate dehydrogenase (LDH) level, and extranodal sites, but does not consider the molecular characteristics and microenvironmental differences in lymphoma. Given that DLBCL is a highly heterogeneous tumor, having the same clinicopathological features does not always lead to the same prognosis [2]. Therefore, IPI score is not sufficient to accurately predict the prognosis [3]. It is necessary for us to develop new strategies to identify risk among DLBCL patients more reliably, so as to personalize treatment strategies. Recent studies have shown that risk models based on multi-gene expression are a reliable choice [4,5,6].

Metabolic reprogramming in tumor cells—notably, aerobic glycolysis, glutamine catabolism, macromolecular synthesis, and redox homeostasis—support the requirements of exponential growth and proliferation [7]. The upregulation of many metabolism-associated genes (MAGs) is driven by the activation of oncogenes. For example, the proto-oncogene c-MYC can activate most glycolytic enzyme genes (principally hexokinase 2(HK2), phosphofructokinase (PFK)-M1, lactate dehydrogenase (LDH)-A and pyruvate kinase M2(PKM2)) to provide fuel for aerobic glycolysis, which subsequently enhances oxidative phosphorylation (OXPHOS). Another oncogene closely related to MAGs is AKT, it directly promotes aerobic glycolysis by upregulating the expression of HK2, PFK1/2 and glucose transporters (GLUT). It can also activate mitochondrial hexokinase (mHK) to promote the coupling of glycolysis and OXPHOS [8, 9]. Therefore, MAGs are considered promising diagnostic markers and potential therapeutic targets. In addition, recent studies have focused on the relationship between metabolism and survival: pan-cancer studies have indicated that tumor subtypes with different MAG expression patterns lead to significantly different survival [10, 11]. Moreover, several risk models based on MAGs have been proposed for breast cancer [6], colorectal cancer [12], gastric cancer [13] and osteosarcoma [14]. However, the value of MAGs in DLBCL subtype identification and prognostic prediction remains unclear.

The tumor microenvironment, as the hotbed of the tumor, has significant immune cell infiltration. In accordance with the complexity of the tumor microenvironment, immune cells recruited to the tumor tissues have dual tumor-promoting and tumor-antagonizing characteristics [15]. The immune microenvironment plays a key role in tumor development and treatment. Studies have shown that metabolic reprogramming is closely related to the tumor immune microenvironment [16]. Metabolites derived from tumor cells can influence the composition and distribution of cells in the immune microenvironment in many ways, ultimately leading to immune dysfunction and tumor progression [17]. For example, metabolic reprogramming can affect the differentiation subtypes and functions of T cells and the polarization and function of macrophages [18]. However, studies on the relationship between MAGs and the immune microenvironment in DLBCL remain limited.

In the present study, we used multiple bioinformatics methods to comprehensively analyze MAGs, identified metabolism-associated molecular subtypes in DLBCL patients, constructed a novel MAG-based risk model evaluating the prognostic value of MAGs in DLBCL, and explored the relationships between MAGs and the immune microenvironment. Finally, two hub genes in the risk model were selected and verified using our own tissue microarray (TMA), and their potential utility as therapeutic targets and diagnostic markers was discussed. Our study may provide new clues on mechanisms and metabolic targets in DLBCL, and it may lay the foundation for accurate immunotherapy that targets metabolic pathways in DLBCL.

Methods

Data sources and preprocessing

The GSE10846 Series Matrix File data were downloaded from the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO) database (the annotation platform was GPL570). The data of 412 DLBCL patients with a complete mRNA expression profile and survival time > 0 was extracted from the GSE10846 dataset. Of the 412 patients in the GSE10846 dataset, 232 patients had undergone R-CHOP treatment, while 180 patients only received CHOP treatment. We randomly divided (4:1 ratio) the 412 cases extracted from the GSE10846 dataset into a training cohort (n = 330) and a testing cohort (n = 82). Additionally, data on DLBC were downloaded from The Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/), and 47 DLBCL cases with a complete mRNA expression profile and survival time > 0 were obtained from the TCGA database. We used the TCGA dataset (n = 47) as an external validation cohort to evaluate the predictive efficacy and robustness of the prognosis-associated risk model. Relevant grouping information and clinicopathological features were shown in Table 1.

Table 1 Clinicopathological characteristics of the DLBCL cases in GSE10846 and TCGA datasets

Identification of metabolism-associated subtypes

MAGs were obtained from the GeneCards database (https://www.genecards.org/). We identified 92 candidate prognosis-related MAGs in GSE10846 by univariate Cox regression. Based on the 92 MAGs, the 412 patients were divided into subgroups with different metabolic expression patterns by consensus clustering using the "ConsensusClusterPlus" R package, and unbiased and unsupervised outcomes were obtained.

Construction and validation of a MAG-based risk model

The 92 candidate MAGs related to prognosis were selected, and a prognostic model was constructed using least absolute shrinkage and selection operator (LASSO) regression. The risk score formula (based on the expression of each included gene weighted by its LASSO regression coefficient) was constructed using the following format: risk score = \({\sum }_{i=1}^{n}coef*gene expression\). Thereafter, the risk score of each patient was calculated. Using the median risk score as the cutoff, the training cohort was divided into low- and high-risk groups. Survival curves were generated by the Kaplan–Meier method and the two groups were compared using the log-rank test. A time-dependent receiver operating characteristic (ROC) curve analysis was used to study the model prediction accuracy. Cox regression was used to assess the independent prognostic value of the risk score and other clinicopathological features. To provide a reference for predicting the prognosis of DLBCL patients, we used the "rms" R package to construct a nomogram based on the risk score and clinicopathological features, and a calibration plot was used to assess the prognostic ability of the nomogram.

Immune analyses

The Estimation of Stromal and Immune cells in Malignant Tumor tissues using Expression data (ESTIMATE) method was performed to calculate the stromal score, immune score, ESTIMATE score, and tumor purity. Next, the Cell-type Identification By Estimating Relative Subsets Of RNA Transcripts (CIBERSORT) algorithm was used to analyze the RNA-Seq data of DLBCL patients in order to determine the relative proportions of 22 infiltrating immune cells. Furthermore, to quantify the immune cell infiltration in each sample, single-sample Gene Set Enrichment Analysis (ssGSEA) was used to assess the enrichment of 28 immune cells in the tumor samples. We then calculated the correlations between the risk score and immune regulatory genes, especially immune checkpoints.

Drug sensitivity analysis and construction of competing endogenouse RNA (ceRNA) network

Based on the Genomics of Drug Sensitivity in Cancer (GDSC) database (https://www.cancerrxgene.org/), which is the largest pharmacogenomics database, we used the "pRRophetic" R package to predict the chemotherapy sensitivity of each tumor sample. The estimated half-maximal inhibitory concentration (IC50) value of each chemotherapy drug was obtained by regression, and the accuracy of regression and prediction was tested by cross-validation with GDSC training set for 10 times. All parameters were selected as default values, including "combat" for removing batch effect and the average value of repeated gene expression. Furthermore, we used FunRich (v3.1.3) and NPInter (v4.0) to construct a ceRNA network based on the model genes.

Gene Set Variation Analysis (GSVA) and functional enrichment analyses

GSVA is a non-parametric and unsupervised method for evaluating the enrichment of gene sets in relation to mRNA expression data. In this study, gene sets were downloaded from the Molecular Signatures Database (v7.0). Each gene set was comprehensively scored by the GSVA algorithm, and the potential differences in biological functions between the high- and low-risk groups were evaluated. Additionally, to explore the functions of the prognosis-associated MAGs, the "ClusterProfiler" R package was used to annotate the genes with their predicted functions based on Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. GO terms and KEGG pathways with p and q values < 0.05 were deemed statistically significant.

Weighted Gene Co-expression Network Analysis (WGCNA)

To identify the hub genes among the 14 model genes, we used the WGCNA algorithm. After constructing a weighted gene co-expression network, the gene co-expression modules were identified, and the correlations between gene network and clinical phenotype were explored. The WGCNA-R package was used to construct the co-expression network of all genes in the GSE10846 dataset, and the genes with variance within the first 5000 were identified by the algorithm for subsequent analysis. The soft-threshold β was determined by the function "sft$powerEstimate". The weighted adjacency matrix was transformed into a topological overlap matrix (TOM) to estimate the network connectivity, with hierarchical clustering being used to construct the clustering tree structure of the TOM. Different branches of the clustering tree represented different gene modules, and different colors represented different modules. Tens of thousands of genes were classified into modules based on having similar expression patterns (using their weighted correlation coefficients).

TMA tissue samples

The DLBCL TMA contained 104 DLBCL tissues and 28 reactive hyperplasia tissues (from cases with the same gender ratio and age range) collected from 2008 to 2015. It was prepared by the Department of Clinical Biobank of the Affiliated Hospital of Nantong University. Clinicopathological data, including gender, age, B symptoms, Ann Arbor stage, hemoglobin (Hb) level, LDH level, IPI score were collected. In addition, X-tile 3.6.1 software was performed to determine the optimal cutoff values for two hub genes expression. This study was a retrospective study, and the informed consent of all patients was obtained before the study. The Ethics Committee of the Affiliated Hospital of Nantong University approved this research.

Fluorescence-based multiplex immunohistochemistry (mIHC) staining

The DLBCL TMA slides were stained with multiplex fluorescence by using the Opal 7-color Manual IHC Kit (PerkinElmer, MA). After dewaxing by xylene and rehydration by ethanol, slides were heated in a microwave with AR6 Buffer (AR600, AKOYA) and AR9 Buffer (AR900, AKOYA) for antigen retrieval. The slides were incubated with primary antibodies overnight at 4 °C and then incubated with secondary antibody for 10 min at room temperature. At last, we used 4',6-diamidino-2-phenylindole (DAPI; F6057, Sigma) to stain the nuclei and seal the slides. Imaging was achieved using the Vectra 3.0 Automated Quantitative Pathology Imaging System. Tumor and stroma images were captured at ×20 magnification. Finally, the staining was scored by inForm® Cell Analysis software based on the intensity and degree of staining. The degree of staining was compared using the Wilcoxon rank-sum test.

The primary antibodies used in this study were as follows: rabbit anti-PHKA1 (24279-1-AP, Proteintech), rabbit anti-PLTP (ab282456, Abcam), rabbit anti-CD163 (93498, Cell Signaling Technology), rabbit anti-CD68 (76437, Cell Signaling Technology), rabbit anti-CD11B (49420, Cell Signaling Technology), mouse anti-CD66b (ARG66287, Arigobio), rabbit anti-PD-1 (86163, Cell Signaling Technology) and rabbit anti-PD-L1 (13684, Cell Signaling Technology). The secondary antibody was Opal™ polymer HRP Ms + Rb (ARH1001EA, Perkin Elmer).

Statistical analysis

Survival curves were generated by the Kaplan–Meier method and compared using the log-rank test. Multivariate Cox proportional hazards regression was used to identify independent prognostic factors. Wilcoxon rank-sum test was applied to continuous variables with nonnormal distribution. All statistical analyses were performed in R software (v4.0). All statistical tests were two tailed, and p < 0.05 was considered statistically significant.

Results

Identification of prognosis-associated MAGs in DLBCL

The whole study process is depicted in the flow chart in Additional file 1: Figure S1. First, we obtained 958 MAGs from the GeneCards database by using “metabolism” as the search term and setting the relevance score > 5. Second, DLBC mRNA expression data (Fragments Per Kilobase of transcript per Million mapped reads [FPKM]) in the GSE10846 dataset was downloaded, and 877 MAGs were extracted (based on the 958 MAGs identified using GeneCards). To identify the prognosis-associated genes among the 877 MAGs, we used prognostic data on DLBCL patients and obtained 92 prognosis-associated MAGs by univariate Cox regression (p < 0.001; Additional file 2: Figure S2).

Identifying metabolism-associated molecular subgroups and differences in prognosis and the immune microenvironment between subgroups

Based on the expression patterns of 92 prognosis-associated MAGs, we used consensus clustering to cluster the 412 patients in the GSE10846 dataset into different metabolism-associated molecular subgroups. By increasing the clustering variable (k) from 2 to 5, we found that consensus clustering was most suitable when k = 2 (Fig. 1A). This indicated that DLBCL patients could be readily divided into two clusters, with 183 patients in cluster 1 and 229 patients in cluster 2. Heatmap visualization showed significant differences in the expression of the 92 MAGs between the two clusters (Fig. 1B). Survival analysis showed that the overall survival in the two clusters was different, with cluster 1 patients having a significantly worse prognosis than cluster 2 patients (follow-up time was 20 years in cluster 1 and 10 years in cluster 2) (Fig. 1C). In addition, the CIBERSORT algorithm was used to evaluate the differences in immune cell infiltration between the two clusters. The abundances of infiltrating immune cells between clusters are shown in Additional file 3: Figure S3, and the quantitative analysis demonstrated that there were significant differences. Cluster 1 had higher infiltration levels of Tregs, NK cells resting and mast cells activated, and lower infiltration levels of T cells gamma delta, T cells CD4 memory activated, monocytes and dendritic cells resting, while cluster 2 showed the opposite trends (Fig. 1D). Analysis of the differences in the expression of immune checkpoints between the two clusters showed that ADORA2A, CD244, CD274, CSF1R, CTLA4, HAVCR2, KIR2DL1, KIR2DL3, LAG3, LGALS9, PDCD1, TGFB1, TGFBR1, and VTCN1 expression levels were significantly higher in cluster 1 than in cluster 2 (Fig. 1E). Higher infiltration of immunosuppressive cells and increased expression of immune checkpoints in the cluster 1 indicated an immunosuppressive tumor microenvironment, which was consistent with the poor prognosis. These findings indicate that the expression of MAGs is related to the prognosis and the immunosuppressive microenvironment in DLBCL patients.

Fig. 1
figure 1

Consensus clustering and the different immune profiles between two clusters. A Consensus matrix heatmap indicating that the optimal value for consensus clustering is K = 2. B Heatmap visualizing the different expression pattern of the 92 MAGs in the two clusters. C Survival curve of the patients in the two clusters. D CIBERSORT analysis in the two clusters. E The expression of immune checkpoints among two clusters. P values were showed as: ns not significant; *p < 0.05; **p < 0.01; ***p < 0.001

Functional enrichment analyses of prognosis-associated MAGs and construction of transcriptional regulatory network

We further conducted functional enrichment analyses of the 92 prognosis-associated MAGs. GO analysis indicated that in biological processes (BP), 92 MAGs were mainly enriched in monosaccharide metabolic process, vitamin metabolic process, cofactor metabolic process, cofactor biosynthetic process and small molecule catabolic process. In cell component (CC), 92 MAGs were mainly enriched in cytochrome complex, respiratory chain complex, oxidoreductase complex, mitochondrial inner membrane and mitochondrial matrix. Regarding molecular function (MF), 92 MAGs were mainly enriched in fatty acid derivative binding, acyl-CoA dehydrogenase activity, amide binding, vitamin binding and coenzyme binding (Additional file 4: Figure S4A). Moreover, KEGG analysis revealed similar pathways, including biosynthesis of cofactors, starch and sucrose metabolism, central carbon metabolism in cancer, fatty acid degradation, and biosynthesis of amino acids (Additional file 4: Figure S4B). Combined with the results of GO and KEGG, we found that 92 MAGs were significantly enriched in many key biosynthetic and metabolic pathways, especially the biosynthesis and metabolism of cofactors, which might contribute to the hyperproliferation of tumor cells and dismal outcomes. Furthermore, we used Cytoscape software to construct a protein–protein interaction network of these prognostic MAGs (Additional file 4: Figure S4C).

Construction and validation of a prognosis-associated risk model composed of 14 MAGs

Based on the MAGs identified in univariate Cox regression analyses, we selected 14 MAGs by LASSO regression in order to construct a metabolism-associated prognosis risk model (Additional file 5: Figure S5 and Table 2). The 412 GEO patients were randomly divided (4:1 ratio) into a training cohort (n = 330) and a testing cohort (n = 82). Because of random grouping, the expression patterns of 14 MAGs in the training cohort and the testing cohort were similar. After constructing the model, the risk score of each DLBCL patient in the training cohort was computed based on the following formula: risk score = NR3C1 × (− 0.285969433071478) + IGFBP3 × (− 0.16695536054869) + RARRES2 × (− 0.14291303044122) + F5 × (− 0.0961037965837689) + APOC1 × (− 0.0768487272489624) + CSF2RA × (− 0.056110125649913) + ENPP1 × (− 0.0221430402174049) + GYG1 × 0.0449620982512186 + PHKA1 × 0.0693510636252306 + CPT1A × 0.0752116074259248 + PDK4 × 0.0767229743647787 + CLOCK × 0.0858851468938173 + CTH × 0.108708800807851 + PLTP × 0.16013617077787. Patients were divided into high- and low-risk groups according to the median risk score. The distribution of risk score, survival status, and the expression of the 14 MAGs in the training cohort are depicted in Fig. 2A. Combining the hazard ratio and gene expression heatmap of 14 MAGs, we found that the expression of genes with hazard ratio > 1, such as GYG1, PHKA1, CPT1A, PDK4, CLOCK, CTH, and PLTP, was higher in the high-risk group, while the expression of genes with hazard ratio < 1, such as NR3C1, IGFBP3, RARRES2, F5, APOC1, CSF2RA, and ENPP1, was higher in the low-risk group. Additionally, Kaplan–Meier curves indicated that the DLBCL patients in the high-risk group had significantly worse overall survival (OS) (Fig. 2B). The area under the curve (AUC) values of the ROC curves for 1-, 2-, and 3-year OS were 0.79, 0.81, and 0.81, respectively (Fig. 2C), demonstrating the great predictive performance of the prognosis-associated risk model.

Table 2 Features of MAGs in the risk model
Fig. 2
figure 2

Construction of the risk model in the GSE10846 training cohort and validation of the risk model in the GSE10846 testing cohort and TCGA cohort. A, D, G Distribution of the risk score, survival status, and gene expression of 14 MAGs in the GSE10846 training cohort (A), GSE10846 testing cohort (D) and TCGA cohort (G). B, E, H Kaplan–Meier curves of OS of patients in the high- and lowrisk groups in the GSE10846 training cohort (B), GSE10846 testing cohort (E) and TCGA cohort (H). C, F, I ROC curves for predicting the 1/2/3-year overall survival in the GSE10846 training cohort (C), GSE10846 testing cohort (F) and TCGA cohort (I)

Next, we used the GEO testing cohort as an internal validation cohort and the TCGA dataset as an external validation cohort to evaluate the prediction performance and robustness of the prognosis-associated risk model. Using the same formula, we obtained consistent results in the testing cohort, which confirmed the robustness of the risk model. The risk score distribution and gene expression profiles are shown in Fig. 2D. OS was significantly worse in the high-risk group than the low-risk group (Fig. 2E). The AUCs for 1-, 2-, and 3-year OS were 0.79, 0.83 and 0.81, respectively (Fig. 2F). In addition, the TCGA results concurred with the GEO training cohort results. The risk score distribution and gene expression profile are shown in Fig. 2G. Patients with higher risk scores had worse OS (Fig. 2H). The AUCs for 1-, 2-, and 3-year OS were 0.78, 0.61, and 0.61, respectively, indicating that the risk model had a strong prognostic value for DLBCL patients in the TCGA validation cohort (Fig. 2I). In conclusion, these results confirmed that the risk model had a robust and accurate ability to predict OS.

Clinical correlations and independent prognosis analysis of risk score

Next, we further validated the clinical value of the risk score. Firstly, we investigated the applicability of the risk score to the immunotherapy cohort. Rituximab (an anti-CD20 monoclonal antibody) plus polychemotherapy (R-CHOP) is the standard of care in DLBCL. Among the 412 patients in GSE10846, only 232 received R-CHOP treatment. We selected these 232 patients as the immunotherapy cohort and compared the survival differences between the high- and low-risk groups in this cohort. The results showed that OS was significantly worse in the high-risk group than the low-risk group (Fig. 3A), indicating that the risk score also had a strong prognostic value for patients who have received immunotherapy. Secondly, to identify whether the risk score was related to clinicopathological features, we compared the risk scores between groups divided based on clinical features, as shown in box plots in Fig. 3B–H. Using Kruskal–Wallis rank sum tests, we found that the risk score significantly differed by age, ECOG status, stage, LDH level, and IPI score (Fig. 3B–F), but it was not related to gender or extranodal sites (Fig. 3G, H). These results indicated that the risk score had good clinical value for classifying DLBCL samples.

Fig. 3
figure 3

Clinical correlations of risk score and development of the nomogram in the GSE10846 dataset. A Survival curve of patients in the high- and low-risk groups in the GSE10846 immunotherapy cohort (232 patients who had received R-CHOP treatment). BH Relationships between the risk score and clinicopathological features (including age, ECOG status, stage, LDH level, IPI score, gender and extranodal sites). 305 patients with complete clinicopathological features from the GSE10846 dataset were analyzed. The distance of both ends of boxes represents the interquartile range of values and the thick line represents the median value. I Univariate and multivariate analyses revealed the risk score was an independent prognostic factor for DLBCL patients. J Nomogram for predicting the 3- and 5-year OS of DLBCL patients. K Calibration curves of the nomogram for OS prediction at 3- and 5- year. L ROC curves indicating the comparisons of the risk score and the IPI score in predicting 1-year OS

In addition, univariate and multivariate Cox regression indicated that risk score was an independent prognostic factor in DLBCL (Fig. 3I). Next, we constructed a prognostic nomogram that integrated the risk score and all significant clinical features. Nomograms can be used to quantitatively predict the prognosis of patients, providing a reference for clinical decision-making. The nomogram demonstrated that the risk score contributed the most to the prognosis, more than the IPI score and other clinical features (such as age, stage, ECOG status, and LDH level) (Fig. 3J). Additionally, the calibration curves for 3- and 5-year OS indicated high consistency between the nomogram predictions and actual observations (Fig. 3K). These findings confirmed that the risk score was a satisfactory prognostic tool for use in DLBCL. Instinctively, we would compare it with the current widely used scoring system, IPI score. Time-dependent ROC curve analysis was further used to determine which scoring system best predicted the OS of DLBCL patients. The AUC of the risk score for predicting 1-year OS (AUC = 0.799) was significantly higher than that of IPI score (AUC = 0.631) (Fig. 3L). This demonstrated that the risk score was superior to the IPI score regarding survival prediction accuracy in DLBCL.

Relationship between risk score and immune microenvironment

The tumor immune microenvironment significantly affects the therapeutic effect and prognosis of tumor. We assessed the relationship between the risk score and the tumor immune microenvironment in DLBCL by multiple immune analyses. The ESTIMATE results indicated that the patients in the high-risk group had significantly lower stromal and ESTIMATE scores, and higher tumor purity, than those in the low-risk group (Fig. 4A). The ssGSEA was used to assess the immune status of the two groups, which suggesting that the DLBCL patients in the high-risk group had a relatively low immune status (Fig. 4B), which was consistent with the ESTIMATE results. In addition, the results of the CIBERSORT analysis revealed that there were significant differences in most infiltrating immune cells (Additional file 6: Figure S6A). The high-risk group was associated with significantly increased abundances of B cells naive, monocytes, macrophages M2, NK cells resting, NK cells activated, and significantly decreased abundances of T cells CD4 naive, T cells follicular helper, T cells gamma delta, macrophages M0 and dendritic cells resting, while the low-risk group showed the opposite trends (Fig. 4C). Higher abundance of immunosuppressive cells in the high-risk group indicated an immunosuppressive tumor microenvironment, which was consistent with the poor prognosis.

Fig. 4
figure 4

The different immune profiles between the low- and high- risk groups in the GSE10846 dataset. Two risk groups were divided based on the median risk score. A ESTIMATE algorithm. B ssGSEA analysis. C CIBERSORT analysis. D Correlation between risk score and immune cell content. E Expression variation of immune checkpoint. p values were showed as: ns not significant; *p < 0.05; **p < 0.01; ***p < 0.001

Additionally, we further explored the correlations between the risk score and immune cell content. The results showed that the risk score was positively correlated with macrophages M2, T cells CD4 naive, monocytes, B cells naïve, NK cells resting, and negatively correlated with T cells gamma delta, macrophages M0 and T cells follicular helper (Fig. 4D).

Thereafter, the immune-regulatory genes were further analyzed, and the differences in the expression levels of immune-related chemokines, immunosuppressants, immunostimulants, major histocompatibility complex (MHC) factors, and immune receptors between the high- and low-risk groups are shown in a heatmap (Additional file 6: Figure S6B). As immune cell dysfunction and immunosuppressive microenvironments are characterized by high expression of immune checkpoint-related transcripts, we subsequently focused on the differences in immune checkpoint expression levels and their correlations with the risk score. The results showed that the expression levels of ADORA2A, CD274, CSF1R, IL10, KIR2DL1, KIR2DL3, LGALS9, and TGFB1 were significantly higher in the high-risk group than the low-risk group (Fig. 4E). The correlations between the risk score and the expression of immune checkpoints showed that the risk score was positively correlated with the expression of CD274, CSF1R, KIR2DL1, KIR2DL3, LGALS9, and PVRL2, and negatively correlated with the expression of CD96, PDCD1LG2, TGFBR1, and TIGIT (Additional file 6: Figure S6C). The significantly increased expression of most immune checkpoints in the high-risk group further confirmed that the poor prognosis of high-risk patients was partly related to the immunosuppressive microenvironment. Based on these results, we can reasonably assume that the immunosuppressive microenvironments (characterized by low immune scores, low immune status, high abundance of immunosuppressive cells, and high expression of immune checkpoints) led to poor prognosis in the high-risk group. This is also consistent with our immune analysis of metabolism-associated molecular subtypes. Therefore, the risk model based on 14 MAGs is related to the immunosuppressive microenvironment of DLBCL, and abnormal immune cell infiltration and differential expression of immune checkpoints can be used as prognostic indicators and targets of immunotherapy, which is clinically significant.

Heterogeneity of drug sensitivity and signaling pathways in high- and low- risk groups

Based on the drug sensitivity data from the GDSC database, we used the "pRRophetic" R package to predict the drug sensitivity of each patient. Gemcitabine, vinblastine, and metformin had lower IC50 values in the high-risk group, while cisplatin and etoposide had higher IC50 values in the high-risk group (Additional file 7: Figure S7A). Furthermore, to explore the discrepancies in signaling pathways between the high- and low-risk groups, GSVA was performed. As shown in Additional file 7: Figure S7B, the differentially enriched signaling pathways between the two groups mainly involved the unfolded protein response, xenobiotic metabolism, KRAS signaling, glycolysis, TGF beta signaling, epithelial–mesenchymal transition, and heme metabolism. The GSVA results suggest that disturbances in these signaling pathways may worsen the prognosis of DLBCL patients in the high-risk group compared to the low-risk group.

Construction of ceRNA network

To further understand how the 14 MAGs in the risk model regulate mRNA expression by acting as miRNA sponges in DLBCL, we constructed a ceRNA network based on the 14 MAGs. Using FunRich to reverse predict miRNAs based on the 14 MAGs, 57 miRNAs and 130 mRNA-miRNA interactions were identified. Subsequently, using NPInter to reverse predict lncRNAs, 7395 mRNA-miRNA-lncRNA interactions were obtained. Finally, a ceRNA network related to the 14 genes was successfully constructed (Additional file 8: Figure S8). These data may provide clues for identifying the regulatory mechanism of the 14 MAGs in DLBCL.

Screening for hub genes in the risk model by WGCNA

To identify the most critical genes among the 14 model genes for further experimental verification, the WGCNA algorithm was applied. We constructed a weighted gene co-expression network based on the genes with variance within the first 5000 in the GSE10846 dataset (Additional file 9: Figure S9A). The soft-threshold was set to 1 to build a scale-free network (Additional file 9: Figure S9B). Next, we constructed an adjacency matrix and transformed it into a topological overlapping matrix (TOM). Based on the TOM, three gene modules were identified, namely blue (208), brown (151) and turquoise (4641) modules. Correlation analyses between the modules and clinical trait showed that the turquoise module had the highest correlation (cor = 0.4, p = 2e−17) (Additional file 9: Figure S9C). To identify the hub genes among the 14 model genes, we identified the overlapping genes between the turquoise module and the 14 model genes. This led to two hub genes being identified: PLTP and PHKA1 (Fig. 5A). Then, firstly, we explored the correlation between these two hub genes and infiltrating immune cells. We found that both hub genes were positively correlated with the infiltration of M2 macrophages and CD8 T cells (Fig. 5B). Secondly, we further analyzed the correlation between the two hub genes and immune checkpoints. As we expected, we found that PD-1, PD-L1 and LAG3, the common immune checkpoints on T cells, were also positively correlated with both hub genes, which indicated that the highly infiltrated CD8 T cells were dysfunctional (Fig. 5C). Thus, we speculated that these two hub genes were closely related to the immunosuppressive microenvironment and the potential mechanism of these two genes in DLBCL deserved further verification and discussion.

Fig. 5
figure 5

Identification of two hub genes and prediction of their relationship with immune cell content and experimental validation of their expression in the DLBCL TMA cohort. A Venn diagram analysis showed that the overlap of WGCNA analysis and LASSO model led to two hub genes being identified: PLTP and PHKA1. B Prediction of correlations between hub genes and immune cell content. C Prediction of correlations between hub genes and immune checkpoints. D Differences in PLTP expression between reactive hyperplasia tissues and DLBCL tissues with mIHC (p value by Wilcoxon rank-sum test). E Differences in PHKA1 expression between reactive hyperplasia tissues and DLBCL tissues with mIHC (p value by Wilcoxon rank-sum test)

Experimental verification of hub gene expression and their relationships with prognosis and the immune microenvironment in the DLBCL TMA cohort

Next, we preliminarily verified the two hub genes using mIHC in our own TMA cohort. First, we compared DLBCL tissues with benign reactive hyperplasia tissues, and we found that the expression levels of PLTP and PHKA1 were both significantly increased in DLBCL patients (Fig. 5D, E). Next, X-tile analysis of 5-year OS was performed using the TMA cohort to determine the optimal cutoff values for PLTP and PHKA1 expression. According to the optimal cutoff value for each gene, we divided the 104 DLBCL samples in the TMA cohort into high- and low-expression groups (high-PLTP expression: n = 36, low-PLTP expression: n = 68; and high-PHKA1 expression: n = 40, low-PHKA1 expression: n = 64). Relevant grouping information and clinicopathological features are shown in Tables 3, 4. We then studied the differences in the tumor immune microenvironment between the pairs of groups. Regarding PLTP, the high-expression group had worse prognosis and higher infiltration of M2 macrophages and tumor-associated macrophages (TAMs) compared to the low-expression group (Fig. 6A–E). Moreover, PLTP expression was positively correlated with clinical stage (Table 3) and immune checkpoints PD-1 and PD-L1, but not correlated with LAG3 (Table 3 and Fig. 6F–I). Likewise, regarding PHKA1, the high-expression group had worse prognosis and higher infiltration of M2 macrophages and TAMs compared to the low-expression group (Fig. 7A–E). Moreover, PHKA1 expression was positively correlated with immune checkpoints PD-1 and PD-L1, but not correlated with clinical features and LAG3 (Table 4 and Fig. 7F–I). Finally, univariate and multivariate Cox regression analyses indicated that PLTP and PHKA1 were both independent prognostic factors for patients with DLBCL (Additional file 10: Figure S10). Our verification results confirmed that these two genes were both overexpressed in DLBCL tissues, and their expression levels were closely related to the prognosis and immunosuppressive microenvironment of DLBCL. The conclusions were basically consistent with the results of our bioinformatics analyses.

Table 3 Association of PLTP expression levels with clinicopathological characteristics in patients with DLBCL
Table 4 Association of PHKA1 expression levels with clinicopathological characteristics in patients with DLBCL
Fig. 6
figure 6

Experimental verification of the relationship between PLTP and prognosis and immune microenvironment in the DLBCL TMA cohort. A Kaplan–Meier curve for the PLTP high- and low- expression groups in our TMA cohort. The optimal cutoff point was obtained from X-tile 3.6.1 software. B Characterization of cell immunophenotypes with mIHC. A staining panel was developed to visualize DAPI, CD68, CD163 and PLTP simultaneously on the same tissue slide. C M2 macrophages content in the PLTP high- and low-expression groups. CD68+CD163+ indicated the content of M2 macrophages (p value by Wilcoxon rank-sum test). D Characterization of cell immunophenotypes with mIHC. A staining panel was developed to visualize DAPI, CD11B and PLTP simultaneously on the same tissue slide. E TAMs content in the PLTP high- and low- expression groups. CD11B+ indicated the content of TAMs (p value by Wilcoxon rank-sum test). F Characterization of cell immunophenotypes with mIHC. A staining panel was developed to visualize DAPI, PD-1, PD-L1, LAG3 and PLTP simultaneously on the same tissue slide. CD274 indicated the content of PD-L1 and PDCD1 indicated the content of PD-1. GI Correlations between PLTP expression and immune checkpoints (including PD-1, PD-L1, and LAG3)

Fig. 7
figure 7

Experimental verification of the relationship between PHKA1 and prognosis and immune microenvironment in the DLBCL TMA cohort. A Kaplan–Meier curve for the PHKA1 high- and low-expression groups in our TMA cohort. The optimal cutoff point was obtained from X-tile 3.6.1 software. B Characterization of cell immunophenotypes with mIHC. A staining panel was developed to visualize DAPI, CD68, CD163 and PHKA1 simultaneously on the same tissue slide. C M2 macrophages content in the PHKA1 highand low- expression groups. CD68+CD163+ indicated the content of M2 macrophages (p value by Wilcoxon rank-sum test). D Characterization of cell immunophenotypes with mIHC. A staining panel was developed to visualize DAPI, CD11B and PHKA1 simultaneously on the same tissue slide. E TAMs content in the PHKA1 high- and low- expression groups. CD11B+ indicated the content of TAMs (p value by Wilcoxon rank-sum test). F Characterization of cell immunophenotypes with mIHC. A staining panel was developed to visualize DAPI, PD-1, PD-L1, LAG3 and PHKA1 simultaneously on the same tissue slide. CD274 indicated the content of PD-L1 and PDCD1 indicated the content of PD-1. GI Correlations between PHKA1 expression and immune checkpoints (including PD-1, PD-L1, and LAG3)

Discussion

The molecular heterogeneity of DLBCL brings great challenges to precision therapy. It is generally accepted that the traditional IPI score cannot adequately predict the prognosis of DLBCL, and developing more reliable strategies for subtype identification and prognostic classification is urgent [3, 19]. In this study, we identified two metabolism-associated molecular subtypes, and there were significant differences in prognosis and the immune microenvironment between these two subtypes. In addition, we developed a prognostic risk model based on 14 MAGs. We found that it was a powerful independent prognostic tool with better predictive performance than the IPI score and was closely related to the immunosuppressive microenvironment. Finally, we identified two hub genes among the model genes, and preliminarily verified them in our own TMA cohort using mIHC. Our results may contribute to the development of accurate immunotherapy for DLBCL that targets metabolic pathways.

Consensus clustering is an unsupervised clustering method that can identify different molecular subtypes according to a gene expression matrix [20]. Using consensus clustering, we identified two metabolism-associated molecular subtypes, which had significant differences in prognosis and the immune microenvironment. Compared to cluster 2, the prognosis of the patients in cluster 1 was poor, accompanied by a high abundance of immunosuppressive cells and a general increase in the expression of immune checkpoints, indicating an immunosuppressive microenvironment. This is consistent with findings regarding other malignancies [6, 13, 14, 21]. As the consensus clustering was based on a MAG expression matrix, we inferred that the expression of MAGs was related to the prognosis and immunosuppressive microenvironment of DLBCL patients.

To further evaluate the prognostic value of the MAGs, we established a 14-gene risk model in the GEO training cohort by univariate Cox regression and LASSO regression. We then constructed a prognostic nomogram that integrated the risk score based on this model and all significant clinical features. The risk score effectively predicted prognosis in the GEO training cohort and was validated in a GEO internal validation cohort and a TCGA external validation cohort. ROC curve analysis confirmed that the risk score was superior to the traditional IPI score. Multiple validation methods indicated the robustness of the risk model, and it is reasonable to believe that this risk model will be broadly applicable for individualized risk management. As previously mentioned, in view of the close relationship between metabolic reprogramming and the tumor immune microenvironment, we performed multiple immune analyses (ESTIMATE, ssGSEA, and CIBERSORT) to explore the differences in the immune landscape between the high- and low- risk groups. As expected, the high-risk group had a poor prognosis and an immunosuppressive microenvironment characterized by low immune score, low immune status, high abundance of immunosuppressive cells, and high expression of immune checkpoints. The low-risk group showed the opposite trend. This is also consistent with our immune analysis of metabolism-associated molecular subtypes. An increased risk score indicates a “cold tumor” [22], with attenuated immunotherapy effectiveness and an immunosuppressive tumor microenvironment caused by metabolic reprogramming, which is consistent with poor prognosis. These conclusions further indicated that MAGs might play important roles in the altered immune response in DLBCL.

Notably, in the two groups with poor prognosis (cluster 1 and the high-risk group), in addition to the increase in the abundance of immunosuppressive cells and the expression of immune checkpoints, there was a significant increase in the infiltration of resting and activated NK cells. This is consistent with the results of previous studies, that is, an increased abundance of activated NK cells is associated with poor prognosis [23]. NK cell dysfunction is common in hematological cancer, and it is related to tumor immune escape [24]. We also found that KIR2DL1 and KIR2DL3 [25], the common immune checkpoints on NK cells, were also significantly overexpressed in cluster 1 and the high-risk group. In the future, immunotherapy that blocks KIR2DL1/KIR2DL3 might reduce the abundance of activated NK cells.

Most of the MAGs in the risk model have been reported to be associated with cancer. To identify the most critical genes, i.e., the hub genes, among the 14 model genes for further experimental verification, we used the WGCNA algorithm to select key genes and then identified the overlapping genes among these genes and the model genes. As a result, we identified two hub genes: PLTP and PHKA1. The potential mechanisms of these two hub genes in DLBCL deserve further discussion.

Phospholipid transfer protein (PLTP) is a widely expressed lipid transfer protein that belongs to the lipopolysaccharide (LPS)-binding/lipid transfer gene family. PLTP can promote the transfer of a series of lipid molecules, including diacylglycerol, phosphatidic acid, sphingomyelin, phosphatidylcholine, phosphatidylglycerol, brain glycosides, and phosphatidylethanolamine. These transport functions play an important role in lipid and lipoprotein metabolism [26, 27]. PLTP is differentially expressed in many kinds of tumors, such as prostate cancer [27], ovarian cancer [28], breast cancer [29], lung cancer [30], gastric cancer [31] and glioma [32]. Such a wide range of cancer types with differential expression of PLTP indicate that PLTP may be an important regulator of some common processes related to tumors.

The phosphorylase kinase regulatory subunit alpha 1 (PHKA1) gene encodes the muscle-type isoform of the PHK alpha subunit [33]. PHKA1 plays a key role in glycogen metabolism [34] and PHKA1 mutations cause glycogen storage disease type 9D, also known as X-linked muscle glycogenosis [35]. However, research on PHKA1 in tumors is still limited. Research has shown that PHKA1, as an important gene related to glycogen metabolism, is related to the metastasis of prostate cancer [36]. In addition, the increased expression of PHKA1 was associated with younger ages of gastrointestinal stromal tumor patients [36].

We further preliminarily validated the two hub genes in our TMA cohort using mIHC, which can quantify immune cells in the tumor microenvironment more objectively than traditional semi-quantitative methods [37]. Our verification results confirmed that the two hub genes were both overexpressed in DLBCL tissues. Thereafter, using X-tile (a valuable tool for outcome-based cutoff optimization) [38] and the 5-year OS of patients, we determined the optimal cutoff value for PLTP and PHKA1 expression. Based on each cutoff value, we subdivided the DLBCL patients into high- and low-expression groups, and further studied the differences in the tumor immune microenvironment between the pairs of groups. We found that the prognosis of the high-expression groups was poorer, accompanied by an immunosuppressive microenvironment characterized by higher abundances of immunosuppressive cells (M2 macrophages and TAMs) and higher expression of immune checkpoints (PD-L1 and PD-1). Finally, univariate and multivariate Cox regression analyses indicated that PLTP and PHKA1 were both independent prognostic factors in DLBCL. These experimental results showed that high expression of the hub genes was closely related to the prognosis and immunosuppressive microenvironment of DLBCL, which was consistent with our bioinformatics analyses, and further verified the stability and accuracy of the risk model.

Studies have shown that metabolic reprogramming is an important feature of immune cell activation. Immune cells have different metabolic characteristics, which affect their immune function [16, 18]. Macrophages, as the main immune-infiltrating cells in solid tumors, can polarize into inflammatory (M1) or immunosuppressive (M2) phenotypes based on external stimuli. M1 macrophages have pro-inflammatory and anti-tumor effects, while M2 macrophages have anti-inflammatory and pro-tumor effects [39]. The metabolic reprogramming of tumors can affect the polarization process of macrophages [40, 41]. For example, hypoxia and lactic acid accumulation can promote the production of immunosuppressive M2 macrophages. The increase in tumor glycolysis produces a large amount of lactic acid, and the accumulation of lactic acid drives macrophages toward the M2 phenotype. M2 macrophages overexpress arginase 1 (ARG1). ARG1 consumes L-arginine, which is necessary for cytotoxic T lymphocytes to exert anti-tumor activity, and produces polyamines with strong immunosuppressive effects [18, 42]. Additionally, hypoxia promotes tumor development by inducing the production of angiogenic factors, mitogenic factors, and cytokines related to tumor metastasis in macrophages [9]. Additionally, macrophages can undergo lipid-based metabolic reprogramming to promote tumor progression via increased membrane cholesterol efflux [43, 44]. Moreover, M2 macrophages up-regulate fatty acid oxidation, mitochondrial respiration, and angiogenesis, thereby promoting tumor progression [9, 45]. Our mIHC results also confirmed that M2 macrophages in DLBCL patients with high metabolic gene expression were significantly increased. Therefore, M2 macrophages may have potential as immunotherapy targets.

Interactions between immune checkpoints and their cognate receptors can deliver inhibitory signals to immune cells leading to their dysfunction and exhaustion, resulting in immunosuppressive microenvironment and tumor progression [46]. Our study showed that the expression of most immune checkpoints significantly increased with increasing risk score, indicating an immunosuppressive microenvironment that was consistent with poor prognosis. Recent studies have shown that immune checkpoints are closely related to metabolism. On the one hand, checkpoint signals can regulate metabolism [18]. For example, PD-L1 in tumor cells can activate the PI3K-Akt-mTOR pathway, stimulate glycolysis, and enhance glucose uptake by the tumor cells [47]. CD155-TIGIT signaling in T cells of human gastric cancer inhibits glucose uptake, lactic acid production, and glycolytic enzyme expression [48]. On the other hand, metabolism also modulate the tumor response to checkpoint blockade immunotherapy. For instance, obesity is recognized to enhance the PD-1 expression, and is associated with better outcome to checkpoint blockade immunotherapy in metastatic melanoma and renal cell carcinoma [46]. Besides, 2-Deoxy-D-glucose (2-DG), a non-metabolizable glucose analog that inhibits normal glucose metabolism, can enhance the efficacy of anti-CTLA-4 immunotherapy by decreasing PD-L1 protein abundance and increasing expression of type-I interferon (IFN) and antigen presentation genes [49]. Moreover, Powell's team showed that a glutamine metabolism inhibitor not only improved the immunosuppressive microenvironment, but also effectively reversed PD-1 inhibitor resistance when combined with a PD-1 inhibitor [50]. Therefore, combining metabolic inhibitors with checkpoint inhibitors is expected to improve the efficacy of checkpoint blockade.

Our research has some unique advantages. In this study, two metabolism-associated DLBCL subtypes were identified, and a risk model based on MAGs was constructed. We used multiple validation methods to evaluate the model: first, we tested the model in a GEO internal testing cohort, then in a TCGA external validation cohort, and finally we identified two hub genes and carried out preliminary verification in our own TMA cohort. Satisfactory results were obtained from the multiple validation methods, confirming the robustness and accuracy of the risk model. In addition, we not only studied the predictive performance of the risk model, but also explored the effect of MAG expression on the tumor immune microenvironment in DLBCL.

Limitations

Although our findings have potential clinical significance, there are still some limitations. First, the clinical features extracted from the GEO and TCGA databases were limited, as they did not include potential prognostic factors such as smoking and background diseases. Second, this is a retrospective study, and an independent prospective cohort is needed to verify the risk model established in this study. Finally, the value of the two hub genes as potential pharmacological targets needs to be further investigated.

Conclusions

In this study, we identified two metabolism-associated DLBCL subtypes and constructed a risk model based on MAGs. Our risk model is related to the tumor immune microenvironment and prognosis of DLBCL. This study provides a new 14-gene signature for predicting the prognosis of DLBCL, which may facilitate personalized treatment strategies, and provides an important basis for further study on MAGs and the immune microenvironment.

Availability of data and materials

All the data corresponding to the DLBCL series used in this study are available in GEO (https://www.ncbi.nlm.nih.gov/geo) and TCGA (https://portal.gdc.cancer.gov/), which are public functional genomics data repositories.

Abbreviations

DLBCL:

Diffuse large B cell lymphoma

MAGs:

Metabolism-associated genes

GEO:

Gene Expression Omnibus

LASSO:

Least absolute shrinkage and selection operator

TCGA:

The Cancer Genome Atlas

GO:

Gene Ontology

KEGG:

Kyoto Encyclopedia of Genes and Genomes

GSVA:

Gene set variation analysis

ESTIMATE:

Estimation of Stromal and Immune cells in Malignant Tumor tissues using Expression data

CIBERSORT:

Cell-type Identification By Estimating Relative Subsets Of RNA Transcripts

ssGSEA:

Single-sample Gene Set Enrichment Analysis

ceRNA:

Competing endogenouse RNA

WGCNA:

Weighted gene co-expression network analysis

TMA:

Tissue microarray

mIHC:

Multiple fluorescence immunohistochemistry

ROC:

Receiver operating characteristic

IPI:

International Prognostic Index

NCBI:

National Center for Biotechnology Information

IC50:

Half maximal inhibitory concentration

AUC:

Area under the curve

LDH:

Lactate dehydrogenase

ECOG:

Eastern Cooperative Oncology Group

MHC:

Major histocompatibility complex

PD-1:

Programmed death-1

PD-L1:

Programmed cell death-1 ligand 1

References

  1. Patriarca A, Gaidano G. Investigational drugs for the treatment of diffuse large B-cell lymphoma. Expert Opin Investig Drugs. 2021;30(1):25–38.

    Article  CAS  PubMed  Google Scholar 

  2. Ennishi D, Hsi ED, Steidl C, Scott DW. Toward a new molecular taxonomy of diffuse large B-cell lymphoma. Cancer Discov. 2020;10(9):1267–81.

    Article  CAS  PubMed  Google Scholar 

  3. Wight JC, Chong G, Grigg AP, Hawkes EA. Prognostication of diffuse large B-cell lymphoma in the molecular era: moving beyond the IPI. Blood Rev. 2018;32(5):400–15.

    Article  CAS  PubMed  Google Scholar 

  4. Kamel HFM, Al-Amodi H. Exploitation of gene expression and cancer biomarkers in paving the path to era of personalized medicine. Genomics Proteomics Bioinformatics. 2017;15(4):220–35.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Ahluwalia P, Kolhe R, Gahlay GK. The clinical relevance of gene expression based prognostic signatures in colorectal cancer. Biochim Biophys Acta Rev Cancer. 2021;1875(2):188513.

    Article  CAS  PubMed  Google Scholar 

  6. Ye Z, Zou S, Niu Z, Xu Z, Hu Y. A novel risk model based on lipid metabolism-associated genes predicts prognosis and indicates immune microenvironment in breast cancer. Front Cell Dev Biol. 2021;9:691676.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Ohshima K, Morii E. Metabolic reprogramming of cancer cells during tumor progression and metastasis. Metabolites. 2021;11(1):28.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Faubert B, Solmonson A, DeBerardinis RJ. Metabolic reprogramming and cancer progression. Science. 2020;368(6487):eaaw5473.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Tong Y, Gao WQ, Liu Y. Metabolic heterogeneity in cancer: an overview and therapeutic implications. Biochim Biophys Acta Rev Cancer. 2020;1874(2):188421.

    Article  CAS  PubMed  Google Scholar 

  10. Peng X, Chen Z, Farshidfar F, Xu X, Lorenzi PL, Wang Y, et al. Molecular characterization and clinical relevance of metabolic expression subtypes in human cancers. Cell Rep. 2018;23(1):255-269.e4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Sinkala M, Mulder N, Patrick MD. Metabolic gene alterations impact the clinical aggressiveness and drug responses of 32 human cancers. Commun Biol. 2019;2:414.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  12. Lin D, Fan W, Zhang R, Zhao E, Li P, Zhou W, et al. Molecular subtype identification and prognosis stratification by a metabolism-related gene expression signature in colorectal cancer. J Transl Med. 2021;19(1):279.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Yu S, Hu C, Cai L, Du X, Lin F, Yu Q, et al. Seven-gene signature based on glycolysis is closely related to the prognosis and tumor immune infiltration of patients with gastric cancer. Front Oncol. 2020;10:1778.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Qian H, Lei T, Hu Y, Lei P. Expression of lipid-metabolism genes is correlated with immune microenvironment and predicts prognosis in osteosarcoma. Front Cell Dev Biol. 2021;9:673827.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Lei X, Lei Y, Li JK, Du WX, Li RG, Yang J, et al. Immune cells within the tumor microenvironment: biological functions and roles in cancer immunotherapy. Cancer Lett. 2020;470:126–33.

    Article  CAS  PubMed  Google Scholar 

  16. Li X, Wenes M, Romero P, Huang SC, Fendt SM, Ho PC. Navigating metabolic pathways to enhance antitumour immunity and immunotherapy. Nat Rev Clin Oncol. 2019;16(7):425–41.

    Article  CAS  PubMed  Google Scholar 

  17. Wang JX, Choi SYC, Niu X, Kang N, Xue H, Killam J, et al. Lactic acid and an acidic tumor microenvironment suppress anticancer immunity. Int J Mol Sci. 2020;21(21):8363.

    Article  CAS  PubMed Central  Google Scholar 

  18. Leone RD, Powell JD. Metabolism of immune cells in cancer. Nat Rev Cancer. 2020;20(9):516–31.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Wright GW, Huang DW, Phelan JD, Coulibaly ZA, Roulland S, Young RM, et al. A probabilistic classification tool for genetic subtypes of diffuse large B cell lymphoma with therapeutic implications. Cancer Cell. 2020;37(4):551-68.e14.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Wilkerson MD, Hayes DN. ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics (Oxford, England). 2010;26(12):1572–3.

    Article  CAS  Google Scholar 

  21. Qi J, Liu Y, Hu J, Lu L, Dou Z, Dai H, et al. Identification of FPR3 as a unique biomarker for targeted therapy in the immune microenvironment of breast cancer. Front Pharmacol. 2020;11:593247.

    Article  CAS  PubMed  Google Scholar 

  22. Galon J, Bruni D. Approaches to treat immune hot, altered and cold tumours with combination immunotherapies. Nat Rev Drug Discov. 2019;18(3):197–218.

    Article  CAS  PubMed  Google Scholar 

  23. Zhou H, Zheng C, Huang DS. A prognostic gene model of immune cell infiltration in diffuse large B-cell lymphoma. PeerJ. 2020;8:e9658.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Vo DN, Alexia C, Allende-Vega N, Morschhauser F, Houot R, Menard C, et al. NK cell activation and recovery of NK cell subsets in lymphoma patients after obinutuzumab and lenalidomide treatment. Oncoimmunology. 2018;7(4):e1409322.

    Article  PubMed  Google Scholar 

  25. Sim MJ, Stowell J, Sergeant R, Altmann DM, Long EO, Boyton RJ. KIR2DL3 and KIR2DL1 show similar impact on licensing of human NK cells. Eur J Immunol. 2016;46(1):185–91.

    Article  CAS  PubMed  Google Scholar 

  26. Albers JJ, Vuletic S, Cheung MC. Role of plasma phospholipid transfer protein in lipid and lipoprotein metabolism. Biochim Biophys Acta. 2012;1821(3):345–57.

    Article  CAS  PubMed  Google Scholar 

  27. Yazdanyar A, Yeang C, Jiang XC. Role of phospholipid transfer protein in high-density lipoprotein- mediated reverse cholesterol transport. Curr Atheroscler Rep. 2011;13(3):242–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Kuk C, Kulasingam V, Gunawardana CG, Smith CR, Batruch I, Diamandis EP. Mining the ovarian cancer ascites proteome for potential ovarian cancer biomarkers. Mol Cell Proteomics. 2009;8(4):661–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Bianchini G, Qi Y, Alvarez RH, et al. Molecular anatomy of breast cancer stroma and its prognostic value in estrogen receptor-positive and -negative cancers. J Clin Oncol. 2010;28(28):4316–23.

    Article  PubMed  Google Scholar 

  30. Rohrbeck A, Borlak J. Cancer genomics identifies regulatory gene networks associated with the transition from dysplasia to advanced lung adenocarcinomas induced by c-Raf-1. PLoS ONE. 2009;4(10):e7315.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  31. Huang K, Chen S, Xie R, Jiang P, Yu C, Fang J, et al. Identification of three predictors of gastric cancer progression and prognosis. FEBS Open Bio. 2020;10(9):1891–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Dong W, Gong H, Zhang G, Vuletic S, Albers J, Zhang J, et al. Lipoprotein lipase and phospholipid transfer protein overexpression in human glioma cells and their effect on cell growth, apoptosis, and migration. Acta Biochim Biophys Sin (Shanghai). 2017;49(1):62–73.

    Article  CAS  Google Scholar 

  33. Davidson JJ, Ozçelik T, Hamacher C, Willems PJ, Francke U, Kilimann MW. cDNA cloning of a liver isoform of the phosphorylase kinase alpha subunit and mapping of the gene to Xp22.2-p22.1, the region of human X-linked liver glycogenosis. Proc Natl Acad Sci U S A. 1992;89(6):2096–100.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Wuyts W, Reyniers E, Ceuterick C, Storm K, de Barsy T, Martin JJ. Myopathy and phosphorylase kinase deficiency caused by a mutation in the PHKA1 gene. Am J Med Genet A. 2005;133(1):82–4.

    Article  Google Scholar 

  35. Herbert M, Goldstein JL, Rehder C, Austin S, Kishnani PS, Bali DS. Phosphorylase Kinase Deficiency. In: Adam MP, Ardinger HH, Pagon RA, eds. GeneReviews (®). Seattle: University of Washington. Copyright © 1993–2021, University of Washington, Seattle. GeneReviews is a registered trademark of the University of Washington, Seattle. All rights reserved; 1993.

  36. Chen J, Cao S, Situ B, et al. Metabolic reprogramming-based characterization of circulating tumor cells in prostate cancer. J Exp Clin Cancer Res. 2018;37(1):127.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Prakash S, Sarran L, Socci N, DeMatteo RP, Eisenstat J, Greco AM, et al. Gastrointestinal stromal tumors in children and young adults: a clinicopathologic, molecular, and genomic study of 15 cases and review of the literature. J Pediatr Hematol Oncol. 2005;27(4):179–87.

    Article  PubMed  Google Scholar 

  38. Schalper KA, Brown J, Carvajal-Hausdorf D, McLaughlin J, Velcheti V, Syrigos KN, et al. Objective measurement and clinical significance of TILs in non-small cell lung cancer. J Natl Cancer Inst. 2015;107(3):435.

    Article  CAS  Google Scholar 

  39. Camp RL, Dolled-Filhart M, Rimm DL. X-tile: a new bio-informatics tool for biomarker assessment and outcome-based cut-point optimization. Clin Cancer Res. 2004;10(21):7252–9.

    Article  CAS  PubMed  Google Scholar 

  40. Locati M, Curtale G, Mantovani A. Diversity, mechanisms, and significance of macrophage plasticity. Annu Rev Pathol. 2020;15:123–47.

    Article  CAS  PubMed  Google Scholar 

  41. Mehla K, Singh PK. Metabolic regulation of macrophage polarization in cancer. Trends Cancer. 2019;5(12):822–34.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Biswas SK. Metabolic reprogramming of immune cells in cancer progression. Immunity. 2015;43(3):435–49.

    Article  CAS  PubMed  Google Scholar 

  43. Paolini L, Adam C, Beauvillain C, Preisser L, Blanchard S, Pignon P, et al. Lactic acidosis together with GM-CSF and M-CSF induces human macrophages toward an inflammatory protumor phenotype. Cancer Immunol Res. 2020;8(3):383–95.

    Article  CAS  PubMed  Google Scholar 

  44. Xiang Y, Miao H. Lipid metabolism in tumor-associated macrophages. Adv Exp Med Biol. 2021;1316:87–101.

    Article  CAS  PubMed  Google Scholar 

  45. Goossens P, Rodriguez-Vita J, Etzerodt A, Masse M, Rastoin O, Gouirand V, et al. Membrane cholesterol efflux drives tumor-associated macrophage reprogramming and tumor progression. Cell Metab. 2019;29(6):1376-89.e4.

    Article  CAS  PubMed  Google Scholar 

  46. Liu Y, Xu R, Gu H, Zhang E, Qu J, Cao W, et al. Metabolic reprogramming in macrophage responses. Biomark Res. 2021;9(1):1.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Deshpande RP, Sharma S, Watabe K. The confounders of cancer immunotherapy: roles of lifestyle, metabolic disorders and sociological factors. Cancers. 2020;12(10):2983.

    Article  PubMed Central  Google Scholar 

  48. Chang CH, Qiu J, O’Sullivan D, Buck MD, Noguchi T, Curtis JD, et al. Metabolic competition in the tumor microenvironment is a driver of cancer progression. Cell. 2015;162(6):1229–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. He W, Zhang H, Han F, Chen X, Lin R, Wang W, et al. CD155T/TIGIT signaling regulates CD8(+) T-cell metabolism and promotes tumor progression in human gastric cancer. Cancer Res. 2017;77(22):6375–88.

    Article  CAS  PubMed  Google Scholar 

  50. Dai X, Bu X, Gao Y, Guo J, Hu J, Jiang C, Zhang Z, Xu K, Duan J, He S, et al. Energy status dictates PD-L1 protein abundance and anti-tumor immunity to enable checkpoint blockade. Mol Cell. 2021;81(11):2317-2331.e2316.

    Article  CAS  PubMed  Google Scholar 

  51. Leone RD, Zhao L, Englert JM, Sun IM, Oh MH, Sun IH, et al. Glutamine blockade induces divergent metabolic programs to overcome tumor immune evasion. Science. 2019;366(6468):1013–21.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We are grateful to the patients for their contributions to this study.

Funding

This study was supported by the National Natural Science Foundation of China Grants (No. 81570184 to WS).

Author information

Authors and Affiliations

Authors

Contributions

WS, CZ and JH designed the study. WS, JH, ZC, QX, PS, YW and CZ conducted the experiments and data analysis. WS and JH wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Wenyu Shi.

Ethics declarations

Ethics approval and consent to participate

The informed consent of all patients was obtained before the study. The Ethics Committee of the Affiliated Hospital of Nantong University approved this research.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Flow chart of the data analyzing process. The GSE10846 dataset was randomly divided (4:1 ratio) into a training cohort (n = 330) and a testing cohort (n = 82).

Additional file 2: Figure S2.

92 Prognosis-related MAGs by univariate Cox regression.

Additional file 3: Figure S3.

The proportion of 22 types of immune cells in patients.

Additional file 4: Figure S4.

Functional analyses based on 92 prognostic MAGs. (A) Lollipop Chart for GO enrichment (BP, biological process; CC, cellular component; MF, molecular function). (B) Barplot graph for KEGG pathways. (C) PPI analysis showing the protein interaction network of MAGs.

Additional file 5: Figure S5.

LASSO regression analysis revealing the minimum criteria (A, B) and coefficients (C). Blue represented the coefficient greater than 0.1, and yellow represented the coefficient less than 0.1.

Additional file 6: Figure S6.

The heatmaps of immune cells and immune-regulatory genes expression in the high- and low-risk group. (A) The proportion of 22 types of immune cells in the training cohort patients. (B) Immune-regulatory genes expression in the high- and low-risk group. (C) Correlation between the risk score and immune checkpoints. p values were showed as: ns not significant; *p < 0.05; **p < 0.01; ***p < 0.001.

Additional file 7: Figure S7.

Prediction of drug sensitivity and signaling pathways in high- and low- risk groups. (A) Sensitivity analysis of common therapeutic drugs in patients of two groups. (B) GSVA analysis for differentially expressed signaling pathways between two groups.

Additional file 8: Figure S8.

The ceRNA network associated with the model genes. The red dots represent mRNAs, the green dots represent miRNAs and the blue dots represent lncRNAs.

Additional file 9: Figure S9.

Construction of weighted co-expression network. (A) Clustering dendrogram of 412 patients in the GSE10846 dataset and heatmap of clinical trait (i.e. risk group). According to the median risk score, 412 patients were divided into high- and low- risk groups. In terms of the color in heatmap, white means low-risk group and red means high-risk group. The cluster was based on the genes with variance within the first 5000 in the GSE10846 dataset. (B) Analysis of the scale-free fit index (left) and the mean connectivity (right) for various soft-thresholding power value. (C) Correlation between the modules and clinical trait (i.e. risk group). Blue represents the negative correlation and red represents the positive correlation.

Additional file 10: Figure S10.

Cox univariate and multivariate regression analysis revealed that PLTP (A) and PHKA1 (B) were both independent prognostic factors for DLBCL patients.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

He, J., Chen, Z., Xue, Q. et al. Identification of molecular subtypes and a novel prognostic model of diffuse large B-cell lymphoma based on a metabolism-associated gene signature. J Transl Med 20, 186 (2022). https://doi.org/10.1186/s12967-022-03393-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12967-022-03393-9

Keywords