Identification of key biomarkers and immune infiltration in systemic lupus erythematosus by integrated bioinformatics analysis
Journal of Translational Medicine volume 19, Article number: 35 (2021)
Systemic lupus erythematosus (SLE) is a multisystemic, chronic inflammatory disease characterized by destructive systemic organ involvement, which can lead to decreased functional capacity and increased morbidity and mortality. Previous studies show that SLE is characterized by autoimmunity, inflammatory processes and tissue destruction. Some seriously ill patients develop lupus nephritis. However, the cause and underlying molecular events of SLE remain to be resolved.
The expression profiles of GSE144390, GSE4588, GSE50772 and GSE81622 were downloaded from the Gene Expression Omnibus (GEO) database to obtain differentially expressed genes (DEGs) between SLE and healthy samples. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses of the DEGs were performed with Metascape and other online tools. The protein–protein interaction (PPI) networks of the DEGs were constructed with GeneMANIA. We performed Gene Set Enrichment Analysis (GSEA) to further understand the functions of the hub gene. Weighted gene co‐expression network analysis (WGCNA) was used to build a gene co‐expression network, and the most significant module and hub genes were identified. CIBERSORT was used to analyse immune cell infiltration patterns. Receiver operating characteristic (ROC) analyses were conducted to explore the value of the DEGs for SLE diagnosis.
In total, 6 DEGs (IFI27, IFI44, IFI44L, IFI6, EPSTI1 and OAS1) were screened. Biological function analysis identified key related pathways, gene modules and co‐expression networks in SLE. IFI27 may be closely correlated with the occurrence of SLE. We found that increased infiltration of monocytes and decreased infiltration of resting NK cells may be related to the occurrence of SLE.
IFI27 may be closely related to the pathogenesis of SLE and represents a new candidate molecular marker of the occurrence and progression of SLE. Moreover, immune cell infiltration plays an important role in the progression of SLE.
Systemic lupus erythematosus (SLE) is one of the most common autoimmune diseases worldwide. It affects multiple organs in adults, mostly young women, and an increasing number of early, mild and atypical cases are being recognized. SLE arises through diverse pathogenic mechanisms and shows correspondingly varied clinical manifestations and cellular and molecular bases. Research on the pathogenesis of SLE has focused on autoantibodies and immune complexes, inflammatory processes and tissue destruction [2,3,4,5]. However, the pathophysiologic mechanisms of SLE have not been thoroughly investigated. It is therefore important to explore the molecular characteristics and mechanisms of SLE occurrence and progression in order to provide new strategies for effective prevention, diagnosis and treatment.
In recent years, microarrays based on high-throughput platforms have been widely used to identify promising biomarkers for the diagnosis and prognosis of disease at the genome level. Numerous studies [6,7,8] have demonstrated that the pathophysiological processes underlying the development of SLE are closely associated with the mutation and abnormal expression of genes, including TNFSF4, NCF1-339 and CXorf21. A previous study demonstrated that IFI44L promoter methylation can serve as a blood biomarker for systemic lupus erythematosus. In an animal study, researchers demonstrated an association between the expression of IFIT1 in podocytes of MRL/lpr mice and the renal pathological changes it causes. Wang et al. showed that abnormal elevations in IFIT3 were associated with overactive cyclic GMP-AMP synthase/stimulator of interferon genes signaling in human systemic lupus erythematosus monocytes. Moreover, it has been shown that cGAS activation causes lupus-like autoimmune disorders in a TREX1 mutant mouse model. In addition, IRF5 risk variants are associated with elevated IRF5 expression and IFN production in SLE blood cells. Therefore, it is imperative to identify accurate molecular targets involved in the occurrence and progression of SLE in order to contribute to its diagnosis and treatment.
Herein, we analysed four mRNA microarray datasets from the Gene Expression Omnibus (GEO) to screen differentially expressed genes (DEGs) between SLE and healthy samples. The molecular mechanisms of the pathogenesis of SLE were then explored via enrichment analysis of functions and pathways. Protein–protein interaction (PPI) network analysis was carried out to explore relationships between the DEGs, and 6 hub genes were screened. IFI27 was identified as a key hub gene closely correlated with the progression of SLE. CIBERSORT was used to evaluate the abundance of immune infiltrates. WGCNA and GSEA were used to analyse the mechanism by which IFI27 may affect the pathogenesis of SLE. The findings provide new candidate molecular markers for studying the pathogenesis of SLE. Data processing was performed using R software (Version 3.6.1; https://www.r-project.org/) and Bioconductor packages (http://www.bioconductor.org/), together with online tools such as Metascape. It is anticipated that the novel DEGs and pathways between SLE and healthy controls identified in this study may shed light on the underlying molecular mechanisms.
Materials and methods
Access to GEO datasets
Genes were screened using the GEO (http://www.ncbi.nlm.nih.gov/geo) database. GSE144390, GSE4588, GSE50772 and GSE81622, all of which were used to identify genes and pathways involved in the development of SLE compared with normal individuals, were obtained from GEO. The GSE50772 and GSE4588 series are based on the GPL570 platform (Affymetrix Human Genome U133 Plus 2.0 Array), the GSE144390 series on the GPL6244 platform (Affymetrix Human Gene 1.0 ST Array), and the GSE81622 series on the GPL10558 platform (Illumina HumanHT-12 V4.0 expression beadchip). Basic information on the selected datasets is shown in Additional file 1: Table S1. The probes were converted to the corresponding gene symbols using the platform annotation information.
DEGs identified by GEO2R
GEO2R (http://www.ncbi.nlm.nih.gov/geo/geo2r), an online data analysis tool, was used to screen the DEGs between SLE and healthy controls. We established six differential comparison groups from the four GEO series: the GSE4588 series was divided into GSE4588 CD4 T cells and GSE4588 B cells, and GSE81622 was divided into GSE81622 SLE and GSE81622 LN. GEO2R compares the groups defined in each series so that the DEGs can be identified. Genes without a corresponding gene symbol and genes with more than one probe set were removed. The thresholds for statistical significance were set as adjusted p value ≤ 0.05 and |fold change| ≥ 1. To identify significant DEGs, the Venn online tool (http://bioinformatics.psb.ugent.be/webtools/Venn/) was used to draw a Venn diagram, and overlapping DEGs were retained for further analysis.
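The screening logic described above (threshold each comparison, then keep only the genes shared by all comparisons) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the record layout (`symbol`, `adj_p`, `fc`), the function names and the toy values are all assumptions, and the sketch applies the fold-change cutoff to whatever value is stored in `fc`, since the text does not state whether the threshold refers to a raw or log2 fold change.

```python
from typing import Dict, Iterable, List, Set


def select_degs(rows: Iterable[dict],
                padj_cutoff: float = 0.05,
                fc_cutoff: float = 1.0) -> Set[str]:
    """Keep gene symbols with adjusted p <= padj_cutoff and |fold change| >= fc_cutoff."""
    return {
        row["symbol"]
        for row in rows
        if row["symbol"]                    # drop probes without a gene symbol
        and row["adj_p"] <= padj_cutoff
        and abs(row["fc"]) >= fc_cutoff
    }


def overlapping_degs(comparisons: Dict[str, List[dict]]) -> Set[str]:
    """Intersect the DEG sets of all comparisons (the centre of the Venn diagram)."""
    deg_sets = [select_degs(rows) for rows in comparisons.values()]
    return set.intersection(*deg_sets) if deg_sets else set()


# Toy input with made-up numbers; GENE_X and GENE_Y are hypothetical names.
toy = {
    "comparison_1": [
        {"symbol": "IFI27", "adj_p": 0.001, "fc": 3.2},
        {"symbol": "GENE_X", "adj_p": 0.20, "fc": 2.0},   # fails the p-value cutoff
        {"symbol": "", "adj_p": 0.001, "fc": 4.0},        # no gene symbol
    ],
    "comparison_2": [
        {"symbol": "IFI27", "adj_p": 0.01, "fc": 1.5},
        {"symbol": "GENE_Y", "adj_p": 0.01, "fc": 0.4},   # fails the fold-change cutoff
    ],
}
shared = overlapping_degs(toy)
print(sorted(shared))
```

With six comparisons, as in this study, the same intersection would be taken over six sets instead of two.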
Analyses of DEGs
Volcano plots were drawn using the volcano plotting tool (http://soft.sangerbox.com/). TBtools (http://www.tbtools.com/) was used to draw the expression heatmap of DEGs in the different series. Gene–gene and series–series correlation analyses were performed with the online tool (http://soft.sangerbox.com/).
Functional annotation and pathway enrichment analysis
To functionally annotate the DEGs identified in the aforementioned comparison groups, GO terms were annotated and visualized using the GO enrichment analysis tool (http://enrich.shbio.com/index/ga.asp) and Metascape (http://metascape.org/gp/index.html#/main/step1). The overlaps between the differentially expressed gene lists at the GO term level were displayed with an enrichment analysis circle diagram (http://soft.sangerbox.com/). The DEGs were then imported into FunRich (functional enrichment analysis tool) (http://www.funrich.org/) for KEGG pathway analysis. GeneMANIA (http://genemania.org/search/) was used to construct a gene–gene interaction network for the DEGs to evaluate the functions of these genes.
Enrichment analysis by gene set enrichment analysis (GSEA)
GSEA version 4.1.0 software was used to analyze gene function with gene sets from the MSigDB database on the GSEA website (http://software.broadinstitute.org/gsea/msigdb). The default weighted enrichment method was applied for the enrichment analysis. The number of random permutations was set to 1000. GO and KEGG pathway enrichment analyses were performed for the IFI27 high and low expression groups using GSEA. FDR < 0.25, NOM p-value < 0.05 and |NES| > 1 were considered to indicate significant enrichment. The data used are shown in Additional file 2: Table S2.
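The three significance criteria stated above can be expressed as a single filter. The sketch below only illustrates those thresholds; the function name, the result layout and the gene set names (`SET_A` etc.) are hypothetical, and the numbers are made up rather than taken from the study.

```python
def is_significantly_enriched(nes: float, nom_p: float, fdr: float) -> bool:
    """Apply the thresholds from the text: FDR < 0.25, NOM p < 0.05 and |NES| > 1."""
    return fdr < 0.25 and nom_p < 0.05 and abs(nes) > 1.0


# Toy GSEA-style result rows (illustrative values only).
toy_results = [
    {"gene_set": "SET_A", "nes": 2.1, "nom_p": 0.001, "fdr": 0.02},
    {"gene_set": "SET_B", "nes": -1.4, "nom_p": 0.03, "fdr": 0.30},  # FDR too high
    {"gene_set": "SET_C", "nes": 0.8, "nom_p": 0.01, "fdr": 0.10},   # |NES| too small
]
kept = [
    row["gene_set"]
    for row in toy_results
    if is_significantly_enriched(row["nes"], row["nom_p"], row["fdr"])
]
print(kept)
```

All three conditions must hold together, so a gene set with a strong NES but an FDR of 0.30 is still excluded.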
Construction of weighted gene co-expression network analysis (WGCNA)
The WGCNA package in R was used to build a co-expression network targeting the DEGs. We established a weighted adjacency matrix and defined a correlation power (soft thresholding parameter) that emphasizes strong correlations between genes and penalizes weak ones. We then converted the adjacency matrix into a topological overlap matrix (TOM) to measure the network connectivity of each gene, defined as the sum of its adjacencies with all other genes in the network, and calculated the corresponding dissimilarity. Average linkage hierarchical clustering based on the TOM dissimilarity measure was used to group genes with similar expression profiles into gene modules, which are represented by branches and different colors of the cluster tree. We then constructed module relationships, calculated the correlation between gene modules and phenotypes, and identified the modules related to clinical traits. The data used are shown in Additional file 3: Table S3. Scripts are provided in Additional file 4.
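To make the adjacency and TOM steps concrete, the sketch below computes them in plain Python for a toy expression matrix. It is not the WGCNA package and not the authors' script (that is in Additional file 4). It assumes the standard unsigned definitions, adjacency a_ij = |cor(i, j)|^beta and TOM_ij = (sum_u a_iu a_uj + a_ij) / (min(k_i, k_j) + 1 - a_ij), and it uses beta = 6 purely as an illustrative value, because the power actually chosen is not reported in this section. Hierarchical clustering and module detection are left out.

```python
import math
from typing import List

Matrix = List[List[float]]


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation of two equally long profiles, clamped to [-1, 1]."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return max(-1.0, min(1.0, sxy / math.sqrt(sxx * syy)))


def adjacency(expr: Matrix, beta: float = 6.0) -> Matrix:
    """Unsigned soft-threshold adjacency a_ij = |cor|**beta, with a zero diagonal."""
    g = len(expr)
    return [
        [0.0 if i == j else abs(pearson(expr[i], expr[j])) ** beta for j in range(g)]
        for i in range(g)
    ]


def tom_dissimilarity(adj: Matrix) -> Matrix:
    """Return 1 - TOM for every gene pair (diagonal left at 0)."""
    g = len(adj)
    k = [sum(row) for row in adj]  # connectivity: sum of adjacencies to other genes
    diss = [[0.0] * g for _ in range(g)]
    for i in range(g):
        for j in range(g):
            if i == j:
                continue
            shared = sum(adj[i][u] * adj[u][j] for u in range(g) if u not in (i, j))
            tom = (shared + adj[i][j]) / (min(k[i], k[j]) + 1.0 - adj[i][j])
            diss[i][j] = 1.0 - tom
    return diss


# Toy matrix: rows are genes, columns are samples (made-up values).
expr = [
    [1.0, 2.0, 3.0, 4.0],   # gene 0
    [2.0, 4.0, 6.0, 8.0],   # gene 1, perfectly correlated with gene 0
    [1.0, 3.0, 2.0, 1.0],   # gene 2, weakly correlated with both
]
diss = tom_dissimilarity(adjacency(expr))
print(round(diss[0][1], 4), round(diss[0][2], 4))
```

Raising the correlation to a power is what pushes weak correlations towards zero: in the toy matrix the weakly correlated gene ends up with a dissimilarity close to 1 from the other two, while the perfectly correlated pair stays close to 0.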
Evaluation of immune cell infiltration
To evaluate the abundance of immune infiltrates, we uploaded the gene expression matrix data to CIBERSORT (https://cibersort.stanford.edu/) and obtained the immune cell infiltration matrix. We then used the “corrplot” package to draw a correlation heatmap visualizing the correlations among the 22 types of infiltrating immune cells. The “ggplot2” package was used to perform PCA clustering analysis on the immune cell infiltration matrix data and draw a two-dimensional PCA clustering map, and to draw violin plots visualizing the differences in immune cell infiltration. The data used are shown in Additional file 5: Table S4 and Additional file 6: Table S5, and the results are shown in Additional file 7: Table S6. Scripts are provided in Additional file 8.
Principal component analysis (PCA)
Pearson’s correlation test was performed to verify intra-group data repeatability within each group. The R programming language provided the software and operating environment for statistical analysis and plotting. The intra-group data repeatability of each dataset was tested by sample clustering analysis.
Multivariate modelling with combinations of the selected genes was used to identify biomarkers with high sensitivity and specificity for SLE diagnosis, using an online visualization tool (https://hiplot.com.cn/basic/roc). One dataset was used as the training set and another as the validation set, iteratively. Receiver operating characteristic curves were plotted and the area under the curve (AUC) was calculated separately to evaluate the performance of each model using the R package “pROC”. An AUC > 0.9 indicated that the model had a good fit.
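For readers unfamiliar with the AUC, the sketch below computes it directly from its probabilistic definition: the fraction of (patient, control) pairs in which the patient has the higher score, with ties counted as one half. This equals the area under the empirical ROC curve. It is a stand-alone illustration rather than the pROC implementation, and the scores are invented toy values, not data from the study.

```python
from typing import Sequence


def roc_auc(scores_pos: Sequence[float], scores_neg: Sequence[float]) -> float:
    """AUC as P(score_pos > score_neg) + 0.5 * P(score_pos == score_neg)."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Made-up marker expression values for illustration only.
sle_scores = [8.1, 7.4, 9.0, 6.2]
control_scores = [5.0, 6.5, 4.8, 5.5]
auc = roc_auc(sle_scores, control_scores)
print(auc)  # 15 of the 16 pairs are ordered correctly -> 0.9375
```

Under the criterion stated above, this toy marker (AUC = 0.9375) would count as having a good fit.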
Identification and analysis of DEGs in datasets
Based on the high-throughput analysis, DEGs in the six microarray datasets (GSE4588 (CD4 T cells), GSE4588 (B cells), GSE81622 (SLE), GSE81622 (LN), GSE144390 and GSE50772) were screened after the chip results were normalised (Additional file 9: Table S7). As shown in the Venn diagram, 6 genes overlapped in the six datasets (Fig. 1a). Based on the integrated analysis, the 6 significantly up-regulated genes are shown in a heatmap (Fig. 1b). GO enrichment analysis was used to evaluate the potential mechanisms of the DEGs in the molecular function, biological process and cellular component categories. The results showed that these genes were functionally associated with several immune-related biological processes. The Circos plot presents the overlap between the differentially expressed gene lists of the six datasets at the shared term level (Fig. 1c). KEGG pathway analysis showed that the related genes were involved in the interferon signaling and interferon alpha/beta signaling pathways (Fig. 1d). The Circos plot of the overlap between the differentially expressed gene lists of the six datasets at the gene level is shown in Additional file 10: Figure S1. The bubble chart of the GO terms of the 6 DEGs from the six datasets is shown in Additional file 10: Figure S2. Corrgrams were derived from Pearson correlation values between the DEGs (Fig. 1e, Additional file 11: Table S8) and between the six datasets (Fig. 1f, Additional file 12: Table S9). We further investigated the differences in the expression levels of the six genes. The results showed that IFI27 was the most significantly up-regulated gene in the six datasets (Fig. 2a–f).
Construction of a ceRNA network
To better understand the effect of circRNAs and lncRNAs on mRNAs mediated by their binding to miRNAs, we built two ceRNA networks based on the abovementioned data and used Power BI (https://powerbi.microsoft.com/zh-cn/) to visualize them (Fig. 3a, b). Interactions of circRNAs and lncRNAs with miRNAs were retrieved from the TargetScan (http://www.targetscan.org) database. CircRNAs and lncRNAs with weak miRNA interactions were removed on the basis of clipExpNum from the large-scale CLIP-Seq data of the TargetScan database (Additional file 13: Table S10, 11). Moreover, miRNAs predicted to interact with the mRNAs in more than two of the seven databases were selected (Additional file 14: Table S12). The seven databases were PITA, RNA22, miRmap, microT, miRanda, PicTar and TargetScan.
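The database-support rule described above (keep a miRNA–mRNA pair only if more than two of the seven databases predict it) amounts to a simple vote count. The sketch below is an assumed reading of that rule, not the authors' code; the function name, the input layout and the miRNA names (`miR-A`, `miR-B`, `miR-C`) are hypothetical, and the toy predictions are invented.

```python
from typing import Dict, Set, Tuple

Pair = Tuple[str, str]  # (miRNA, mRNA)

# The seven prediction databases named in the text.
DATABASES = ("PITA", "RNA22", "miRmap", "microT", "miRanda", "PicTar", "TargetScan")


def supported_pairs(predictions: Dict[str, Set[Pair]], more_than: int = 2) -> Set[Pair]:
    """Return pairs predicted by strictly more than `more_than` databases."""
    counts: Dict[Pair, int] = {}
    for db in DATABASES:
        for pair in predictions.get(db, set()):
            counts[pair] = counts.get(pair, 0) + 1
    return {pair for pair, n in counts.items() if n > more_than}


# Toy predictions (illustrative only).
toy_predictions = {
    "PITA": {("miR-A", "IFI27"), ("miR-B", "IFI27")},
    "RNA22": {("miR-A", "IFI27")},
    "miRanda": {("miR-A", "IFI27"), ("miR-B", "IFI27")},
    "TargetScan": {("miR-C", "OAS1")},
}
selected = supported_pairs(toy_predictions)
print(selected)
```

Here `miR-A` is supported by three databases and is kept, whereas `miR-B` (two databases) and `miR-C` (one database) are dropped.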
Functional enrichment analyses and PCA of datasets
Metascape enrichment analysis revealed that the DEGs were markedly enriched in cytokine-regulated signaling pathways, interferon alpha/beta signaling pathways and the response to bacterium (Fig. 4a, b). Moreover, the Metascape enrichment analysis also demonstrated that the DEGs between control and SLE were markedly enriched in the six datasets (Fig. 4c). We constructed a gene–gene interaction network for the DEGs using the GeneMANIA database to analyze the functions of these genes. The hub nodes representing the DEGs were surrounded by 20 nodes representing genes that were significantly correlated with the DEGs (Fig. 4d).
To further validate intra-group data repeatability, we performed principal component analysis (PCA), which showed distinct distribution patterns for the SLE and control groups based on the expression of genes in all samples. Samples within the control group lay close together, as did samples within the SLE group, along PC1 (Fig. 5a–e). This indicated a clear difference between the two groups.
Gene set enrichment analysis (GSEA) of IFI27-associated gene set
We used GSEA to analyze the enriched GO terms and KEGG pathways in the samples with high IFI27 expression in the different datasets. We then identified the commonly enriched gene sets: response to type I interferon signaling and the proteasome KEGG pathway (Fig. 6a–f). These results indicate that the interferon response is one of the biological pathways most relevant to the pathogenesis of SLE.
Construction of co-expression modules by weighted gene co-expression network analysis (WGCNA) of datasets
In this study, we obtained the expression matrices of all samples in four datasets (GSE50772, GSE4588 (B cell), GSE4588 (CD4 T cell) and GSE81622; Additional file 5: Table S4). We then selected the top 30–50% most variable genes (fewer than 5000) for co‐expression analysis (Additional file 3: Table S3). We excluded the GSE144390 dataset because of its small number of samples. The eigengene adjacency heatmap showed that the red module was the most positively correlated with the occurrence of SLE, and the green module was the most negatively correlated with the occurrence of SLE. The analysis indicated that IFI27 was chiefly enriched in the module correlated with the occurrence of SLE (Fig. 7a–d, Additional file 15: Figure S3, 4, Additional file 16: Table S13).
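The gene pre-selection step described above can be sketched as a ranking by variability followed by a cut. The text does not say which variability statistic was used, so the sample variance here is an assumption, as are the function name, the parameter names and the toy gene names and values.

```python
from statistics import variance
from typing import Dict, List


def top_variable_genes(expr: Dict[str, List[float]],
                       fraction: float = 0.5,
                       upper_bound: int = 5000) -> List[str]:
    """Rank genes by sample variance and keep the top `fraction`,
    always returning fewer than `upper_bound` genes."""
    ranked = sorted(expr, key=lambda gene: variance(expr[gene]), reverse=True)
    n_keep = min(int(len(ranked) * fraction), upper_bound - 1)
    return ranked[:n_keep]


# Toy expression values across four samples (made up for illustration).
toy_expr = {
    "g1": [1.0, 1.0, 1.0, 1.1],
    "g2": [1.0, 5.0, 9.0, 2.0],
    "g3": [2.0, 2.0, 3.0, 2.0],
    "g4": [0.0, 10.0, 0.0, 10.0],
}
selected_genes = top_variable_genes(toy_expr, fraction=0.5)
print(selected_genes)
```

Filtering to the most variable genes keeps the co-expression network tractable and removes genes whose near-constant expression carries little correlation signal.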
Immune cell infiltration results
The violin plots of the differences in immune cell infiltration showed that, compared with normal control samples, T cells CD4 naive (p = 0.055) infiltrated more and T cells CD4 memory resting (p = 0.001) infiltrated less in GSE4588 (CD4 T cell) (Fig. 8a). Monocytes (p = 0.014) and neutrophils (p < 0.001) infiltrated more, and NK cells resting (p < 0.001) and T cells CD4 memory resting (p < 0.001) infiltrated less, in GSE50772 (Fig. 8b). The GSE4588 (B cell) dataset showed no significant difference in immune cell infiltration (Additional file 17: Figure S5). Monocytes (p < 0.001) infiltrated more and NK cells resting (p < 0.001) infiltrated less in GSE81622 (LN) (Fig. 9a). Likewise, monocytes (p < 0.001) infiltrated more and NK cells resting (p < 0.001) infiltrated less in GSE81622 (SLE) (Fig. 9b). The correlation heatmap of the 22 types of immune cells revealed that B cells memory had a negative correlation with B cells naive in GSE4588 (B cell) (Fig. 10a). In GSE4588 (CD4 T cell), T cells CD8 had a significant positive correlation with T cells gamma delta, T cells follicular helper had a positive correlation with T cells regulatory (Tregs), and T cells CD4 memory resting had a negative correlation with T cells CD8 (Fig. 10b). NK cells activated had a significant positive correlation with neutrophils in GSE81622 (SLE) (Fig. 10c). NK cells resting had a negative correlation with monocytes in GSE81622 (SLE) (Fig. 10c) and GSE81622 (LN) (Fig. 10d). In addition, we excluded the GSE144390 dataset because of its small number of samples, and the GSE50772 dataset showed no significant correlation of cell infiltration (Additional file 18: Figure S6). By PCA, the proportions of immune cells in samples from SLE patients and normal controls displayed distinct group-bias clustering and individual differences (Fig. 11a–e, Additional file 19: Table S14).
Diagnostic significance of DEGs
To determine which DEGs have diagnostic significance for SLE patients, ROC analyses were conducted to explore the sensitivity and specificity of the DEGs for SLE diagnosis. The results showed that IFI27 had the best diagnostic value for differentiating patients with SLE from healthy controls (Fig. 12a, Additional file 20: Figure S7). ROC curve analysis of the model in the GSE50772 (AUC = 0.934426) and GSE81622 (AUC = 0.972000) training sets demonstrated its promising predictive value for SLE. We then validated the model in the validation sets, where the AUCs were 0.910880 and 0.948162 (Fig. 12a). This suggested that the expression of IFI27 correlates with the disease activity of SLE, and that IFI27 could act as a biomarker to estimate SLE activity and to verify the effectiveness of SLE treatment.
SLE is one of the most common autoimmune diseases worldwide. Therefore, there is an urgent need for a better understanding of its detailed mechanisms in order to develop novel strategies to diagnose and treat SLE.
In this study, a series of bioinformatics analyses identified 6 common DEGs (IFI27, IFI44, IFI44L, IFI6, EPSTI1 and OAS1) between SLE and normal samples based on gene expression profiles obtained from the GSE50772, GSE81622 (LN), GSE81622 (SLE), GSE4588 (B cell), GSE4588 (CD4 T cell) and GSE144390 datasets. Furthermore, we investigated the biological functions of these common DEGs using online tools, and GO analysis revealed that these DEGs are significantly associated with changes in immune function and the interferon response. Both pathway and GSEA enrichment analyses indicated that the interferon signaling pathway is a key pathway involved in SLE, which is in line with previous studies [25,26,27,28,29]. Moreover, the PPI network of the DEGs was constructed with GeneMANIA. These DEGs could help improve the diagnosis and treatment of SLE and may point to a new direction for understanding the disease. To better understand SLE progression, candidate biomarkers of SLE were identified using WGCNA, which yielded several modules correlated with SLE. IFI27, which had high functional significance, was selected as the central gene in the clinically significant module. We then analyzed the correlation between these genes and patient diagnosis. ROC analyses were conducted to explore the sensitivity and specificity of the DEGs for SLE diagnosis. Among the 6 common DEGs identified, IFI27 showed high sensitivity and specificity for SLE diagnosis in both the training sets (AUC > 0.9) and the validation sets (AUC > 0.9). Thus, IFI27 may be a potential molecular signature for the diagnosis of SLE, and we speculate that it may play an important role in the disease progression of SLE. Since IFI27 exhibited the most dramatic difference in expression, we focused on the IFI27 gene in our subsequent analyses.
IFI27 (Interferon Alpha Inducible Protein 27) is involved in different biological processes [30, 31]. It is also involved in type I interferon-induced apoptosis, which is characterized by a rapid and robust release of cytochrome C from the mitochondria and activation of BAX and caspases 2, 3, 6, 8 and 9. In the innate immune response, IFI27 has antiviral activity against hepatitis C virus (HCV). It may prevent replication of the virus by recruiting both the HCV non-structural protein 5A (NS5A) and the ubiquitination machinery via SKP2, promoting the ubiquitin-mediated proteasomal degradation of NS5A [31, 33]. Diseases associated with IFI27 include hepatitis C virus and oral leukoplakia, and its related pathways include interferon gamma signaling and the innate immune system [34, 35]. However, the relationship between IFI27 and the occurrence and progression of SLE has not been investigated. Other studies based on multiple datasets focused only on screening key genes [36,37,38,39] and did not specifically analyse the molecular mechanisms by which core genes act. Overall, our results indicate that targeting IFI27 might reduce the molecules that mediate immune infection, suggesting that combining blockade of IFI27 and of coinhibitory molecules may serve as a new immunotherapy against SLE. In addition, research shows that immune cell infiltration plays an important role in the development of SLE. Therefore, finding specific diagnostic markers and analyzing the pattern of immune cell infiltration in SLE are of great significance for improving the prognosis of SLE patients. To further explore the role of immune cell infiltration in SLE, we used CIBERSORT to conduct a comprehensive evaluation of SLE immune infiltration. We found that increased infiltration of monocytes and decreased infiltration of resting NK cells may be related to the occurrence and development of SLE.
Previous studies have shown that the infiltration of cells in SLE is relatively high and is related to structural damage in patients with SLE. It has also been shown that the tissue parenchyma is capable of suppressing T cell responses and limiting damage to self. These findings suggest avenues for the treatment of autoimmunity based on selectively exploiting the exhausted phenotype of tissue-infiltrating T cells in SLE. Hanaoka et al. found that CD4+ Foxp3+ IL-17A+ cells infiltrated the renal biopsy specimens of patients with active lupus nephritis. Liao et al. confirmed through in vivo experiments that renal-infiltrating CD11c+ cells are pathogenic in murine lupus nephritis by promoting CD4+ T cell responses. The above evidence from the literature, combined with our results, suggests that IFI27 contributes to SLE through related pathways such as interferon gamma signaling, and that immune cell infiltration plays an important role in SLE and should be a focus of further studies.
SLE is a disease caused by the interaction of multiple susceptibility genes. In recent years, more and more genes have been found to be associated with SLE. Several potential candidate genes for SLE exist in the MHC region. One study verified a strong association between the STAT4 gene polymorphisms rs7574865 and rs10168266 and SLE susceptibility. The PTPN22 rs1310182 A allele and rs1310182 AA genotype were associated with pediatric systemic lupus erythematosus (PSLE) and may be a possible genetic marker for susceptibility to PSLE. Different genetic backgrounds lead to differences in the incidence of SLE. When the function of TREX1 is weakened, the abnormal accumulation of single-stranded DNA may stimulate the production of IFN, which may be one of the important factors contributing to the pathogenesis of SLE. Sandling et al. found that IKBKE and IL8 are susceptibility loci for SLE and emphasized the important function of the type I interferon pathway in the pathogenesis of SLE, but further functional analysis is still needed. In addition to genetic factors, it is currently believed that the pathogenesis of SLE may be related to abnormal epigenetic modification. Yang et al. identified five SLE-related genes (CDKN1B, TET3, CD80, DRAM1 and ARID5B), revealing that cell cycle regulation, phagocytosis, DNA methylation and other mechanisms play important roles in the pathogenesis of SLE. This study also demonstrated that the pathogenesis of SLE is genetically heterogeneous. These candidate genes may play a pathogenic role through different biological pathways, and different gene mutations may lead to damage to different systems in SLE. The genetic pathogenesis of SLE will once again become a research hotspot.
In this study, we sought to identify biomarkers for SLE and further explore the role of immune cell infiltration in SLE. Our study has some limitations. First, no in vivo experiments were performed to validate these results. Second, the exact mechanisms of the immune reactions induced by IFI27 need to be further investigated. Third, CIBERSORT analysis is based on limited genetic data that may deviate from heterotypic interactions of cells, disease-induced disorders, or phenotypic plasticity. Therefore, our results still need to be verified through in vivo and in vitro experiments and clinical practice.
In summary, based on integrated bioinformatics analyses, we identified differences in biological functions between SLE and normal samples and explored the comprehensive role of IFI27 in SLE progression. In particular, we found that IFI27 was positively correlated with immune function. To our knowledge, this is the first demonstration that IFI27 functions as a positive modulator in SLE. Thus, targeting IFI27 may have therapeutic promise for SLE. In addition, we found that increased infiltration of monocytes and decreased infiltration of resting NK cells may be related to the pathogenesis of SLE.
Availability of data and materials
The datasets generated during and/or analyzed during the current study are available in the Gene Expression Omnibus (GEO) datasets (http://www.ncbi.nlm.nih.gov/geo/).
SLE: Systemic lupus erythematosus
DEGs: Differentially expressed genes
KEGG: Kyoto Encyclopedia of Genes and Genomes
GSEA: Gene set enrichment analysis
WGCNA: Weighted gene co‐expression network analysis
IFI27: Interferon Alpha Inducible Protein 27
PCA: Principal component analysis
ROC: Receiver operating characteristic
TOM: Topological overlap matrix
GEO: Gene expression omnibus
lncRNA: Long non-coding RNA
PBMC: Peripheral blood mononuclear cell
IFI44: Interferon induced protein 44
IFI44L: Interferon induced protein 44 like
IFI6: Interferon induced protein 6
OAS1: 2′-5′-Oligoadenylate Synthetase 1
EPSTI1: Epithelial Stromal Interaction 1
TNFSF4: TNF Superfamily Member 4
NCF1: Neutrophil Cytosolic Factor 1
CXorf21: Chromosome X Open Reading Frame 21
IFIT1: Interferon Induced Protein With Tetratricopeptide Repeats 1
IFIT3: Interferon Induced Protein With Tetratricopeptide Repeats 3
cGAS: Cyclic GMP-AMP synthase
NS5A: Non-structural protein 5A
SKP2: S-Phase Kinase Associated Protein 2
BAX: BCL2 Associated X, Apoptosis Regulator
HCV: Hepatitis C virus
STAT4: Signal Transducer And Activator Of Transcription 4
PTPN22: Protein Tyrosine Phosphatase Non-Receptor Type 22
PSLE: Pediatric systemic lupus erythematosus
TREX1: Three Prime Repair Exonuclease 1
IKBKE: Inhibitor Of Nuclear Factor Kappa B Kinase Subunit Epsilon
CDKN1B: Cyclin Dependent Kinase Inhibitor 1B
TET3: Tet Methylcytosine Dioxygenase 3
DRAM1: DNA Damage Regulated Autophagy Modulator 1
ARID5B: AT-Rich Interaction Domain 5B
Dörner T, Furie R. Novel paradigms in systemic lupus erythematosus. Lancet. 2019;393:2344–58.
Tsokos GC, Lo MS, Costa Reis P, Sullivan KE. New insights into the immunopathogenesis of systemic lupus erythematosus. Nat Rev Rheumatol. 2016;12:716–30.
Justiz Vaillant AA, Goyal A, Bansal P, Varacallo M: Systemic Lupus Erythematosus (SLE). In StatPearls. Treasure Island (FL): StatPearls Publishing Copyright © 2020, StatPearls Publishing LLC.; 2020.
Murphy G, Isenberg DA. New therapies for systemic lupus erythematosus-past imperfect, future tense. Nat Rev Rheumatol. 2019;15:403–12.
Durcan L, O’Dwyer T, Petri M. Management strategies and future directions for systemic lupus erythematosus in adults. Lancet. 2019;393:2332–43.
Onuora S. Rare variants in SLE risk genes drive disease. Nat Rev Rheumatol. 2019;15:384.
Linge P, Arve S. NCF1-339 polymorphism is associated with altered formation of neutrophil extracellular traps, high serum interferon activity and antiphospholipid syndrome in systemic lupus erythematosus. Ann Rheumatic Dis. 2020;79:254–61.
Odhams CA, Roberts AL, Vester SK. Interferon inducible X-linked gene CXorf21 may contribute to sexual dimorphism in Systemic Lupus Erythematosus. Nat Communicat. 2019;10:2164.
Zhao M, Zhou Y, Zhu B, Wan M, Jiang T, Tan Q, Liu Y, Jiang J, Luo S, Tan Y, et al. IFI44L promoter methylation as a blood biomarker for systemic lupus erythematosus. Ann Rheum Dis. 2016;75:1998–2006.
Hu W, Niu G, Li H, Gao H, Kang R, Chen X, Lin L. The association between expression of IFIT1 in podocytes of MRL/lpr mice and the renal pathological changes it causes: an animal study. Oncotarget. 2016;7:76464–70.
Wang J, Dai M, Cui Y, Hou G, Deng J, Gao X, Liao Z, Liu Y, Meng Y, Wu L, et al. Association of abnormal elevations in IFIT3 with overactive cyclic GMP-AMP synthase/stimulator of interferon genes signaling in human systemic lupus erythematosus monocytes. Arthritis Rheumatol. 2018;70:2036–45.
Xiao N, Wei J, Xu S, Du H, Huang M, Zhang S, Ye W, Sun L, Chen Q. cGAS activation causes lupus-like autoimmune disorders in a TREX1 mutant mouse model. J Autoimmun. 2019;100:84–94.
Barnes BJ. Genetic versus non-genetic drivers of SLE: implications of IRF5 dysregulation in both roads leading to SLE. Curr Rheumatol Rep. 2019;21:2.
Wang Z, Monteiro CD, Jagodnik KM, Fernandez NF, Gundersen GW, Rouillard AD, Jenkins SL, Feldmann AS, Hu KS, McDermott MG, et al. Extraction and analysis of signatures from the gene expression omnibus by the crowd. Nat Commun. 2016;7:12846.
Zhu H, Mi W, Luo H, Chen T, Liu S, Raman I, Zuo X, Li QZ. Whole-genome transcription and DNA methylation analysis of peripheral blood mononuclear cells identified aberrant gene regulation pathways in systemic lupus erythematosus. Arthritis Res Ther. 2016;18:162.
Kennedy WP, Maciuca R, Wolslegel K, Tew W, Abbas AR, Chaivorapol C, Morimoto A, McBride JM, Brunetta P, Richardson BC, et al. Association of the interferon signature metric with serological disease manifestations but not global activity scores in multiple cohorts of patients with SLE. Lupus Sci Med. 2015;2:e000080.
Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27:1739–40.
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559.
Corces MR, Buenrostro JD, Wu B, Greenside PG, Chan SM, Koenig JL, Snyder MP, Pritchard JK, Kundaje A. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nat Genet. 2016;48:1193–203.
Friendly M. Corrgrams: exploratory displays for correlation matrices. Am Stat. 2002;56:316–24.
Ginestet C. ggplot2: elegant graphics for data analysis. J R Stat Soc Ser A Stat Soc. 2011;174:245.
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, Müller M. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77.
Nishide M, Kumanogoh A. The role of semaphorins in immune responses and autoimmune rheumatic diseases. Nat Rev Rheumatol. 2018;14:19–31.
Gatto M, Zen M, Iaccarino L, Doria A. New therapeutic strategies in systemic lupus erythematosus management. Nat Rev Rheumatol. 2019;15:30–48.
Becker AM, Dao KH, Han BK, Kornu R, Lakhanpal S, Mobley AB, Li QZ, Lian Y, Wu T, Reimold AM, et al. SLE peripheral blood B cell, T cell and myeloid cell transcriptomes display unique profiles and each subset contributes to the interferon signature. PLoS ONE. 2013;8:e67003.
Barrat FJ, Crow MK, Ivashkiv LB. Interferon target-gene expression and epigenomic signatures in health and disease. Nat Immunol. 2019;20:1574–83.
Wahadat MJ, Bodewes ILA, Maria NI, van Helden-Meeuwsen CG, van Dijk-Hummelman A, Steenwijk EC, Kamphuis S, Versnel MA. Type I IFN signature in childhood-onset systemic lupus erythematosus: a conspiracy of DNA- and RNA-sensing receptors? Arthritis Res Ther. 2018;20:4.
Ugolini-Lopes MR, Torrezan GT, Gândara APR, Olivieri EHR, Nascimento IS, Okazaki E, Bonfá E, Carraro DM, de Andrade DCO. Enhanced type I interferon gene signature in primary antiphospholipid syndrome: association with earlier disease onset and preeclampsia. Autoimmun Rev. 2019;18:393–8.
Crow MK, Olferiev M, Kirou KA. Type I interferons in autoimmune disease. Annu Rev Pathol. 2019;14:369–93.
Papac-Milicevic N, Breuss JM, Zaujec J, Ryban L, Plyushch T, Wagner GA, Fenzl S, Dremsek P, Cabaravdic M, Steiner M, et al. The interferon stimulated gene 12 inactivates vasculoprotective functions of NR4A nuclear receptors. Circ Res. 2012;110:e50–63.
Xue B, Yang D, Wang J, Xu Y, Wang X, Qin Y, Tian R, Chen S, Xie Q, Liu N, Zhu H. ISG12a restricts hepatitis C virus infection through the ubiquitination-dependent degradation pathway. J Virol. 2016;90:6832–45.
Gytz H, Hansen MF, Skovbjerg S, Kristensen AC, Hørlyck S, Jensen MB, Fredborg M, Markert LD, McMillan NA, Christensen EI, Martensen PM. Apoptotic properties of the type 1 interferon induced family of human mitochondrial membrane ISG12 proteins. Biol Cell. 2017;109:94–112.
Chen Y, Jiao B, Yao M, Shi X, Zheng Z, Li S, Chen L. ISG12a inhibits HCV replication and potentiates the anti-HCV activity of IFN-α through activation of the Jak/STAT signaling pathway independent of autophagy and apoptosis. Virus Res. 2017;227:231–9.
Shrivastava S, Meissner EG, Funk E, Poonia S, Shokeen V, Thakur A, Poonia B, Sarin SK, Trehanpati N, Kottilil S. Elevated hepatic lipid and interferon stimulated gene expression in HCV GT3 patients relative to non-alcoholic steatohepatitis. Hepatol Int. 2016;10:937–46.
Tsuzuki H, Fujieda S, Sunaga H, Narita N, Tokuriki M, Saito H. Expression of p27 and apoptosis in oral leukoplakia. Anticancer Res. 2003;23:1265–70.
Ishii T, Onda H, Tanigawa A, Ohshima S, Fujiwara H, Mima T, Katada Y, Deguchi H, Suemura M, Miyake T, et al. Isolation and expression profiling of genes upregulated in the peripheral blood cells of systemic lupus erythematosus patients. DNA Res. 2005;12:429–39.
Chiche L, Jourde-Chiche N, Whalen E, Presnell S, Gersuk V, Dang K, Anguiano E, Quinn C, Burtey S, Berland Y, et al. Modular transcriptional repertoire analyses of adults with systemic lupus erythematosus reveal distinct type I and type II interferon signatures. Arthritis Rheumatol. 2014;66:1583–95.
Bing PF, Xia W, Wang L, Zhang YH, Lei SF, Deng FY. Common marker genes identified from various sample types for systemic lupus erythematosus. PLoS ONE. 2016;11:e0156234.
O’Hanlon TP, Rider LG, Gan L, Fannin R, Paules RS, Umbach DM, Weinberg CR, Shah RR, Mav D, Gourley MF, Miller FW. Gene expression profiles from discordant monozygotic twins suggest that molecular pathways are shared among multiple systemic autoimmune diseases. Arthritis Res Ther. 2011;13:R69.
Moulton VR, Tsokos GC. T cell signaling abnormalities contribute to aberrant immune cell function and autoimmunity. J Clin Invest. 2015;125:2220–7.
Furie R, Werth VP, Merola JF, Stevenson L, Reynolds TL, Naik H, Wang W, Christmann R, Gardet A, Pellerin A, et al. Monoclonal antibody targeting BDCA2 ameliorates skin lesions in systemic lupus erythematosus. J Clin Invest. 2019;129:1359–71.
Tilstra JS, Avery L, Menk AV, Gordon RA, Smita S, Kane LP, Chikina M, Delgoffe GM, Shlomchik MJ. Kidney-infiltrating T cells in murine lupus nephritis are metabolically and functionally exhausted. J Clin Invest. 2018;128:4884–97.
Hanaoka H, Nishimoto T, Okazaki Y, Takeuchi T, Kuwana M. A unique thymus-derived regulatory T cell subset associated with systemic lupus erythematosus. Arthritis Res Ther. 2020;22:88.
Liao X, Ren J, Reihl A, Pirapakaran T, Sreekumar B, Cecere TE, Reilly CM, Luo XM. Renal-infiltrating CD11c(+) cells are pathogenic in murine lupus nephritis through promoting CD4(+) T cell responses. Clin Exp Immunol. 2017;190:187–200.
Wang JM, Xu WD, Huang AF. Association of STAT4 Gene Rs7574865, Rs10168266 polymorphisms and systemic lupus erythematosus susceptibility: a meta-analysis. Immunol Invest. 2020;3:1–13.
Bahrami T, Valilou SF, Sadr M, Soltani S, Salmaninejad A, Soltaninejad E, Yekaninejad MS, Ziaee V, Rezaei N. PTPN22 gene polymorphisms in pediatric systemic lupus erythematosus. Fetal Pediatr Pathol. 2020;39:13–20.
Stetson DB, Ko JS, Heidmann T, Medzhitov R. Trex1 prevents cell-intrinsic initiation of autoimmunity. Cell. 2008;134:587–98.
Sandling JK, Garnier S, Sigurdsson S, Wang C, Nordmark G, Gunnarsson I, Svenungsson E, Padyukov L, Sturfelt G, Jonsen A. A candidate gene study of the type I interferon pathway implicates IKBKE and IL8 as risk loci for SLE. Eur J Hum Genet. 2011;19:479–84.
Yang W, Tang H, Zhang Y, Tang X, Zhang J, Sun L, Yang J, Cui Y, Zhang L, Hirankarn N, et al. Meta-analysis followed by replication identifies loci in or near CDKN1B, TET3, CD80, DRAM1, and ARID5B as associated with systemic lupus erythematosus in Asians. Am J Hum Genet. 2013;92:41–51.
This study was supported by the National Natural Science Foundation of China (No. 81673058) and Chongqing Basic Science and Frontier Technology Research (No. cstc2017jcyjAX0251).
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
The original version of this article was revised: a typesetting mistake in the corresponding authorship for this article has been corrected.
Additional file 1: Table S1.
Data cohort characteristics.
Additional file 2: Table S2.
Data used for GSEA analysis.
Additional file 3: Table S3.
Data used for WGCNA analysis.
Additional file 4.
Scripts used for WGCNA.
Additional file 5: Table S4.
Input data sets of immune cell infiltration.
Additional file 6: Table S5.
Sample information of datasets.
Additional file 7: Table S6.
Results of immune cell infiltration.
Additional file 8.
Scripts used for immune cell infiltration.
Additional file 9: Table S7.
DEGs in the datasets.
Additional file 10: Figure S1.
Overlap between differentially expressed gene lists of six datasets. Figure S2. GO enrichment analyses of 6 DEGs from 6 datasets.
Additional file 11: Table S8.
Pearson correlation values between DEGs.
Additional file 12: Table S9.
Pearson correlation values between datasets.
Additional file 13: Table S10.
miRNAs interacting with circRNAs. Table S11. miRNAs interacting with lincRNAs.
Additional file 14: Table S12.
miRNAs interacting with mRNAs.
Additional file 15: Figure S3.
Identification of weighted gene co-expression network modules associated with SLE in the GSE4588 (B cell) and GSE4588 (CD4 T cell) datasets. Figure S4. Identification of weighted gene co-expression network modules associated with SLE in the GSE81622 and GSE50772 datasets.
Additional file 16: Table S13.
Co-expression modules results of datasets.
Additional file 17: Figure S5.
Violin diagram of the proportion of 22 types of immune cells in GSE4588 (B cell) dataset.
Additional file 18: Figure S6.
Cell infiltration of the GSE50772 dataset.
Additional file 19: Table S14.
PCA results of immune cell infiltration.
Additional file 20: Figure S7.
Diagnostic performance of the six genes in three datasets.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Zhao, X., Zhang, L., Wang, J. et al. Identification of key biomarkers and immune infiltration in systemic lupus erythematosus by integrated bioinformatics analysis. J Transl Med 19, 35 (2021). https://doi.org/10.1186/s12967-020-02698-x
- Systemic lupus erythematosus
- Immune infiltration
- Integrated bioinformatics