- Open Access
Identification of hub genes in papillary thyroid carcinoma: robust rank aggregation and weighted gene co-expression network analysis
Journal of Translational Medicine volume 18, Article number: 170 (2020)
Papillary thyroid carcinoma (PTC), which is the most common endocrine malignancy, has been steadily increasing worldwide in incidence over the years, while mechanisms underlying the pathogenesis and diagnostic for PTC are incomplete. The purpose of this study is to identify potential biomarkers for diagnosis of PTC, and provide new insights into pathogenesis of PTC.
Based on weighted gene co-expression network analysis, Robust Rank Aggregation, functional annotation, GSEA and DNA methylation, were employed for investigating potential biomarkers for diagnosis of PTC.
Black and turquoise modules were identified in the gene co-expression network constructed by 1807 DEGs that from 6 eligible gene expression profiles of Gene Expression Omnibus database based on Robust Rank Aggregation and weighted gene co-expression network analysis. Hub genes were significantly down-regulated and the expression levels of the hub genes were different in different stages in hub gene verification. ROC curves indicated all hub genes had good diagnostic value for PTC (except for ABCA6 AUC = 89.5%, the 15 genes with AUC > 90%). Methylation analysis showed that hub gene verification ABCA6, ACACB, RMDN1 and TFPI were identified as differentially methylated genes, and the decreased expression level of these genes may relate to abnormal DNA methylation. Moreover, the expression levels of 8 top hub genes were correlated with tumor purity and tumor-infiltrating immune cells. These findings, including functional annotations and GSEA provide new insights into pathogenesis of PTC.
The hub genes and methylation of hub genes may as potential biomarkers provide new insights for diagnosis of PTC, and all these findings may be the direction to study the mechanisms underlying of PTC in the future.
Thyroid carcinoma is the most common endocrine cancer, and papillary thyroid cancer (PTC) accounts for the highest proportion of thyroid carcinoma. In recent years, the incidence of PTC has been steadily increasing worldwide, which may be due to a real increase, or may be due to the improvement and widespread use of screening techniques . Increased TSH, autoimmune diseases and inflammation were considered risk factors for thyroid cancer . In recent years, researchers believe that the major carcinogenic event of PTC is the activation of mitogen-activated protein kinase (MAPK) . However, the mechanisms underlying the pathogenesis of PTC have not yet been elucidated. Surgical resection, TSH inhibition therapy and radioactive iodine therapy are the conventional treatment methods for PTC . With these therapeutic approaches, most patients with PTC have a good prognosis, but there were some challenges for both patients and clinicians. The consequence of surgical excision that is one of the important treatments for thyroid cancer may be the increased incidence of hypothyroidism ; PTC is often difficult to diagnose because of similarities between malignant and benign nodules, which results in some benign patients having their thyroid removed ; treatment of refractory radioiodine differentiated thyroid cancer still faces challenges and some slow-moving tumors were overdiagnosed and overtreated. In terms of molecular therapy, the combined use of immune checkpoint inhibitors and BRAF (especially BRAFV600E) inhibitors is aussichtsreich in the treatment of thyroid cancer and some progress has been made . However, not all tumors have mutations in BRAF [8,9,10], and other biomarkers are needed. In addition, immunotherapy is associated with immune-related adverse events (autoimmune toxicities) . For example, the use of mAbs anti-cytotoxic T lymphocyte antigen 4 (anti-CTLA-4) and anti-programmed cell death protein-1/programmed cell death ligand-1 (anti-PD-1/PD-L1) causes thyroid dysfunction (including painless thyroiditis and so on)  in 10 percent of cancer patients . DNA methylation, which belongs to epigenetics , can affect gene expression by affecting the structural stability of chromosomes and the interaction between DNA and proteins . Abnormal levels of DNA methylation had been reported in almost all cancers. The study of DNA methylation is very important for the pathogenesis, early diagnosis and prognosis prediction of tumors and methylation drugs are the darlings of recent targeted therapies for cancer for DNA methylation itself is reversible. The combination of genetic alterations and DNA methylation alterations may improve their clinical value. However, the specific gene map and methylation map of PTC are not complete. Therefore, it is necessary to further understand the underlying biological mechanisms underlying the onset and development of PTC and to identify potential biomarkers for diagnosis. More accurate diagnosis leads to more personalized treatment, which is valuable in further improving patient survival.
As a bioinformatics method that can integrate gene lists of different technology platforms, Robust Rank Aggregation (RRA) is widely used in cancer research [15,16,17]. This method can reduce noise increase signal while integrating data information of different platforms, which makes the research results more reliable . The weighted gene co-expression network analysis (WGCNA)  method is widely used in cancer research, but there is still room for the research of establishing gene co-expression network to identify the hub genes closely related to PTC. In order to better explain the biological function genes, gene set enrichment analysis (GSEA) on hub genes was performed. GSEA can assess whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states . Furthermore, we analyzed the relationship between hub genes and tumor immune infiltrating cells since microenvironment composed of tumor immune infiltrating cells can play an important role in the occurrence and progression of tumor by promoting tumor and anti-tumor , and the influence of different infiltrating immune cells on tumor is different . Our study might provide some new insights into current diagnosis and pathogenesis for PTC.
Thyroid cancer gene expression datasets collection and identification of robust DEGs
The data sets enrolled in this study needs to meet the main conditions : The data set must include the gene expression profile of PTC and normal thyroid tissue . The genes in the platform need to be above 5000. The 6 unprocessed gene expression profiles that met the inclusion criteria of this study were downloaded from the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) database: GSE6004 , GSE58545 , GSE27155 [24, 25], GSE53157 , GSE60542  and GSE33630 [28, 29]. The relevant data are shown in Table 1. For more detailed sample information, please see Additional file 2: Table S1. After data normalization using robust multi-array averaging algorithm , “limma”  package were used to assess statistical significance (P-value) for each gene based on a linear model implemented. Then, gene lists in six datasets were integrated using the RRA method to identify differentially expressed genes (DEGs) according to P-value in “RobustRankAggreg” package of R software, and P-value < 0.05 was set as the threshold for DEGs.
Function enrichment analysis
The top 300 DEGs were uploaded to Database for Annotation, Visualization, and Integrated Discovery (DAVID)  for Gene Ontology (GO) functional annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis. The top terms were visualized in “GOplot”  of R software.
Weighted co-expression network construction and key modules identification
1807 DEGs with P < 0.05 obtained from RRA to construct gene co-expression networks with expression data retrieved from The Cancer Genome Atlas (TCGA). Specifically, 1807 genes were selected from the TCGA gene list, and then the 1807 genes with their gene expression values and TCGA sample Numbers formed a correlation matrix. The threshold value of outliers was Z.K value < − 2.5. The correlation matrix (Sij) was converted to an adjacency matrix (Aij) based on soft threshold β that can approximate the scale-free distribution (R2 > 0.8). This transformation allows us to build networks with higher biological signals, which is the focus of the WGCNA approach. Topological overlap matrix (TOM) realizes the visualization of network, which is a simplified network diagram for identifying modules. The hierarchical clustering tree formed by average linkage hierarchical clustering also participates in the formation of modules. By the way, some genes without characteristics were assigned to the gray module. In order to identify the key modules closely related to PTC, module eigengene (ME), gene significance (GS), module membership (MM) and other parameters were calculated . The genes in the key modules were uploaded to DAVID  for GO functional annotation and KEGG enrichment analysis to explore the biological functions of the key modules.
Identification, validation and efficacy evaluation for hub genes
The hub gene is defined as the gene with the highest degree of connectivity in the key module. Specifically, the genes with geneModuleMembership > 0.9 and geneTraitSignificance > 0.5 were determined as the hub genes in the study. Samples in the Cancer Genome Atlas-Thyroid carcinoma (TCGA-THCA) and a separate data set GSE29265 was used to verify that hub genes can distinguish between non-tumor and PTC. If the P-value < 0.01, the selection of the hub gene is considered statistically significant. In addition, we conducted a study to understand the expression patterns of hub genes between different stages of PTC based on GEPIA (Gene Expression Profiling Interactive Analysis, http://gepia.cancer-pku.cn/) , which was a web server for cancer and normal gene expression profiling and interactive analyses. To assess diagnostic values of hub genes, receiver operating characteristic (ROC) curve was plotted and area under the ROC curve (AUC) was calculated with “pROC” of R package  to evaluate the capability of distinguishing tumor and normal tissue.
Methylation analysis of hub genes
Methylation is one of the earliest and most widely studied epigenetics to be included in our study. We conducted methylation analysis of hub genes based on methylation data obtained from the human disease methylation database version 2.0 [37, 38] (DiseaseMeth 2.0, http://bioinfo.hrbmu.edu.cn/diseasemeth/), which collects and annotates the abnormal methylation of various cancers, is a useful resource platform for further understanding the molecular mechanisms of human disease. Data in the platform based on huge international disease projects including TCGA and public genome databases including GEO . Furthermore, the relationship between hub genes expression and their DNA methylation status were investigated based on MEXPRESS (http://mexpress.be) .
Hub genes and tumor-infiltrating immune cells
Due to tumor immune infiltrating cells (B cells, CD4+ T cells, CD8+ T cells, neutrophils, macrophages, and dendritic cells) are closely related to prognosis, thus it is necessary to investigate the correlation between the expression of identified hub genes and tumor infiltrating immune cells, which might provide new ideas for immunotherapy. This analysis for immune infiltrating cells and their interactions with tumor cells [40, 41] was performed on the basis of TIMER, which use to comprehensively investigate molecular characterization of tumor-immune interactions.
Gene set enrichment analysis
GSEA, which is widely used to predict the biological functions of hub genes, were performed in GSEA 4.0.3 and the top terms were visualized in R. The samples in TCGA were divided into two groups (high expression vs. low expression) with median expression values of each hub gene.
Identification of robust DEGs
The P-value of each gene in 6 datasets was calculated based on the “limma” package. Gene lists of six datasets were integrated by RRA method and 1807 DEGs including 796 up-regulated genes and 1011 down-regulated genes were identified to the threshold of P-value < 0.05. The top 20 down-regulated and up-regulated genes were listed based on heat maps (Additional file 1: Figure S1).
Function enrichment analysis
Chord diagrams showed the biological pathways in which the top 300 genes were involved. The top terms were illustrated in Fig. 1. Figure 1a shows that these genes are enriched into extracellular exosome, plasma membrane, integral component of plasma membrane etc. based on GO cellular components. Figure 1b shows that these genes are enriched into signal transduction, response to interleukin − 1, stem cell differentiation and positive regulation of MAP kinase activity etc. based on GO biological processes. Figure 1c shows that these genes are enriched to glycoprotein binding, protein binding and metalloendopeptidase activity etc. based on GO molecular function. These genes were enriched into the transcriptional misregulation in cancer, Rap1 signaling pathway and PI3K − Akt signaling pathway etc. based on KEGG (Fig. 1d).
Weighted co-expression network construction and key modules identification
Since there were 5 samples with Z.K value < − 2.5 (TCGA.KS.A41F, TCGA.EM.A3O3, TCGA.DJ.A2Q1, TCGA.DE.A4MA and TCGA.BJ.A3F0), these 5 samples were considered as outliers and excluded from the subsequent analysis (Additional file 1: Figure S2). Matrix transformation is based on soft threshold β = 7 (scale free R2 = 0.88) (Additional file 1: Figure S3), which is selected according to scale-free topological criteria. 10 gene modules were identified based on TOM and average linkage hierarchical clustering (Additional file 1: Figure S4). In order to identify the modules associated with clinical characteristics, the ME that represents the gene expression profile of each module was calculated. The MEblack (r = − 0.64, P = 5e − 38) and MEturquoise (r = − 0.65, P = 3e − 39) were considered to be the key modules most associated with thyroid cancer (Additional file 1: Figure S5). At the same time, the module membership vs. gene significance showed that the black and turquoise modules were closely related to the disease (Additional file 1: Figure S6). GO and KEGG indicated that black module (Fig. 2a) was mainly enriched to extracellular region, extracellular matrix and extracellular space etc. GO and KEGG indicated that turquoise module was (Fig. 2b) mainly enriched to retinoic acid receptor signaling pathway, oxidoreductase activity and cellular response to zinc ion etc.
Identification, validation and efficacy evaluation for hub genes
16 genes with geneModuleMembership > 0.9 and geneTraitSignificance > 0.5 were determined as the hub genes in Table 2. Figure 3 (TCGA-THCA) and Fig. 4 (GSE29265) intuitively showed that 16 hub genes are significantly down-regulated. Figure 5 ploted in GEPIA illustrated that there were some differences in expression patterns in different stages of PTC. The main difference was between stage I, II, and III: the expression patterns of ABCA6, PID1and TFPI were down in stage I–II–up in stage II–III, ACACB, BCL2, CASC2, ITPR1, MPPED2, MRO, PRKCQ, RMDN1, RNF150, RPS6KA6,SLC26A7, SLC4A4 and TTC30A were up in stage I–II–down in stage II–III (Table 2). Furthermore, ROC curves showed high diagnostic value of 16 hub genes for PTC (Additional file 1: Figure S7A, B): except for ABCA6 AUC = 89.5%, the 15 genes with AUC > 90%. To our knowledge, the eight hub genes (ABCA6, ACACB, TTC30A, RMDN1, RNF150, RPS6KA6, PID1 and TFPI) in Fig. 3 have been poorly studied. Therefore, we focused on these 8 hub genes in this study.
Methylation analysis of hub genes
As far as we know, DNA methylation shuts down the activity of some genes and demethylation induces gene expression. Additional file 1: Figure S8 shown that ABCA6, ACACB, RMDN1 and TFPI were significantly different (P < 0.05), so ABCA6, ACACB, RMDN1 and TFPI were defined as differentially methylated genes (DMGs). Moreover, the Additional file 1: Figure S9A–D illustrated that the expression levels of 4 DMGs are negatively correlated with DNA methylation in MEXPRESS. That indicate the abnormal down-regulation of DMGs likely resulted from hypermethylation.
Hub genes and tumor-infiltrating immune cells
Immune cells play an important role in the development and progression of tumors. Immunotherapy for cancer is becoming more and more important, so it is necessary to study the relationship between hub genes and immune cell infiltration. We studied the relationship between 8 hub genes and different immune cells based on TIMER. However, RMDN1was not involved in this analysis since hub genes RMDN1 is not recognized by TIMER. The seven hub genes were all correlated with the immune cells (B cells, CD4+ T cells, CD8+ T cells, neutrophils, macrophages, and dendritic cells) and tumor purity in the study (Additional file 1: Figure S10A–G).
Gene set enrichment analysis
In order to further explore the expression pathways of 8 hub genes, we conducted GSEA, which is widely used to predict the biological functions of hub genes. These results suggest that these genes are associated with PTC (Fig. 6). ABCA6, PID1, RMDN1, RPS6KA6, TTC30A and TFPI were involved in amino acid metabolism. ACACB, RMDN1, RNF150, TTC30A have been shown to be involved in carbohydrate anabolism. PID1, TFPI and ABCA6 may be involved in the niacin and niacinamide metabolism.
The information of 6 data sets was integrated based on RRA, and 1807 DEGs were screened as the threshold of P-value < 0.05. In order to explore the specific biological role of these genes, the top 300 DEGs were analyzed by GO functional annotation and KEGG enrichment based on DAVID. Functional annotation of the top 300 DEGs may be helpful to the underlying mechanism of PTC. GO terms were obtained such as signal transduction, response to interleukin-1, positive regulation of MAP kinase activity, extracellular exosome, plasma membrane, integral component of plasma membrane and metalloendopeptidase activity. Signal transduction is very important in the occurrence of tumor . Interleukin-1 plays a role in regulating innate immunity and adaptive immunity . In particular, interleukin-1 has dual roles of anti-tumor and pro-tumor . Positive regulation of MAP kinase activity and metalloendopeptidase activity are thought to be associated with thyroid cancer [44, 45]. This is a validation of the major carcinogenic event of PTC is the activation of MAPK. Meanwhile, KEGG terms such as transcriptional misregulation in cancer, Rap1 signaling pathway and PI3K-Akt signaling pathway were obtianed. PI3K-Akt signaling pathway had been reported to play an important role in the development of thyroid cancer [46, 47], breast cancer , colorectal cancer , non-small cell lung cancer  and gastric cancer  etc.
In order to identify the hub genes in the 1807 DEGs, 1807 genes were used to construct a gene co-expression network. MEblack and MEturquoise showed high correlations with PTC. Enrichment analysis showed that the genes in the black module were mainly related to extracellular matrix, and the genes in the turquoise module were enriched to retinoic acid receptor signaling pathway, oxidoreductase activity and cellular response to zinc ion etc. As the main bioactive metabolite of vitamin A, retinoic acid plays a regulatory role in proliferation and differentiation , and it has a profound impact on adipogenesis by activating retinoic acid receptors . It is important to note that retinoic acid receptor signaling pathway may play a role in immune suppression. Previous studies have shown that abnormalities in the retinoic acid receptor signaling pathway are associated with various malignancies , but the specific mechanism between this pathway and PTC remains unclear. We identified 16 hub genes based on our selection criteria and ROC curve indicated that 16 hub genes had good diagnostic value, 8 of which were focused on. Not only did these genes differ significantly between tumours and non-tumours, but their expression patterns differed at different stages based on validation for hub genes. Methylation analysis showed that ABCA6, ACACB, RMDN1 and TFPI were identified as DMGs, and significant down-regulation of DMGs in patients with PTC might be realized by the hypermethylation. DNA methylation is reversible, so targeted therapies are attractive in cancer. The combination of genetic alterations and DNA methylation alterations may improve their clinical value. Therefore, 16 hub genes and DNA methylation of DMGs as potential biomarkers may be used to diagnose state (inert or invasive) of PTC for selecting the most appropriate treatment plan at present, so as to avoid overtreatment, improve the diagnosis of invasive tumor. This may provide new insights into refractory radioiodine differentiated tumors. The expression levels of 8 hub genes were significantly correlated with tumor purity, and these genes were moderately correlated with tumor-infiltrating immune cells. Furthermore, GSEA for hub genes predicted biological functions of these genes. ABCA6 belongs to the superfamily of ATP-binding cassette transporters and ABCA6 may be related to lipid homeostasis in macrophages . Although some members of the superfamily of ATP-binding cassette transporters play important roles in tumor-generating mechanisms and drug resistance , the specific biological role of ABCA6 is not clearly understood. ACACB has been reported in laryngeal squamous cell carcinoma , nasopharyngeal carcinoma  and hepatocellular carcinoma . PID1, a gene that regulates the sensitivity of fat and muscle cells to insulin signals, has been reported as a feature gene in several cancers, including thyroid cancer . RPS6KA6, also known as RSK4, is considered as a tumor suppressor gene due to its resistance to invasion and metastasis, which may be related with inhibition of the MAPK pathway . The activation of MAPK likely resulted from down-regulated of RPS6KA6. RPS6KA6 has been significantly down-regulated in colorectal cancer , ovarian cancer , non-small cell lung cancer , breast cancer, acute myeloid leukemia , etc., while this study is the first to mention that RPS6KA6 is significantly down-regulated in PTC. By the way, RPS6KA6 is also considered a DMG in esophageal cancer . It is worth mentioning that RPS6KA6 was considered as a drug resistance marker for the treatment of cancer by protein kinase inhibitors in a study in 2012 . In general, there has been little research on these 8 hub genes, especially RMDN1, TTC30A and RNF150, thus, it is imperative to fully uncover the specific relationship between hub genes and PTC.
Although WGCNA is widely used as a powerful data-driven tool for various diseases including various solid malignancies and hematologic malignancies, there is little research on establishing gene co-expression networks to identify genes that play a pivotal role in PTC. A new study using the WGCNA method to identify hub genes suggests that 11 hub genes may be involved in the recurrence of papillary thyroid cancer . A total of 16 hub genes were identified in this study, and these 16 genes did not overlap with the above 11 genes. In particular, our study is the first to study PTC using a combination of RRA and WGCNA. However, we cannot deny that there may be bias due to the small sample sizes, more studies are needed to validate our results.
In summary, we identified 1807 DEGs from 6 datasets of PTC data based on RRA methods. From the gene co-expression network, we identified 16 hub genes, which were shown to be significantly down-regulated and expression patterns differed at different stages according to hub gene verification. ROC curve indicated that 16 hub genes had good diagnostic value. For methylation analysis, ABCA6, ACACB, RMDN1 and TFPI were identified as DMGs, and MEXPRESS indicated that decreased expression level of these genes may relate to abnormal DNA methylation. Hub genes and methylation of DMGs may as potential biomarkers provide new insights for diagnosis of PTC. The expression levels of 8 hub genes were correlated with tumor purity as well as tumor-infiltrating immune cells. Although these hub genes were found in PTC, the specific role of these hub genes in the underlying mechanism of PTC is not clear. Therefore, GSEA provides insights into the biological functions of hub genes, which might be the direction of future work. In addition, functional annotations for the top300 DEGs and key modules provide insight into the underlying mechanism of PTC. To further understand the pathogenesis of PTC is the key to adjust the current diagnosis, which can further improve the prognosis of PTC patients.
Availability of data and materials
The data that support the findings of this study are openly available in GEO (https://www.ncbi.nlm.nih.gov/geo/) and TCGA (https://www.cancer.gov/about-nci/organization/ccg/research/structuralgenomics/tcga).
Papillary thyroid cancer,
Mitogen-activated protein kinase,
Anti-cytotoxic T lymphocyte antigen 4,
Anti-programmed cell death protein-1/programmed cell death ligand-1
B-Raf proto-oncogene, serine/threonine kinase
Database for annotation, visualization, and integrated discovery
Differentially expressed genes
Receiver operating characteristic
Area under the ROC curve
Differentially methylated genes,
Gene expression profiling interactive analysis
Gene expression omnibus
Gene set enrichment analysis
Kyoto encyclopedia of genes and genomes
Robust rank aggregation
The cancer genome atlas
The cancer genome atlas-thyroid carcinoma
Topological overlap matrix
Weighted gene co-expression network analysis
Kitahara CM, Devesa SS, Sosa JA. Increases in thyroid cancer incidence and mortality-reply. JAMA. 2017;318(4):390–1.
Ferrari SM, Fallahi P, Elia G, Ragusa F, Ruffilli I, Paparo SR, et al. Thyroid autoimmune disorders and cancer. Semin Cancer Biol. 2019. https://doi.org/10.1016/j.semcancer.2020.03.012.
Riesco-Eizaguirre G, Santisteban P. ENDOCRINE TUMOURS: advances in the molecular pathogenesis of thyroid cancer: lessons from the cancer genome. Eur J Endocrinol. 2016;175(5):R203–17.
Zhang K, Li C, Liu J, Tang X, Li Z. DNA methylation alterations as therapeutic prospects in thyroid cancer. J Endocrinol Invest. 2019;42(4):363–70.
Yang LX, Wu J, Guo ML, Zhang Y, Ma SG. Suppression of long non-coding RNA TNRC6C-AS1 protects against thyroid carcinoma through DNA demethylation of STK4 via the Hippo signalling pathway. Cell Prolif. 2019;52(3):e12564.
Yim JH, Choi AH, Li AX, Qin H, Chang S, Tong ST, et al. Identification of tissue-specific DNA methylation signatures for thyroid nodule diagnostics. Clin Cancer Res. 2019;25(2):544–51.
Varricchi G, Loffredo S, Marone G, Modestino L, Fallahi P, Ferrari SM, et al. The immune landscape of thyroid cancer in the context of immune checkpoint inhibition. Int J Mol Sci. 2019;20(16):3934.
Almubarak H, Qassem E, Alghofaili L, Alzahrani AS, Karakas B. Non-invasive molecular detection of minimal residual disease in papillary thyroid cancer patients. Front Oncol. 2019;9:1510.
Celik M, Bulbul BY, Ayturk S, Durmus Y, Gurkan H, Can N, et al. The relation between BRAFV600E mutation and clinicopathological characteristics of papillary thyroid cancer. Medicinski Glasnik. 2020;17(1).
Ge J, Wang J, Wang H, Jiang X, Liao Q, Gong Q, et al. The BRAF V600E mutation is a predictor of the effect of radioiodine therapy in papillary thyroid cancer. J Cancer. 2020;11(4):932–9.
Ihara K. Immune checkpoint inhibitor therapy for pediatric cancers: a mini review of endocrine adverse events. Clin Pediatr Endocrinol. 2019;28(3):59–68.
Ferrari SM, Fallahi P, Elia G, Ragusa F, Ruffilli I, Patrizio A, et al. Autoimmune endocrine dysfunctions associated with cancer immunotherapies. Int J Mol Sci. 2019;20(10):2560.
Liu Z, Wang Z, Jia E, Ouyang T, Pan M, Lu J, et al. Analysis of genome-wide in cell free DNA methylation: progress and prospect. The Analyst. 2019;144(20):5912–22.
Chandhok NS, Prebet T. Insights into novel emerging epigenetic drugs in myeloid malignancies. Ther Adv Hematol. 2019;10:2040620719866081.
Song ZY, Chao F, Zhuo Z, Ma Z, Li W, Chen G. Identification of hub genes in prostate cancer using robust rank aggregation and weighted gene co-expression network analysis. Aging. 2019;11(13):4736–56.
Kolde R, Laur S, Adler P, Vilo J. Robust rank aggregation for gene list integration and meta-analysis. Bioinform. 2012;28(4):573–80.
Yang Y, Lu Q, Shao X, Mo B, Nie X, Liu W, et al. Development of a three-gene prognostic signature for hepatitis B virus associated hepatocellular carcinoma based on integrated transcriptomic analysis. J Cancer. 2018;9(11):1989–2002.
Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005;4:17.
Subramanian A, Kuehn H, Gould J, Tamayo P, Mesirov JP. GSEA-P: a desktop application for gene set enrichment analysis. Bioinform. 2007;23(23):3251–3.
Finotello F, Trajanoski Z. Quantifying tumor-infiltrating immune cells from transcriptomics data. Cancer Immunol Immunother. 2018;67(7):1031–40.
Fridman WH, Pages F, Sautes-Fridman C, Galon J. The immune contexture in human tumours: impact on clinical outcome. Nat Rev Cancer. 2012;12(4):298–306.
Vasko V, Espinosa AV, Scouten W, He H, Auer H, Liyanarachchi S, et al. Gene expression and functional evidence of epithelial-to-mesenchymal transition in papillary thyroid carcinoma invasion. Proc Natl Acad Sci USA. 2007;104(8):2803–8.
Rusinek D, Swierniak M, Chmielik E, Kowal M, Kowalska M, Cyplinska R, et al. BRAFV600E-Associated gene expression profile: early changes in the transcriptome, based on a transgenic mouse model of papillary thyroid carcinoma. PLoS ONE. 2015;10(12):e0143688.
Giordano TJ, Au AY, Kuick R, Thomas DG, Rhodes DR, Wilhelm KG Jr, et al. Delineation, functional validation, and bioinformatic evaluation of gene expression in thyroid follicular carcinomas with the PAX8-PPARG translocation. Clin Cancer Res. 2006;12(7 Pt 1):1983–93.
Giordano TJ, Kuick R, Thomas DG, Misek DE, Vinco M, Sanders D, et al. Molecular classification of papillary thyroid carcinoma: distinct BRAF, RAS, and RET/PTC mutation-specific gene expression profiles discovered by DNA microarray analysis. Oncogene. 2005;24(44):6646–56.
Pita JM, Banito A, Cavaco BM, Leite V. Gene expression profiling associated with the progression to poorly differentiated thyroid carcinomas. Br J Cancer. 2009;101(10):1782–91.
Tarabichi M, Saiselet M, Tresallet C, Hoang C, Larsimont D, Andry G, et al. Revisiting the transcriptional analysis of primary tumours and associated nodal metastases with enhanced biological and statistical controls: application to thyroid cancer. Br J Cancer. 2015;112(10):1665–74.
Dom G, Tarabichi M, Unger K, Thomas G, Oczko-Wojciechowska M, Bogdanova T, et al. A gene expression signature distinguishes normal tissues of sporadic and radiation-induced papillary thyroid carcinomas. Br J Cancer. 2012;107(6):994–1000.
Tomas G, Tarabichi M, Gacquer D, Hebrant A, Dom G, Dumont JE, et al. A general method to derive robust organ-specific gene expression-based differentiation indices: application to thyroid cancer diagnostic. Oncogene. 2012;31(41):4490–8.
Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4(2):249–64.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47.
Dennis G Jr, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, et al. DAVID: database for annotation, visualization, and integrated discovery. Genome Biol. 2003;4(5):P3.
Walter W, Sanchez-Cabo F, Ricote M. GOplot: an R package for visually combining expression data with functional analysis. Bioinform. 2015;31(17):2912–4.
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 2008;9:559.
Tang Z, Li C, Kang B, Gao G, Li C, Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res. 2017;45(W1):w98.
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, et al. pROC: an open-source package for R and S + to analyze and compare ROC curves. BMC Bioinform. 2011;12:77.
Xiong Y, Wei Y, Gu Y, Zhang S, Lyu J, Zhang B, et al. DiseaseMeth version 2.0: a major expansion and update of the human disease methylation database. Nucleic acids research. 2017;45(D1):D888–d895.
Lv J, Liu H, Su J, Wu X, Liu H, Li B, et al. DiseaseMeth: a human disease methylation database. Nucleic acids research. 2012;40(Database issue):D1030–5.
Koch A, Jeschke J, Van Criekinge W, van Engeland M, De Meyer T. MEXPRESS update 2019. Nucleic Acids Res. 2019;47(W1):W561.
Li B, Severson E, Pignon JC, Zhao H, Li T, Novak J, et al. Comprehensive analyses of tumor immunity: implications for cancer immunotherapy. Genome Biol. 2016;17(1):174.
Li T, Fan J, Wang B, Traugh N, Chen Q, Liu JS, et al. TIMER: a web server for comprehensive analysis of tumor-infiltrating immune cells. Cancer Res. 2017;77(21):e108–10.
da Cruz Silva E, Dontenwill M, Choulier L, Lehmann M. Role of integrins in resistance to therapies targeting growth factor receptors in cancer. Cancers. 2019;11(5):692.
Baker KJ, Houston A, Brint E. IL-1 family members in cancer; two sides to every story. Front Immunol. 2019;10:1197.
Hanly EK, Rajoria S, Darzynkiewicz Z, Zhao H, Suriano R, Tuli N, et al. Disruption of mutated BRAF signaling modulates thyroid cancer phenotype. BMC Res Notes. 2014;7:187.
Roncevic J, Djoric I, Selemetjev S, Jankovic J, Dencic TI, Bozic V, et al. MMP-9-1562 C/T single nucleotide polymorphism associates with increased MMP-9 level and activity during papillary thyroid carcinoma progression. Pathology. 2019;51(1):55–61.
Li X, Li Q, Jin X, Guo H, Li Y. Long non-coding RNA H19 knockdown inhibits the cell viability and promotes apoptosis of thyroid cancer cells through regulating the PI3K/AKT pathway. Exp Ther Med. 2019;18(3):1863–9.
Wang H, Yan X, Zhang H, Zhan X. CircRNA circ_0067934 overexpression correlates with poor prognosis and promotes thyroid carcinoma progression. Med Sci Monit. 2019;25:1342–9.
He H, Wang X, Chen J, Sun L, Sun H, Xie K. High-Mobility group box 1 (HMGB1) promotes angiogenesis and tumor migration by regulating hypoxia-inducible factor 1 (HIF-1alpha) expression via the Phosphatidylinositol 3-Kinase (PI3K)/AKT signaling pathway in breast cancer cells. Med Sci Monit. 2019;25:2352–60.
Peng Q, Yao W, Yu C, Zou L, Shen Y, Zhu Y, et al. Identification of microRNA-181 as a promising biomarker for predicting the poor survival in colorectal cancer. Cancer Med. 2019;8(13):5995–6009.
Zhou F, Geng J, Xu S, Meng Q, Chen K, Liu F, et al. FAM83A signaling induces epithelial-mesenchymal transition by the PI3K/AKT/Snail pathway in NSCLC. Aging. 2019;11:6069.
Pan HM, Lang WY, Yao LJ, Wang Y, Li XL. shRNA-interfering LSD1 inhibits proliferation and invasion of gastric cancer cells via VEGF-C/PI3K/AKT signaling pathway. World J Gastrointest Oncol. 2019;11(8):622–33.
Schmutzler C, Brtko J, Bienert K, Kohrle J. Effects of retinoids and role of retinoic acid receptors in human thyroid carcinomas and cell lines derived therefrom. Exp Clin Endocrinol Diabetes. 1996;104(4):16–9.
Wang B, Yang Q, Harris CL, Nelson ML, Busboom JR, Zhu MJ, et al. Nutrigenomic regulation of adipose tissue development—role of retinoic acid: a review. Meat Sci. 2016;120:100–6.
di Masi A, Leboffe L, De Marinis E, Pagano F, Cicconi L, Rochette-Egly C, et al. Retinoic acid receptors: from molecular mechanisms to cancer therapy. Mol Aspects Med. 2015;41:1–115.
Kaminski WE, Wenzel JJ, Piehler A, Langmann T, Schmitz G. ABCA6, a novel a subclass ABC transporter. Biochem Biophys Res Commun. 2001;285(5):1295–301.
Hedditch EL, Gao B, Russell AJ, Lu Y, Emmanuel C, Beesley J, et al. ABCA transporter gene expression and poor outcome in epithelial ovarian cancer. J Nat Cancer Instr. 2014. https://doi.org/10.1093/jnci/dju149.
Ding Y, Wu Y, Gao W, Zhang C, Zhao Q, Guo H, et al. Analysis of gene expression profiling variations induced by hsamiR1455poverexpression in laryngeal squamous cell carcinoma cell line Tu177. Mol Med Rep. 2017;16(5):5863–70.
Ge Y, He Z, Xiang Y, Wang D, Yang Y, Qiu J, et al. The identification of key genes in nasopharyngeal carcinoma by bioinformatics analysis of high-throughput data. Mol Biol Rep. 2019;46(3):2829–40.
Pan WY, Zeng JH, Wen DY, Wang JY, Wang PP, Chen G, et al. Oncogenic value of microRNA-15b-5p in hepatocellular carcinoma and a bioinformatics investigation. Oncol Lett. 2019;17(2):1695–713.
Wen JX, Li XQ, Chang Y. Signature gene identification of cancer occurrence and pattern recognition. J Comput Biol. 2018;25(8):907–16.
Lopez-Vicente L, Pons B, Coch L, Teixido C, Hernandez-Losa J, Armengol G, et al. RSK4 inhibition results in bypass of stress-induced and oncogene-induced senescence. Carcinogenesis. 2011;32(4):470–6.
Cai J, Ma H, Huang F, Zhu D, Zhao L, Yang Y, et al. Low expression of RSK4 predicts poor prognosis in patients with colorectal cancer. Int J Clin Exp Pathol. 2014;7(8):4959–70.
Arechavaleta-Velasco F, Zeferino-Toquero M, Estrada-Moscoso I, Imani-Razavi FS, Olivares A, Perez-Juarez CE, et al. Ribosomal S6 kinase 4 (RSK4) expression in ovarian tumors and its regulation by antineoplastic drugs in ovarian cancer cell lines. Med Oncol. 2016;33(2):11.
Li A, Liu D, Liu Y, Zhou Y, Du Z, Song J. A pilot study of RSK4 expression in patients with human non-small cell lung carcinoma. Ann Clin Lab Sci. 2018;48(4):484–9.
Rafiee M, Keramati MR, Ayatollahi H, Sadeghian MH, Barzegar M, Asgharzadeh A, et al. Down-regulation of ribosomal S6 kinase RPS6KA6 in acute myeloid leukemia patients. Cell J. 2016;18(2):159–64.
Xi T, Zhang G. Epigenetic regulation on the gene expression signature in esophagus adenocarcinoma. Pathol Res Pract. 2017;213(2):83–8.
Bender C, Ullrich A. PRKX, TTBK2 and RSK4 expression causes Sunitinib resistance in kidney carcinoma- and melanoma-cell lines. Int J Cancer. 2012;131(2):E45–55.
Zhai T, Muhanhali D, Jia X, Wu Z, Cai Z, Ling Y. Identification of gene co-expression modules and hub genes associated with lymph node metastasis of papillary thyroid cancer. Endocrine. 2019;66(3):573–84.
Ethics approval and consent to participate
Consent for publication
That the article is original, has not already been published in a journal, and is not currently under consideration by another journal.
Conflicts of interest
All authors declare no conflict of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1: Figure S1. Heat map for top 20 down-regulated and top 20 up-regulated genes. Figure S2. Sample dendrogram and trait heatmap. Figure S3. Analysis of network topology for various soft-thresholding powers. Figure S4. Cluster dendrogram. Modules identified by WGCNA based on a dissimilarity measure (1-TOM). Figure S5. Heat map for the correlation between ME and clinical traits of thyroid cancer. Each gene module was colored according to legend. Figure S6. Scatter diagram for black and turquoise modules. Figure S7A-B. ROC curves for hub genes. Figure S8. Boxplot for methylation analysis of 8 hub genes. Figure S9A-D. Association of Methylation sites with expression of 4 differentially methylated genes (DMGs). Figure S10. The relationship between hub gene expression and tumor purity and tumor infiltrating immune cells.
About this article
Cite this article
Liu, Y., Chen, T., Yang, Z. et al. Identification of hub genes in papillary thyroid carcinoma: robust rank aggregation and weighted gene co-expression network analysis. J Transl Med 18, 170 (2020). https://doi.org/10.1186/s12967-020-02327-7
- Papillary thyroid carcinoma
- DNA Methylation
- Robust rank aggregation
- Weighted gene co-expression network analysis