Skip to main content

Bioinformatic gene analysis for potential biomarkers and therapeutic targets of atrial fibrillation-related stroke



Atrial fibrillation (AF) is one of the most prevalent sustained arrhythmias, however, epidemiological data may understate its actual prevalence. Meanwhile, AF is considered to be a major cause of ischemic strokes due to irregular heart-rhythm, coexisting chronic vascular inflammation, and renal insufficiency, and blood stasis. We studied co-expressed genes to understand relationships between atrial fibrillation (AF) and stroke and reveal potential biomarkers and therapeutic targets of AF-related stroke.


AF-and stroke-related differentially expressed genes (DEGs) were identified via bioinformatic analysis Gene Expression Omnibus (GEO) datasets GSE79768 and GSE58294, respectively. Subsequently, extensive target prediction and network analyses methods were used to assess protein–protein interaction (PPI) networks, Gene Ontology (GO) terms and pathway enrichment for DEGs, and co-expressed DEGs coupled with corresponding predicted miRNAs involved in AF and stroke were assessed as well.


We identified 489, 265, 518, and 592 DEGs in left atrial specimens and cardioembolic stroke blood samples at < 3, 5, and 24 h, respectively. LRRK2, CALM1, CXCR4, TLR4, CTNNB1, and CXCR2 may be implicated in AF and the hub-genes of CD19, FGF9, SOX9, GNGT1, and NOG may be associated with stroke. Finally, co-expressed DEGs of ZNF566, PDZK1IP1, ZFHX3, and PITX2 coupled with corresponding predicted miRNAs, especially miR-27a-3p, miR-27b-3p, and miR-494-3p may be significantly associated with AF-related stroke.


AF and stroke are related and ZNF566, PDZK1IP1, ZFHX3, and PITX2 genes are significantly associated with novel biomarkers involved in AF-related stroke.


Atrial fibrillation (AF) is one of the most prevalent sustained arrhythmias, having an age-adjusted hospitalization incidence of 1–4% of the general population and an prevalence rising of > 13% for those older than 80-years-of-age [1, 2]. However, epidemiological data may understate its actual prevalence, because 40% of patients are asymptomatic and remain undiagnosed with subclinical AF [3]. There is also evidence that patients with AF have significantly increased cardiovascular-related morbidity, given its association with atrial and ventricular mechanical or electrical failure, structural and hemodynamic alterations, and thromboembolic events [3].

Stroke is the leading cause of disability and death and has an estimated incidence of 3.73 (95% CI 3.51–3.96) per 1000 person-years among black- and white- adults in an atherosclerosis risk in communities (ARIC) cohort [4]. Furthermore, global increases in stroke prevalence plus stroke-related disability and mortality associated with aging will increase [5, 6]. Thus, we may not now know the actual true burden of stroke due to limits in brain imaging identification in < 10 mm small hypointense areas and silent infarctions for 28% of those patients older than 65-years-of-age [7]. AF is commonly classified as paroxysmal, persistent or permanent, or new onset arrhythmia basing on the present continuous time, which mainly included that paroxysmal AF was self-terminates within 7 days, while persistent AF was lasts longer than 7 days or needs cardioversion, and usually has lasted for 3 months [8]. As we all kwon, AF is considering to be a major cause of ischemic strokes due to irregular heart-rhythm, coexisting chronic vascular inflammation, and renal insufficiency, and blood stasis. According to Rivaroxaban Once Daily Oral Direct Factor Xa Inhibition Compared With Vitamin K Antagonism for Prevention of Stroke and Embolism Trial in Atrial Fibrillation (ROCKET-AF) trial study, Steinberg et al. [9] suggested that the paroxysmal AF patients carrying a lower adjusted rate of stroke or systemic embolism (adjusted HR: 0.78, 95% CI 0.61–0.99, P = 0.045), all-cause mortality (adjusted HR: 0.79, 95% CI 0.67–0.94, P = 0.006), and the composite of stroke or systemic embolism or death (adjusted HR: 0.82, 95% CI 0.71–0.94, P = 0.005) than persistent AF patients after adjusted efficacy and safety outcomes. According to the Oxford vascular study (OXVASC), nearly 43.9% of ischemic strokes were associated with AF among patients 80 years-of-age or older who had a threefold increase in AF in the past 3 decades [10]. However, this assumption has been challenged by the atrial fibrillation reduction atrial pacing trial (ASSERT) which identified a temporal association between subclinical AF and stroke risk among patients with implantable pacemakers and defibrillators. They reported that only 8% and 16% of patients had an association between pre-detected and post-detected AF within months of stroke or systemic embolism, respectively [11]. Of note, AF is often intermittent and asymptomatic, and presents as an electromechanical disassociation of atrial fibrillation. Clinically, current stroke risk scores and traditional diagnosis with an electrocardiogram are practical, while the limitation of predict stroke risk accurately in individual AF patients was significantly identified, especially in persistent AF which carrying a higher risk of stroke or systemic embolism and all-cause mortality [12]. In this study, we identified co-expressed differentially expressed genes (co-DEGs) of persistent AF and stroke and elucidated molecular mechanisms and pathology of AF-related DEGs (AF-DEGs) and stroke-related DEGs (stroke-DEGs). Finally, we provide a bioinformatic analysis of DEGs and predicted microRNAs (miRNAs) for AF patients prone to stroke.


Materials and methods

GSE79768 and GSE58294 datasets were downloaded from GEO ( [13] and expression profiling arrays were generated using GPL570 (HG‑U133_Plus_2) Affymetrix Human Genome U133 Plus 2.0 Array (Affymetrix, Santa Clara, CA). Additionally, the GSE79768 dataset, including 26 specimens with paired left atrial (LA) and right atrial (RA) tissue obtained from 13 patients was used to identify differential LA-to-RA gene expression and molecular mechanisms for patients with persistent AF or sinus rhythm (SR) abnormalities and we describe potential mechanisms of AF-related remodeling in the LA and the relationship between LA arrhythmogenesis and thrombogenesis. In this study, persistent AF patients has lasts continuously for > 6 months, while the SR patients had no evidence of AF clinically and any anti-arrhythmic drug history. Blood samples of GSE58294 were collected from cardioembolic stroke (N = 69) and control patents (N = 23) at < 3, 5, and 24 h.

Data processing

R packages of “affy”, “affyPLM”, and “limma” (, provided by a bioconductor project [14], were applied to assess GSE79768 and GSE58294 RAW datasets. We used background correction, quantile normalization, probe summarization and log2‑transformation, to create a robust multi-array average (RMA), a log-transformed perfect match, and a mismatch probe (PM and MM) methods. The Benjamini‑Hochberg method was used to adjust original p-values, and the false discovery rate (FDR) procedure was used to calculate fold-changes (FC). Genes expression values of the|log2 FC| > 1and adjusted p < 0.05 were used for filtering AF-DEGs. However, the |log2 FC| > 1.5 and adjusted p < 0.05 were used to identify stroke-DEGs, given that blood sample specificity pointed to many genes. Additionally, we calculated and made Venn diagrams for co-DEGs for AF- and stroke-DEGs.

Finally, we applied online prediction tools utilizing microRNA Data Integration Portal (mirDIP) ( [15], miRDB ( [16], TargetScan (v7.1; [17], and DIANA Tools ( [18], to predict potential microRNA targeting. Subsequently, we used the mirDIP, miRDB, TargetScan, and Diana Tools software to predict which of the selected miRNAs could target co-DEGs. We determined 5 top candidate miRNAs based on higher predicted scores for ≥ 3 prediction tools for each co-DEG.

Identification of protein–protein interaction (PPI) networks of DEGs

PPI networks of AF- and stroke-DEGs were analyzed using the search tool for the retrieval of interacting genes (STRING database, V10.5; that predicted protein functional associations and protein–protein interactions. Subsequently, Cytoscape software (V3.5.1; was applied to visualize and analyze biological networks and node degrees, after downloading analytic results of the STRING database with a confidence score > 0.4 [19].

Functional enrichment analysis

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses of AF- and stroke-DEGs were carried out using the database for annotation, visualization and integrated discovery bioinformatics resources (DAVID Gene Functional Classification Tool, [20], and REACTOME databases (v62; [21]. GO terms and KEGG maps of biological functions associated with a p < 0.05 was considered to be significantly enriched. In addition, we presented different biofunctions of AF- and stroke-DEGs in biological processes, molecular functions, and cellular components from DAVID and REACTOME databases, respectively.

Subsequently, the AmiGO database (v2.0; was used to analyze the GO consortium for selected co-DEGs to verify the accuracy and annotate biofunctions of identified co-DEGs [22]. Using microRNA target prediction, online tools from Diana-miRPath (v3.0; [18] were applied to evaluate interactions between miRNA previously identified using prediction tools and co-DEGs involved in AF and stroke.

Identification of co-DEGs associated with nervous or cardiovascular diseases

The comparative toxicogenomics database ( was used to find integrated chemical-gene, chemical-disease, and gene-disease interactions to generate expanded networks and predict novel associations [23]. We used these data to analyze relationships between gene products and nervous or cardiovascular diseases. Here, relationships between co-DEGs and diseases and association or an implied association were identified.


Identification of DEGs

We identified 54,674 probes corresponding to 20,484 genes in GSE79768 and GSE58294 datasets and AF- and stroke-DEGs were confirmed. We found 489 DEGs in LA specimens of AF patients compared with SR patients, including 428 down-regulated genes and 61 up-regulated genes. However, total of 265, 518, and 592 DEGs were identified following the time points of less than 3, 5, and 24 h after stroke, respectively. Here, we defined 210 co expressed DEGs in the three time points mentioned above as the stroke-DEGs. Heatmaps of AF-DEGs in relation to inflammatory and immune response, ion channels, and cell signaling were conducted for genes expression and these data appear in Fig. 1 and Additional file 1: S1. Simultaneously, Fig. 2 and Additional file 2: S2 has shown the genes expression value in relation to inflammatory response, energy metabolism, ions channel and transportation, and neuronal regulation above the stroke-DEGs.

Fig. 1
figure 1

Hierarchical clustering analysis of AF-related differentially expressed genes: ad results of hierarchical clustering analysis for DEGs expression in relation to cellular signaling, ion channel, inflammatory and immune responses. Red, greater expression. Blue, less expression

Fig. 2
figure 2

Hierarchical clustering analysis of stroke-related DEGs: ac results of hierarchical clustering analysis for DEG expression in relation to energy metabolism, ion channel, inflammatory response, and neuronal regulation. Red, greater expression. Blue, less expression. a PPI network of AF-related DEGs; b PPI network of stroke-related DEGs. Red, greater degree. yellow, lesser degree; c Venn diagrams of DEGs

Functional enrichment in Co-DEGs

Figure 3c illustrates expressed AF- and stroke-DEGs and co expressed genes. Interestingly, four co expressed DEGs, including zinc finger protein 566 (ZNF566), PDZK1 interacting protein 1(PDZK1IP1), zinc finger homeobox 3 (ZFHX3), paired-like homeodomain 2 (PITX2), were observed. The AmiGO database was used to confirm GO term enrichment related to biological processes, molecular functions, and cellular components and Co-DEGs were associated with various processes as indicated in Table 1.

Fig. 3
figure 3

PPI network and Venn diagrams: (1) PPI networks from a and b constructed using STRING database for DEGs (threshold > 0.4). (2) Venn diagrams of c of DEGs related to AF and < 3, 5, and 24 h after stroke, respectively. Co-expressed genes, including ZNF566, PDZK1IP1, ZFHX3, and PITX2, are identified

Table 1 The Gene Ontology (GO) terms enrichment for the co-expressed genes of the AF-related stroke

PPI network analysis and functional GO terms and pathway enrichment analyses

We identified 256 and 43 nodes from PPI network of AF- and stroke-DEGs, respectively and these data appear in Fig. 3. Here, the hub nodes, including leucine-rich repeat kinase 2 (LRRK2; degree = 38), calmodulin 1 (CALM1; degree = 25), chemokine (C-X-C motif) receptor 4 (CXCR4; degree = 25), toll-like receptor 4 (TLR4; degree = 21), catenin (cadherin-associated protein), beta 1(CTNNB1; degree = 21), and chemokine (C-X-C motif) receptor 2 (CXCR2; degree = 21) are considering as hub-genes in related to AF maintaining. However, the hub-genes, involved in CD19 (degree = 5), fibroblast growth factor 9 (FGF9; degree = 5), SRY (sex determining region Y)-box 9 (SOX9; degree = 5), guanine nucleotide binding protein (G protein), gamma transducing activity polypeptide 1(GNGT1; degree = 4), and noggin (NOG; degree = 4), are demonstrated in stroke-DEGs with a relative higher degree.

Using the DAVID database, the top 5 GO terms related biological processes among those genes were primarily associated with inflammatory response (Fold Enrichment: 4.08; p value: 1.11E−07), immune response (Fold Enrichment: 3.50; p-value: 2.49E−06), regulation of MAP kinase activity (Fold Enrichment: 9.53; p-value: 1.92E−05), and regulation of NF-kappa B activety (Fold Enrichment: 16.73; p-value: 1.95E−04). There is significant correlation in plasma membrane (Fold Enrichment: 1.60; p-value: 1.37E−06), extracellular region (Fold Enrichment: 1.75; p-value: 8.88E−04), and MHC class II protein complex (Fold Enrichment: 13.47; p-value: 0.003) in relation to cellular components. In addition, the terms related molecular functions were mainly involved in ion channel binding (Fold Enrichment: 5.06; p-value: 0.001), neuropeptide Y receptor activity (Fold Enrichment: 23.84; p-value: 0.007), and transmembrane receptor protein tyrosine kinase adaptor activity (Fold Enrichment: 21.46; p-value: 0.008). With respect to stroke-DEGs, the biological processes terms of regulation of myoblast differentiation (Fold Enrichment: 25.39; p-value: 4.89E−04), endocardial cushion morphogenesis (Fold Enrichment: 27.38; p-value: 0.005), positive regulation of epithelial cell proliferation (Fold Enrichment: 9.73; p-value: 0.008), and fibroblast growth factor receptor signaling pathway (Fold Enrichment: 7.12; p-value: 0.018) were significantly enriched. Similarly, the terms of RNA polymerase II transcription factor activity, sequence-specific DNA binding (Fold Enrichment: 5.15; p-value: 0.006), ISG15-specific protease activity (Fold Enrichment: 146.79; p-value: 0.013), and nucleic acid binding (Fold Enrichment: 2.09; p-value: 0.015) related molecular functions were primarily enriched (As shown in Fig. 4 and Additional file 3: S3).

Fig. 4
figure 4

GO terms and KEGG pathway enrichment: a, b AF-and stroke-related GO term enrichment for DEGs, respectively. c KEGG pathway of AF- and stroke-related DEGs. d Functional and pathway enrichment of AF-and stroke-related DEGs from REACTOME database. Dot sizes represent counts of enriched DEGs, and dot colors represent negative Log10-p values

KEGG pathway analysis data appear in Fig. 4c. The results suggesting that the AF-DEGs were mainly enriched in pathways of cytokine–cytokine receptor interaction (p-value: 1.02E−04), cGMP-PKG signaling pathway (p-value: 0.025), antigen processing and presentation (p-value: 0.022), and NF-kappa B signaling pathway (p-value: 0.037). However, KEGG terms included PI3 K-Akt signaling pathway (p-value: 0.017) and B cell receptor signaling pathway (p-value: 0.045) were enriched in stroke-DEGs. (As shown in Fig. 4c and Additional file 4: S4).GO terms enrichment using the REACTOME database identified additional associations and these appear in Fig. 4d. The CTD database showed that Co-DEGs targeted several nervous system and cardiovascular diseases and these data appear in Fig. 5 and Additional file 5: S5.

Fig. 5
figure 5

Relationship to nervous system and cardiovascular diseases related to co-expressed genes based on the CTD database. *Direct evidence of marker or mechanism in this disease

Identification of functional and pathway enrichment among predicted miRNAs and Co-DEGs

Prediction analysis using mirDIP, miRDB, TargetScan, and DIANA bioinformatic tools identified the top 5 selected miRNAs targeting each Co-DEG involved in AF-related stroke and these data appear in Table 2. These data enable us to understand how predicted miRNAs are related to AF-related stroke progress.

Table 2 The Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways enrichment among predicted miRNAs and Co-DEGs


Predicting AF is needed for stroke prevention but 30% of patients have no signs of AF despite months of continuous cardiac rhythm monitoring. Thus, cardiovascular malignant events may be correlated with irregular and infrequent cardiovascular incidents as well as limitations in electromechanical indices that should predict problems with atrial contractility [7, 11, 12]. Estimating markers and associations between atrial dysfunction and embolic stroke are thus of interest and may be novel therapeutic targets for primary care. The inflammatory and immune response, and ion channel and transportation are significantly associated with AF recurrence and maintenance, as well as the stroke occurrence. Several hub-genes involved directly or indirectly that regulate the nervous system were found among AF-DEGs. Visanji’s group compared resting electrocardiograms of LRRK2-associated Parkinson’s disease (PD) patients, nonmanifesting carriers, noncarriers, and idiopathic PD patients to investigate heart rate variability in LRRK2-associated PD [24]. There is evidence that LRRK2 may act as a biofunctional mediator to correlate heart rate variability and PD [24]. In a molecular mechanistic study, the neural protective role for regulating mitochondrial complex I function and oxidative stress in ischemia/reperfusion was identified [25, 26]. According gene–gene interaction analysis, Timasheva’s group illustrated that the loci of CXCR2 is significantly associated with stroke development in patients with hypertension [26]. In addition, CXCR2 antagonism attenuated neurological deficits and infarct volumes via decreased cerebral neutrophil infiltration and peripheral neutrophilia in a hyperlipidemic ApoE−/− mice stroke model [27]. CALM1 is recognized as a major regulator of cardiac ion-current expression and calcium handling, and a key determinant of cardiac electrical function [28]. Also, specific risk alleles for CALM1 were identified as being associated with increased risk of stroke in studies of coronary heart disease [29]. Thus, there may be a relationship between cardiovascular and nervous system disease and they may arise from loci mutations or gene variants.

Additionally, PITX2, of the pituitary homeobox (Pitx) family, has a critical role in organ morphogenesis and AF maintenance which is related to short stature homeobox 2 (Shox2) [30]. Pitx2 is expressed in the LA and the pulmonary vein, which is considered a substrate and trigger for AF maintenance respectively. However, several experimental data indicate a trend that PITX2 gene expression is silenced during aging in LA samples, suggesting genetic evidence for gene silencing for increased AF susceptibility [30, 31]. Then, miRNAs function analysis and a genomic approach showed that miR-17-92 and miR-106b-25 were associated with Pitx2 expression regulation and are implicated in human AF susceptibility [31]. To reveal relationships between genetic variants and the risk of ischemic stroke, Malik’s group studied PITX2 and ZFHX3 genes and found a significant association with cardioembolic stroke (CE) in a meta-analysis [31, 32]. Similarly, in a genome-wide association study using clinical samples from paroxysmal or persistent AF patients, ZFHX3 was significantly associated with LA enlargement and persistent AF and subsequently with ablation outcomes [33]. Correspondingly, Choi’s group found a significant association between top susceptibility loci (chromosomes 4q25 [PITX2], 16q22 [ZFHX3]) and AF recurrence after ablation in a Korean population, despite no top single nucleotide polymorphisms (SNPs) that predicted clinical recurrence after catheter ablation [34]. A regulatory role for PDZK1IP1 (MAP17) in reactive oxygen species production has been confirmed and is considered as a marker for increased oxidative stress and may be a new therapeutic target [35]. and recent research suggests a potential role for ions channels regulation, linked to the Na+/H+ exchanger 3 and A-kinase anchor protein 2/protein kinase A pathway [36]. However, ZNF566 plays a central role in heart regeneration and repair, and endocardial and epicardial epithelial to mesenchymal transitions [37, 38].

Research suggests potential beneficial effects of miRNA transformation therapy vectored by adenovirus, plasmid, and lentivirus for AF therapy [39]. We found that miR-27a-3p, miR-27b-3p, and miR-494-3p were co-DEGs and may be potential biomarkers of AF-related stroke. Interestingly, Vegter’s group compared heart failure-specific circulating miRNAs in 114 heart failure patients with/without different manifestations of atherosclerotic disease, and reported that miR-18a-5p, miR-27a-3p, miR-199a-3p, miR-223-3p and miR-652-3p abundance were associated with atherosclerosis and cardiovascular-related rehospitalizations [40]. Similarly, Marques and colleagues found that several miRNAs involved in let-7b-5p, let-7c-5p, let-7e-5p, miR-122-5p, and miR-21-5p, and absorbed miR-16-5p, miR-17-5p, miR-27a-3p, and miR-27b-3p are target pathways related to heart failure and considered to be potential biomarkers [41]. In contrast, expression of miR-27b-3p is significantly related to embryonic myogenesis and protein synthesis but miR-494-3p expression is associated with cerebral blood supply and functional recovery in a rat stroke model according to cerebral cortical miRNA profile changes [42, 43].


The hub-genes of LRRK2, CALM1, CXCR4, TLR4, CTNNB1, CXCR2, KIT, and IL1B may be associated with AF recurrence and maintenance and CD19, FGF9, SOX9, GNGT1, and NOG may be associated with stroke. Additionally, co-DEGs of ZNF566, PDZK1IP1, ZFHX3, and PITX2 link AF and stroke. Finally, the top 5 miRNAs for each co-DEGs may be potential biomarkers or therapeutic targets for AF-stroke, especially miR-27a-3p, miR-27b-3p, and miR-494-3p. Thus, there is an association between AF and stroke, and expression of ZNF566, PDZK1IP1, ZFHX3, and PITX2 genes favor AF-related stroke.


Several limitations still detected in our study. First, this study is a microarray analysis that all the results based on gene expression value. However, owing to gene expression may be not directly equivalent to protein expression, the biomarkers of this study should consider as gene, not in protein. In application, assay of PCR and microarray chip may be better for accessing the risk of AF-related stroke. Second, validation should be carried out both in vitro, in vivo and clinical trials. However, as of now the techniques of in vivo or in vitro models for AF and stroke was immature. And the larger, prospective clinical studies may be better to validate our results to some extent.



atrial fibrillation


differentially expressed genes


gene expression omnibus


protein–protein interaction


Gene Ontology


atherosclerosis risk in communities


the Oxford vascular study


atrial fibrillation reduction atrial pacing trial


co-expressed differentially expressed genes


AF-related differentially expressed genes


stroke-related differentially expressed genes




left atrial


right atrial


sinus rhythm


robust multi-array average


perfect match


mismatch probe


false discovery rate




microRNA Data Integration Portal


Kyoto Encyclopedia of Genes and Genomes


Parkinson’s disease


pituitary homeobox

Shox2 :

short stature homeobox 2


cardioembolic stroke


single nucleotide polymorphisms


  1. Rahman F, Kwan GF, Benjamin EJ. Global epidemiology of atrial fibrillation. Nat Rev Cardiol. 2014;11(11):639–54.

    Article  Google Scholar 

  2. Freeman JV, Wang Y, Akar J, Desai N, Krumholz H. National trends in atrial fibrillation hospitalization, readmission, and mortality for medicare beneficiaries, 1999–2013. Circulation. 2017;135(13):1227–39.

    Article  Google Scholar 

  3. Thijs V. Atrial fibrillation detection: fishing for an irregular heartbeat before and after stroke. Stroke. 2017;48(10):2671–7.

    Article  Google Scholar 

  4. Koton S, Schneider AL, Rosamond WD, Shahar E, Sang Y, Gottesman RF, Coresh J. Stroke incidence and mortality trends in US communities, 1987 to 2011. JAMA. 2014;312(3):259–68.

    Article  CAS  Google Scholar 

  5. Naghavi M, Abajobir AA, Abbafati C, Abbas KM, Abd-Allah F, Abera SF, Aboyans V, Adetokunboh O, Afshin A, Agrawal A, Ahmadi A. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980-2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet (London, England). 2017;390(10100):1151–210.

    Article  Google Scholar 

  6. Krishnamurthi RV, Moran AE, Feigin VL, Barker-Collo S, Norrving B, Mensah GA, Taylor S, Naghavi M, Forouzanfar MH, Nguyen G, et al. Stroke prevalence, mortality and disability-adjusted life years in adults aged 20–64 years in 1990–2013: data from the global burden of disease 2013 Study. Neuroepidemiology. 2015;45(3):190–202.

    Article  Google Scholar 

  7. Boehme AK, Esenwa C, Elkind MS. Stroke risk factors, genetics, and prevention. Circ Res. 2017;120(3):472–95.

    Article  CAS  Google Scholar 

  8. Lip GY, Fauchier L, Freedman SB, Van Gelder I, Natale A, Gianni C, Nattel S, Potpara T, Rienstra M, Tse HF, et al. Atrial fibrillation. Nat Rev Dis Primers. 2016;2:16016.

    Article  Google Scholar 

  9. Steinberg BA, Hellkamp AS, Lokhnygina Y, Patel MR, Breithardt G, Hankey GJ, Becker RC, Singer DE, Halperin JL, Hacke W, et al. Higher risk of death and stroke in patients with persistent vs. paroxysmal atrial fibrillation: results from the ROCKET-AF Trial. Eur Heart J. 2015;36(5):288–96.

    Article  Google Scholar 

  10. Yiin GS, Howard DP, Paul NL, Li L, Luengo-Fernandez R, Bull LM, Welch SJ, Gutnikov SA, Mehta Z, Rothwell PM. Age-specific incidence, outcome, cost, and projected future burden of atrial fibrillation-related embolic vascular events: a population-based study. Circulation. 2014;130(15):1236–44.

    Article  CAS  Google Scholar 

  11. Brambatti M, Connolly SJ, Gold MR, Morillo CA, Capucci A, Muto C, Lau CP, Van Gelder IC, Hohnloser SH, Carlson M, et al. Temporal relationship between subclinical atrial fibrillation and embolic events. Circulation. 2014;129(21):2094–9.

    Article  Google Scholar 

  12. Calenda BW, Fuster V, Halperin JL, Granger CB. Stroke risk assessment in atrial fibrillation: risk factors and markers of atrial myopathy. Nat Rev Cardiol. 2016;13(9):549–59.

    Article  CAS  Google Scholar 

  13. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, et al. NCBI GEO: archive for functional genomics data sets–update. Nucleic Acids Res. 2013;41(Database issue):D991–5.

    CAS  PubMed  Google Scholar 

  14. Chen Z, McGee M, Liu Q, Scheuermann RH. A distribution free summarization method for Affymetrix GeneChip arrays. Bioinformatics (Oxford, England). 2007;23(3):321–7.

    Article  CAS  Google Scholar 

  15. Pio G, Malerba D, D’Elia D, Ceci M. Integrating microRNA target predictions for the discovery of gene regulatory networks: a semi-supervised ensemble learning approach. BMC Bioinformatics. 2014;15(Suppl 1):S4.

    Article  Google Scholar 

  16. Wong N, Wang X. miRDB: an online resource for microRNA target prediction and functional annotations. Nucleic Acids Res. 2015;43(Database issue):D146–52.

    Article  CAS  Google Scholar 

  17. Agarwal V, Bell GW. Nam JW. Bartel DP: Predicting effective microRNA target sites in mammalian mRNAs. eLife; 2015. p. 4.

    Google Scholar 

  18. Paraskevopoulou MD, Georgakilas G, Kostoulas N, Vlachos IS, Vergoulis T, Reczko M, Filippidis C, Dalamagas T, Hatzigeorgiou AG. DIANA-microT web server v.50: service integration into miRNA functional analysis workflows. Nucleic Acids Res. 2013;41(Web Server issue):W169–73.

    Article  Google Scholar 

  19. Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, Simonovic M, Roth A, Santos A, Tsafou KP, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(Database issue):D447–52.

    Article  CAS  Google Scholar 

  20. da Huang W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57.

    Article  CAS  Google Scholar 

  21. Croft D, O’Kelly G, Wu G, Haw R, Gillespie M, Matthews L, Caudy M, Garapati P, Gopinath G, Jassal B, et al. Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res. 2011;39(Database issue):D691–7.

    Article  CAS  Google Scholar 

  22. Munoz-Torres M, Carbon S. Get go! retrieving go data using AmiGO, QuickGO, API, Files, and Tools. Methods Mol Biol. 2017;1446:149–60.

    Article  CAS  Google Scholar 

  23. Davis AP, Grondin CJ, Johnson RJ, Sciaky D, King BL, McMorran R, Wiegers J, Wiegers TC, Mattingly CJ. The comparative toxicogenomics database: update 2017. Nucleic Acids Res. 2017;45(D1):D972–d978.

    Article  CAS  Google Scholar 

  24. Visanji NP, Bhudhikanok GS, Mestre TA, Ghate T, Udupa K, AlDakheel A, Connolly BS, Gasca-Salas C, Kern DS, Jain J, et al. Heart rate variability in leucine-rich repeat kinase 2-associated Parkinson’s disease. Mov Disord. 2017;32(4):610–4.

    Article  CAS  Google Scholar 

  25. Liu J, Li T, Thomas JM, Pei Z, Jiang H, Engelender S, Ross CA, Smith WW. Synphilin-1 attenuates mutant LRRK2-induced neurodegeneration in Parkinson’s disease models. Hum Mol Genet. 2016;25(4):672–80.

    Article  Google Scholar 

  26. Timasheva YR, Nasibullin TR, Mustafina OE. The CXCR2 Gene Polymorphism Is Associated with Stroke in Patients with Essential Hypertension. Cerebrovasc Dis Extra. 2015;5(3):124–31.

    Article  Google Scholar 

  27. Herz J, Sabellek P, Lane TE, Gunzer M, Hermann DM, Doeppner TR. Role of neutrophils in exacerbation of brain injury after focal cerebral ischemia in hyperlipidemic mice. Stroke. 2015;46(10):2916–25.

    Article  CAS  Google Scholar 

  28. Gaborit N, Le Bouter S, Szuts V, Varro A, Escande D, Nattel S, Demolombe S. Regional and tissue specific transcript signatures of ion channel genes in the non-diseased human heart. J Physiol. 2007;582(Pt 2):675–93.

    Article  CAS  Google Scholar 

  29. Luke MM, O’Meara ES, Rowland CM, Shiffman D, Bare LA, Arellano AR, Longstreth WT Jr, Lumley T, Rice K, Tracy RP, et al. Gene variants associated with ischemic stroke: the cardiovascular health study. Stroke. 2009;40(2):363–8.

    Article  CAS  Google Scholar 

  30. Wang J, Klysik E, Sood S, Johnson RL, Wehrens XH, Martin JF. Pitx2 prevents susceptibility to atrial arrhythmias by inhibiting left-sided pacemaker specification. Proc Natl Acad Sci U S A. 2010;107(21):9753–8.

    Article  CAS  Google Scholar 

  31. Wang J, Bai Y, Li N, Ye W, Zhang M, Greene SB, Tao Y, Chen Y, Wehrens XH, Martin JF. Pitx2-microRNA pathway that delimits sinoatrial node development and inhibits predisposition to atrial fibrillation. Proc Natl Acad Sci U S A. 2014;111(25):9181–6.

    Article  CAS  Google Scholar 

  32. Malik R, Traylor M, Pulit SL, Bevan S, Hopewell JC, Holliday EG, Zhao W, Abrantes P, Amouyel P, Attia JR, et al. Low-frequency and common genetic variation in ischemic stroke: the METASTROKE collaboration. Neurology. 2016;86(13):1217–26.

    Article  CAS  Google Scholar 

  33. Husser D, Buttner P, Ueberham L, Dinov B, Sommer P, Arya A, Hindricks G, Bollmann A. Association of atrial fibrillation susceptibility genes, atrial fibrillation phenotypes and response to catheter ablation: a gene-based analysis of GWAS data. J Transl Med. 2017;15(1):71.

    Article  Google Scholar 

  34. Choi EK, Park JH, Lee JY, Nam CM, Hwang MK, Uhm JS, Joung B, Ko YG, Lee MH, Lubitz SA, et al. Korean atrial fibrillation (AF) network: genetic variants for AF do not predict ablation success. J Am Heart Assoc. 2015;4(8):e002046.

    Article  Google Scholar 

  35. Carnero A. MAP17 and the double-edged sword of ROS. Biochem Biophys Acta. 2012;1826(1):44–52.

    CAS  PubMed  Google Scholar 

  36. Coady MJ, El Tarazi A, Santer R, Bissonnette P, Sasseville LJ, Calado J, Lussier Y, Dumayne C, Bichet DG, Lapointe JY. MAP17 Is a necessary activator of renal Na+/glucose cotransporter SGLT2. J Am Soc Nephrol. 2017;28(1):85–93.

    Article  CAS  Google Scholar 

  37. Xin M, Olson EN, Bassel-Duby R. Mending broken hearts: cardiac development as a basis for adult heart regeneration and repair. Nat Rev Mol Cell Biol. 2013;14(8):529–41.

    Article  CAS  Google Scholar 

  38. von Gise A, Pu WT. Endocardial and epicardial epithelial to mesenchymal transitions in heart development and disease. Circ Res. 2012;110(12):1628–45.

    Article  Google Scholar 

  39. Friedman RC, Farh KK, Burge CB, Bartel DP. Most mammalian mRNAs are conserved targets of microRNAs. Genome Res. 2009;19(1):92–105.

    Article  CAS  Google Scholar 

  40. Vegter EL, Ovchinnikova ES, van Veldhuisen DJ, Jaarsma T, Berezikov E, van der Meer P, Voors AA. Low circulating microRNA levels in heart failure patients are associated with atherosclerotic disease and cardiovascular-related rehospitalizations. Clin Res Cardiol. 2017;106(8):598–609.

    Article  CAS  Google Scholar 

  41. Marques FZ, Vizi D, Khammy O, Mariani JA, Kaye DM. The transcardiac gradient of cardio-microRNAs in the failing heart. Eur J Heart Fail. 2016;18(8):1000–8.

    Article  CAS  Google Scholar 

  42. Lie S, Morrison JL, Williams-Wyss O, Suter CM, Humphreys DT, Ozanne SE, Zhang S, MacLaughlin SM, Kleemann DO, Walker SK, et al. Impact of periconceptional and preimplantation undernutrition on factors regulating myogenesis and protein synthesis in muscle of singleton and twin fetal sheep. Physiol Rep. 2015;3:8.

    Article  Google Scholar 

  43. Zheng HZ, Jiang W, Zhao XF, Du J, Liu PG, Chang LD, Li WB, Hu HT, Shi XM. Electroacupuncture induces acute changes in cerebral cortical miRNA profile, improves cerebral blood flow and alleviates neurological deficits in a rat model of stroke. Neural Regen Res. 2016;11(12):1940–50.

    Article  Google Scholar 

Download references

Authors’ contributions

RZ, DZ, and LL: takes responsibility for all aspects of the reliability and freedom from bias of the data presented and their discussed interpretation, drafting the article. WS, ZS, BY, BL and QC: takes responsibility for statistical analyses, and interpretation of data. SY and PH: takes responsibility for full text evaluation and guidance, final approval of the version to be submitted. All authors read and approved the final manuscript.


Not applicable.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

GSE79768 and GSE58294 datasets were downloaded from GEO ( [11] and expression profiling arrays were generated using GPL570 (HG‑U133_Plus_2) Affymetrix Human Genome U133 Plus 2.0 Array (Affymetrix, Santa Clara, CA). R packages of “affy”, “affyPLM”, and “limma” (, provided by a bioconductor project [12], were applied to assess GSE79768 and GSE58294 RAW datasets. We applied online prediction tools utilizing microRNA Data Integration Portal (mirDIP) ( [13], miRDB ( [14], TargetScan (v7.1; [15], and DIANA Tools ( [16], to predict potential microRNA targeting.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.


This work was supported by the National Natural Science Foundation of China (NSFC No. 81771165), the Natural Science Foundation Project in Guangdong province, China (Grant Nos. 2016A030313295; 2018A030313172) and Guangzhou Science and Technology project of Major Special Research Topics on International Collaborative Innovation (Grant Nos. 201704030032; 201807010010).

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations


Corresponding authors

Correspondence to Songran Yang or Ping Hua.

Additional files

Additional file 1: S1.

Differentially expressed genes involved in AF samples.

Additional file 2: S2.

Differentially expressed genes involved in stroke.

Additional file 3: S3.

Gene Ontology (GO) terms enrichment analysis of AF- and stroke-related differentially expressed genes.

Additional file 4: S4.

Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis among the AF- and stroke-related differentially expressed genes.

Additional file 5: S5.

Relationship between diseases and co-expressed genes.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zou, R., Zhang, D., Lv, L. et al. Bioinformatic gene analysis for potential biomarkers and therapeutic targets of atrial fibrillation-related stroke. J Transl Med 17, 45 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: