- Research
- Open access
- Published:
Polygenic risk score of metabolic dysfunction-associated steatotic liver disease amplifies the health impact on severe liver disease and metabolism-related outcomes
Journal of Translational Medicine volume 22, Article number: 650 (2024)
Abstract
Background
Although the inherited risk factors associated with fatty liver disease are well understood, little is known about the genetic background of metabolic dysfunction-associated steatotic liver disease (MASLD) and its related health impacts. Compared to non-alcoholic fatty liver disease (NAFLD), MASLD presents significantly distinct diagnostic criteria, and epidemiological and clinical features, but the related genetic variants are yet to be investigated. Therefore, we conducted this study to assess the genetic background of MASLD and interactions between MASLD-related genetic variants and metabolism-related outcomes.
Methods
Participants from the UK Biobank were grouped into discovery and replication cohorts for an MASLD genome-wide association study (GWAS), and base and target cohorts for polygenic risk score (PRS) analysis. Autosomal genetic variants associated with NAFLD were compared with the MASLD GWAS results. Kaplan–Meier and Cox regression analyses were used to assess associations between MASLD and metabolism-related outcomes.
Results
Sixteen single-nucleotide polymorphisms (SNPs) were identified at genome-wide significance levels for MASLD and duplicated in the replication cohort. Differences were found after comparing these SNPs with the results of NAFLD-related genetic variants. MASLD cases with high PRS had a multivariate-adjusted hazard ratio of 3.15 (95% confidence interval, 2.54–3.90) for severe liver disease (SLD), and 2.81 (2.60–3.03) for type 2 diabetes mellitus. The high PRS amplified the impact of MASLD on SLD and extrahepatic outcomes.
Conclusions
High PRS of MASLD GWAS amplified the impact of MASLD on SLD and metabolism-related outcomes, thereby refining the process of identification of individuals at high risk of MASLD. Supplementation of this process with relevant genetic backgrounds may lead to more effective MASLD prevention and management.
Background
Non-alcoholic fatty liver disease (NAFLD) is the leading cause of cirrhosis and hepatocellular carcinoma (HCC), occurring in 25% of the global population [1]. Over the past ten years, significant conceptual advances have been made in understanding the complex pathophysiological mechanisms of this highly prevalent liver condition [2]. It has been progressively recognized that NAFLD is a multisystem disease where insulin resistance and related metabolic dysfunction play a critical pathogenic role [3]. NAFLD is associated with several liver-related morbidities, including cirrhosis, liver failure and HCC and extrahepatic complications, such as cardiovascular disease (CVD), type 2 diabetes mellitus (T2DM), chronic kidney disease (CKD) [4,5,6]. Recently, after a Delphi consensus process engaged global experts, the new term “metabolic dysfunction-associated steatotic liver disease” (MASLD) was proposed to redefine NAFLD [7]. As interpreted in the Delphi consensus, the clinical characteristics and diagnostic methods of MASLD performed differently from those of NAFLD, with no stigmatization. Compared to NAFLD, which is diagnosed by exclusion criteria, MASLD defined by the presence of hepatic steatosis in the context of co-existing cardiometabolic risk factors like elevated body mass index (BMI), insulin resistance, hypertension or dyslipidaemia [8].
In addition to clinical factors [9,10,11,12], genetic background profoundly influences fatty liver disease (FLD) and its related outcomes, as previous studies have revealed the associations between inherited risk factors and FLD [13, 14]. One such study demonstrated that FLD-related genetic variants amplified the health impact of metabolic dysfunction-associated fatty liver disease (MAFLD) [4]. However, this study applied the genetic variants from previous genome-wide association studies (GWAS) of FLD directly. Additionally, over the past few years, GWAS have revealed more than five single-nucleotide polymorphisms (SNPs) linked to the occurrence and development of NAFLD (e.g., PNPLA3, TM6SF2, MBOAT7, and GCKR); these are classic SNPs that have been well-explored in several independent studies [15, 16].
However, phenotypic variations may lead to differences in genetics, and differences in diagnostic criteria and clinical features exist between NAFLD and MASLD [7]. Although previous studies have identified genetic variants related to NAFLD [17], hepatic lipid accumulation [18, 19], liver enzymes, and various forms of liver diseases [20, 21], little is known regarding the genetic background of MASLD and its health impacts. Therefore, to further explore the progression of MASLD and its impact on severe health outcomes such as severe liver diseases (SLD), coronary artery disease (CAD), and other extrahepatic outcomes, it is necessary to understand the related genetic determinants.
Thus, in this study, we aimed to report a large MASLD GWAS by analyzing the epidemiological and genetic data of the UK Biobank (UKBB) to further assess the genetic background of MASLD and the interactions between MASLD-related gene variants and metabolism-related outcomes.
Methods
Study population
This investigation was conducted using the UKBB resource (application number: 92668). The UKBB is a prospective population-based cohort study that contains data from over 0.5 million participants aged 40–69 years. This data was collected between 2006 and 2010 and contains combined extensive measurements of baseline and genotype data. To assess the genetic background of MASLD, and the interactions between MASLD-related genetic variants and metabolism-related outcomes, we conducted a GWAS for 165,984 MASLD cases and 269,322 controls. Participants were randomly grouped into two cohorts for the GWAS: the discovery (n = 304,714) and replication cohorts (n = 130,592). For polygenic risk score (PRS) analysis, the discovery cohort was assigned as the base cohort, and the replication cohort served as the targeted cohort (Table 1).
Diagnosis of MASLD
For the UKBB MASLD cohort, we calculated the fatty liver index (FLI) for each participant and defined hepatic steatosis as FLI ≥ 60 [22], as imaging or histological data of liver were not available. The diagnostic criteria of MASLD were set with reference to the Delphi consensus. Briefly, MASLD was diagnosed based on FLI-diagnosed hepatic steatosis and presence of one of the following four criteria: (1) overweight or obesity, (2) diagnosis of T2DM or prediabetes, (3) hypertension, or (4) dyslipidemia [7, 23].
Diagnosis of NAFLD
NAFLD was defined based on the International Classification of Diseases (ICD) Ninth and Tenth Revision codes from in-patient hospital diagnoses (Table S1). The diagnostic criteria required evidence of hepatic steatosis in the absence of significant alcohol consumption (< 30 g per day for men and < 20 g per day for women). Additional exclusion criteria included other causes of liver fat accumulation such as viral hepatitis, medication use, or other chronic liver diseases.
GWAS and PRS analyses
In the discovery cohort, a total of 304,714 Caucasian British individuals (165,182 females and 139,532 males) with genotype data meeting MASLD diagnosis criteria were analyzed for GWAS, comprising 116,509 MASLD cases. In the replication cohort, a total of 130,592 Caucasian British individuals comprising 50,041 MASLD cases were analyzed for GWAS (Fig. S1). For NAFLD, 301,846 Caucasian British individuals comprising 3881 NAFLD cases were analyzed.
Two very similar genotyping arrays (Affymetrix UK BiLEVE and UK Biobank Axiom arrays) were used for genotyping participants in the UK Biobank, and imputation was performed using the merged UK10K and 1000 Genomes phase 3 reference panels [24]. Variants was restricted to high-quality autosomal variants with a minor allele frequency > 0.1%, minor allele count > 5%, info score > 0.3, genotype hard call rate > 0.95, and Hardy–Weinberg P > 1 × 10–6. Finally, a total of 12,250,143 and 12,248,938 SNPs were included in the GWAS in the discovery and replication cohorts, respectively. We tested autosomal genetic variants for association with MASLD, assuming an additive allelic effect and using FastGWA-GLMM [25] implemented in Genome-wide Complex Trait Analysis software to account for population structure and cryptic relatedness. All models included the following covariates as fixed effects: sex, age, genotyping array, and principal components 1–20. The genome-wide significance threshold was set as 5 × 10−8. The genome-wide significant variants in the discovery cohort were extracted and analyzed in the replication cohort.
To detect multiple independent association signals at each genome-wide significant MASLD locus, we applied approximate conditional and joint genome-wide association analysis using the software package GCTA v1.91.14. Variants with high collinearity (multiple regression R2 > 0.9) were ignored, and those situated more than 1000 Kbp away were assumed to be independent. A reference sample of 50,000 unrelated White British individuals randomly selected from the UK Biobank was used to model linkage disequilibrium (LD) patterns between variants. The reference genotyping dataset comprised the same variants as those assessed in the GWAS. Conditionally independent variants reaching genome-wide significance were annotated to the physically closest gene using 3DSNPv1.0 [26].
We used LD score regression (LDSC) [27] to estimate the amount of genomic inflation present in the data due to residual population stratification, cryptic relatedness, and other latent sources of bias. PRSice-2 software (version 2.3.3 for R) was used to estimate the PRS using odds ratios (ORs) from GWAS data in the base cohort, and we estimated the individual PRS of MASLD phenotypes for the target cohort.
Outcome data
ICD codes were used to define incident diseases (Table S1), and associations between MASLD and SLD, hypertension, CAD, stroke, heart failure (HF), CKD, T2DM, and overall survival were examined. SLD was defined as a composite diagnosis of cirrhosis, decompensated liver disease (i.e., esophageal varices with or without bleeding, portal hypertension, hepatorenal syndrome, and liver failure), HCC, and/or liver transplantation in any of the aforementioned records.
Statistical analysis
Continuous data were summarized as means and standard deviations (SD) when normally distributed and as median and interquartile range when skewed; categorical data were summarized as frequencies and percentages. For comparison between groups, continuous data were assessed using the independent t-test or Mann–Whitney U test. In contrast, categorical data were evaluated using the chi-square or Fisher’s exact tests.
Cox proportional hazards models were used to assess the health impact of MASLD, and the hazard ratio (HR) derived from the three models was used to quantify the health impact. Model 1 was unadjusted. Model 2 was adjusted for sex, age at recruitment, genotyping chips, and BMI. Model 3 was further adjusted for hypoglycemic drugs, antihypertensive drugs and statins, based on model2 (Table S2). The hypoglycemic drugs included PPAR agonists, biguanides, alpha-glucosidase inhibitors, benzoic acid derivatives, sulfonylureas, and insulin [28]. The antihypertensive drugs include ACE inhibitors, calcium channel blockers, beta-blockers, thiazides, and angiotensin II receptor blockers [29].
We categorized the participants into low- and high-PRS groups using the midpoint of the PRS as the cut-off. The associations between MASLD and morbidities were reassessed by considering the PRS. All analyses were performed using R software (version 4.0.2; R Foundation for Statistical Computing, Vienna, Austria). For the GWAS, significance was set at P < 5 × 10−8; for other analyses, it was set at P < 0.05.
Ethics statement
All participants provided informed consent via electronic signature at the baseline assessment. Ethical approval was granted for the use of the UK Biobank by the North West-Haydock Research Ethics Committee (REC reference: 16/NW/0274). The study protocol conformed to the ethical guidelines of the 1975 Declaration of Helsinki.
Results
Characteristics of study participants
A total of 435,306 participants of White British descent from the UK Biobank were selected for a GWAS of MASLD, which comprised 166,550 subjects diagnosed with MASLD (Table 1, Fig. S1). Table 1 presents the basic characteristics of the study participants. In the discovery cohorts, over half of the participants were women, and the median BMI indicated an overweight status (BMI ≥ 25 kg/m2). The demographic, clinical and biochemical parameters were similar between the discovery and replication cohorts (P > 0.05).
MASLD case–control GWAS
To explore the genetic association of SNPs with phenotypes, we analyzed autosomal SNPs and identified 114 conditionally independent signals associated with MASLD mapping to loci at P < 5 × 10–8 (Table S3). The MASLD of the discovery cohort case–control analysis is presented as a Manhattan plot in Fig. 1A and a QQ plot of the association results is shown in Fig. 1B. Although there was substantial inflation of the test statistics (λ = 1.57), LD score regression indicated most of the inflation to be a result of polygenicity rather than population stratification (LD score regression intercept, 1.0598 (0.0142); ratio, 0.077). After OR analysis in the replication cohort for variants significantly related to MASLD found in the discovery, 16 conditionally independent SNPs were replicated as significantly related (P < 5 × 10–8; Table 2), located in the genes GCKR, LOC124905962, MON1A, MLXIPL, LPL, ZPR1, BDNF, FAIM2, EXOC3L4, FTO, APOBR, BP11-795H16.2, GGT1, and BCRP3.
To further explore the 16 independent SNPs, we analyzed their association with survival, as well as their cell-specific differential effects. After adjusting for sex, age at recruitment, genotyping chips, and BMI, the independent SNPs were not associated with OS (Table S4–6). Additionally, MASLD GWAS was conducted within subgroups based on sex (male and female) and BMI categories (normal weight, overweight, and obese; Table S7–11, Fig. S2–3).
Comparison of GWAS results between MASLD and NAFLD
We conducted a GWAS of NAFLD in the discovery cohort to compare the significant SNPs between MASLD and NAFLD, as well as a GWAS of NAFLD in the discovery cohort. In addition, we extracted previously reported SNPs associated with NAFLD diagnosed using biopsy [20] or imaging [30]. The details of the three NAFLD cohorts are shown in Table S12. The effects of 83 intersectional significant SNPs that reached genome-wide significance for the NAFLD imaging and NAFLD UKBB cohorts were identified; however, only 18 and 24 effect sizes estimated from the GWAS variants for these two cohorts were found in our discovery cohort of MASLD UKBB, respectively (Fig. S4A).
To further explore the distinction of significant genetic variants between MASLD and NAFLD, we annotated the significant SNPs among the GWAS results and compared them between the MASLD UKBB cohort and three NAFLD cohorts using various diagnostic methods. The results showed that the effect at PNPLA3, SAMM50, and PARVB loci reached genome-wide significance only in the NAFLD cohort but not in MASLD. However, other chromosomal loci including TM6SF2, SUGP1, PBX4, ZNF101, ZNF512, LOC124904656, HAPLN4, GATAD2A, C2orf16, and TRIB1 that were identified as associated with NAFLD in various NAFLD cohorts were significantly associated with MASLD (Fig. S4).
Polygenic risk score for MASLD
We used a base cohort to estimate the OR related to MASLD and subsequently applied the OR to estimate PRS in the target cohort (n = 130,592) recruited from the replication cohort. The median PRS of all participants in the target cohort was 1.476e−05 (8.805e−07, 2.871e−05), and we identified 65,296 participants with a high PRS. As shown in Fig. 1C, the MASLD-PRS was normally distributed in both the MASLD and non-MASLD groups. The mean PRS was higher in the MASLD group than the non-MASLD group (P < 0.05), and the mean MASLD-PRS percentile (SD) in the non-MASLD group was 47.97 (11.06) vs 51.15 (11.11) in the MASLD group (P < 0.05; Fig. 1D). In addition, we generated receiver operating characteristic (ROC) curves of the PRS derived from different GWAS results in the target cohort, which provided a measure of their diagnostic power for MASLD. MASLD-PRS outperformed NAFLD-PRS in estimating MASLD status and show that MASLD-PRS achieved the highest area under the ROC curve (AUC = 0.581, 95% confidence interval (CI): 0.577–0.584). (Fig. S4C, D) External validation was performed using the Finngen dataset to calculate the NAFLD-PRS.
PRS amplified the health impact of MASLD
During a median follow-up of 13.8 years, we identified 779, 9083, 2869, 3939, 15,512, 5192, 6508, and 9893 incident events of SLD, CAD, stroke, HF, hypertension, CKD, T2DM, and death, respectively in the replication cohort. MASLD was associated with an increased risk of both SLD and extrahepatic comorbidities. To further verify the amplified effect of high PRS, we conducted Kaplan–Meier and Cox regression analyses. The impact of MASLD on SLD and extrahepatic morbidities was amplified by a high PRS, especially for SLD and T2DM (Figs. 2 and 3). In model 2, compared with those who were non-MASLD, MASLD participants and those with high PRS had a multivariate-adjusted HR for SLD of 3.15 (95% CI 2.54–3.90); for CAD, it was 1.36 (95% CI 1.28–1.44); for stroke, it was 1.24 (95% CI 1.11–1.39); for HF, it was 1.24 (95% CI 1.13–1.36); for hypertension, it was 1.41 (95% CI 1.35–1.48); for CKD, it was 1.41 (95% CI 1.30–1.52); for T2DM, it was 2.81 (95% CI 2.60–3.03); and for overall survival, it was 1.26 (95% CI 1.19–1.34; Table S2, Fig. 3). After further adjustment for medication usage, including hypoglycemic drugs, antihypertensive drugs, and statins for adjustment, these associations were unchanged (Model 3).
Discussion
The GWAS in this study found 16 conditionally independent SNPs to be associated with MASLD at genome-wide significance levels. Differences were found when comparing the results of NAFLD-related genetic variants. Additionally, we constructed a MASLD PRS based on this GWAS and examined the association between the PRS and MASLD. We also revealed the impact of MASLD on SLD, with extrahepatic metabolism-related outcomes shown to be amplified by MASLD PRS.
The complexity of the MASLD phenotype is due to interactions between the components of metabolic syndrome and a genetic predisposition to environmental factors. Further research on the utility of MASLD as a sub-phenotype will strengthen its recognition within the field of fatty liver [31]. However, to our knowledge, few previous studies have explored the genetics of MASLD. He et al. explored the Mendelian randomization of MAFLD and iron status but did so by directly using the results of a previous GWAS of NAFLD [32]. This study’s results identified 16 variants primarily located in genes such as GCKR, MON1A, and LPL that were independently associated with MASLD at genome-wide significance levels and determined differences in genetic variants between MASLD and NAFLD.
Our study corroborated some previous identified variants associated with FLD, such as GCKR, LPL, FTO [17], and MLXIPL [33], which indicated the good validity of the data sources used in the current analysis. Additionally, it is essential to elucidate the variants found in our GWAS of MASLD. GWAS and multiple candidate gene studies have identified GCKR variants as being linked to several metabolic parameters, including triglyceride (TG) levels, insulin resistance, and fasting plasma glucose levels [34,35,36], as well as metabolic disorders like T2DM and dyslipidemia [37, 38]. While previous research has reported GCKR as a disease-predisposing variant for NAFLD [16], this study found an association between GCKR and a decreased risk of MASLD (OR = 0.938 [95% CI 0.927–0.949]. This association could be attributed to the stronger relationship between GCKR variants and improved glucose metabolism. A previous study found that the association between rs3812316 (MLXIPL) and alpha-linolenic acid intake as well as TG levels in Mexican Mestizo women [39], and Hehl et al. highlighted the link between MLXIPL variants and lower serum TG and apolipoprotein-B levels [33]. LPL variants have also been shown to significantly contribute to dyslipidemia, being associated with several conditions including obesity, metabolic syndrome, and atherosclerosis [40,41,42,43]. Additionally, the rs964184 (ZPR1) has been reported to be associated with variations in lipid levels [44], as well as metabolic disorders such as NAFLD, T2DM, and CVD [45,46,47]. Notably, after stratified analysis of the MASLD GWAS by BMI categories or sex, rs964184 (ZPR1) remained significant across all subgroups, highlighting its potential as a biomarker and therapeutic target for effective MASLD management. The rs7132908 (FAIM2) was found to be significantly associated with obesity [48, 49], and FTO was the first GWAS-identified obesity gene [50]. Therefore, these variants might be potential pharmacological targets for treatment of hyperlipidemia and MASLD, especially for high-risk patients.
Moreover, although there have been fewer studies exploring the relationship between BDNF and metabolic disorders, there is evidence of an association between BDNF and the reduction in BMI, waist circumference, glucose, insulin, and risk for T2DM, mainly in Asian populations [51, 52]. MON1A has been identified as having a critical role in controlling macrophage iron metabolism [53], and a growing body of evidence has suggested that macrophage infiltration in adipose tissue causes inflammation and cytokine production and contributes to the development of metabolic decompensation, insulin resistance, and T2DM [54, 55]. GWAS revealed an association between GGT1 and plasma levels of liver enzymes, as well as alcohol-associated liver disease [56, 57]. The role of LOC124905962, EXOC3L4, BP11-795H16.2, and BCRP3 in metabolic disorders has not been well-documented yet. To further investigate the underlying mechanisms, we explored the cell-type-specific expression patterns of these genes using single-cell RNA sequencing data from liver cells in metabolic syndrome mouse models (Fig. S5) [58]. This analysis was conducted using online tools available at the Single Cell Portal (SCP1404). The findings revealed that GCKR and MLXIPL are highly expressed in pericentral and periportal hepatocytes, LPL is specific to Kupffer cells, and MON1A is predominantly expressed in T/NK cells. Currently, there is no MASLD-specific single-cell RNA sequencing dataset available, which underscores a potential area for future research. Overall, more mechanistic studies are necessary to gain a deeper understanding of the role of these variants in metabolic pathways and in the development of steatosis.
Sex and BMI were found to be strongly correlated with the incidence of MASLD [10, 59, 60]. Therefore, we conducted MASLD GWAS within subgroups stratified by sex and BMI categories. In the sex subgroup analysis, SNPs such as rs10889356 (DOCK7) and rs72836561 (CD300LG) showed significant associations exclusively in males, highlighting the roles of lipid and glucose metabolism [61, 62], while SNPs such as rs545608 (SEC16B), previously proven to be female-specific, was exclusively found in the female group [63]. The presence of these SNPs suggests potential gender-specific genetic influences on MASLD. For the BMI subgroups, the identification of significant SNPs such as rs17145750 (MLXIPL) in the normal BMI population suggests that lipid metabolism is also pivotal in the development of MASLD among lean individuals [39]. Further research is needed to validate the identified specific-SNPs and explore their functional implications. These findings underscore the need to consider demographic differences, particularly sex and BMI, in the development of personalized diagnostic and therapeutic strategies.
We constructed the genome-wide PRS and further assessed the interactions between the PRS and health outcomes. Previous studies have reported genetic variations associated with an increased risk of liver disease progression and adverse extrahepatic outcomes [64,65,66]. For example, Liu et al. demonstrated that the genetic risk score derived from FLD-associated variants increases the risk of hepatic events and extrahepatic outcomes [4]. In our study, the GWAS-based PRS of MASLD was shown to have better performance in identifying MASLD than that based on a GWAS of NAFLD, indicating the difference in genetic backgrounds between MASLD and NAFLD. Further analysis showed that PRS derived from GWAS of MASLD amplified the effect of MASLD on SLD and metabolism-related outcomes such as T2DM, CAD, stroke, HF, and CKD, thus complementing the findings from a genetic perspective. Both intrahepatic and extrahepatic outcomes indicate that individuals with MASLD in the high-PRS group are at an increased risk of disease progression. This finding highlights the potential of the PRS to predict MASLD progression. These findings are essential for preventing and managing metabolism-related diseases in patients with MASLD. Future research should focus on longitudinal studies to validate these findings and establish the PRS as a reliable marker for disease monitoring and management. The inclusion of additional cohorts and extended follow-up periods will be essential to the constancy of the PRS across diverse populations.
Our study has several limitations. First, it only included individuals from Caucasian British ethnic backgrounds, compromising the generalizability of its results to other ethnicities. In addition, MASLD is a common phenotype in the European population; thus, our population controls cannot be considered entirely free of MASLD, and there is no known way of investigating this further. Finally, we used serum biomarkers to diagnose fatty liver, but not liver biopsy or imaging data. Although the diagnosis of steatotic liver disease requires supporting biopsy imaging by definition of MASLD, FLI remains a useful diagnostic biomarker for FLD with acceptable accuracy and is widely used in population-based studies.
Conclusions
This study is the first to combine GWAS and PRS to identify the genetic components of MASLD. We found that high PRS amplified the health of patients with MASLD, especially those with SLD and T2DM. Therefore, the construction of PRS may help identify individuals at high risk of MASLD and metabolism-related outcomes. Supplementation of this process with MASLD-related genetics information may lead to a more accurate prediction of disease progression and more effective management of MASLD.
Availability of data and materials
The UK Biobank data used in this study is publicly available online (www.ukbiobank.co.uk; application number 92668). The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Abbreviations
- BMI:
-
Body mass index
- CAD:
-
Coronary artery disease
- CVD:
-
Cardiovascular disease
- CKD:
-
Chronic kidney disease
- ESRD:
-
End-stage renal disease
- FLD:
-
Fatty liver disease
- GWAS:
-
Genome-wide association study
- HCC:
-
Hepatocellular carcinoma
- HF:
-
Heart failure
- HR:
-
Hazard ratio
- ICD:
-
International Classification of Diseases
- LD:
-
Linkage disequilibrium
- LDSC:
-
LD score regression
- MAFLD:
-
Metabolic dysfunction-associated fatty liver disease
- MASLD:
-
Metabolic dysfunction-associated steatotic liver disease
- NAFLD:
-
Non-alcoholic fatty liver disease
- OR:
-
Odds ratio
- PRS:
-
Polygenic risk score
- SLD:
-
Severe liver disease
- SNPs:
-
Single-nucleotide polymorphisms
- T2DM:
-
Type 2 diabetes mellitus
- UKBB:
-
UK Biobank
- WC:
-
Waist circumference
References
Powell EE, Wong VWS, Rinella M. Non-alcoholic fatty liver disease. Lancet. 2021;397:2212–24.
Targher G, Byrne CD, Tilg H. MASLD: a systemic metabolic disorder with cardiovascular and malignant complications. Gut. 2024;73:691–702.
Byrne CD, Targher G. NAFLD: a multisystem disease. J Hepatol. 2015;62(1 Suppl):S47–64.
Liu Z, Suo C, Shi O, Lin C, Zhao R, Yuan H, et al. The health impact of MAFLD, a novel disease cluster of NAFLD, is amplified by the integrated effect of fatty liver disease-related genetic variants. Clin Gastroenterol Hepatol. 2022;20:e855–75.
Kaya E, Yilmaz Y. Metabolic-associated fatty liver disease (MAFLD): a multi-systemic disease beyond the liver. J Clin Transl Hepatol. 2022;10:329–38.
Targher G, Tilg H, Byrne CD. Non-alcoholic fatty liver disease: a multisystem disease requiring a multidisciplinary and holistic approach. Lancet Gastroenterol Hepatol. 2021;6:578–88.
Rinella ME, Lazarus JV, Ratziu V, Francque SM, Sanyal AJ, Kanwal F, et al. A multi-society Delphi consensus statement on new fatty liver disease nomenclature. J Hepatol. 2023. https://doi.org/10.1016/j.jhep.2023.06.003.
Hsu CL, Loomba R. From NAFLD to MASLD: implications of the new nomenclature for preclinical and clinical research. Nat Metab. 2024;6:600–2.
Wang J, Zeng L, Hong C, Cui H, Wang W, Zhu H, et al. Lower creatinine to cystatin C ratio is associated with an increased risk of MASLD: a cross-sectional and prospective study of 368,634 UK Biobank participants. Clin Endocrinol. 2024;100:116–23.
Lonardo A, Nascimbeni F, Ballestri S, Fairweather D, Win S, Than TA, et al. Sex differences in nonalcoholic fatty liver disease: state of the art and identification of research gaps. Hepatology. 2019;70:1457–69.
Hong C, Wu C, Ma P, Cui H, Chen L, Li R, et al. Positive association of nap duration with risk of non-alcoholic fatty liver disease in an occupational population in Guangdong Province, China: a cross-sectional study. BMC Gastroenterol. 2022;22:185.
Ma P, Li Q, Li R, Hong C, Cui H, Zhang Z, et al. A high reticulocyte count is a risk factor for the onset of metabolic dysfunction-associated steatotic liver disease: cross-sectional and prospective studies of data of 310,091 individuals from the UK Biobank. Front Pharmacol. 2024;15:1281095.
Meroni M, Longo M, Fracanzani AL, Dongiovanni P. MBOAT7 down-regulation by genetic and environmental factors predisposes to MAFLD. EBioMedicine. 2020;57: 102866.
Xia M, Zeng H, Wang S, Tang H, Gao X. Insights into contribution of genetic variants towards the susceptibility of MAFLD revealed by the NMR-based lipoprotein profiling. J Hepatol. 2021;74:974–7.
Zhu X, Xia M, Gao X. Update on genetics and epigenetics in metabolic associated fatty liver disease. Ther Adv Endocrinol Metab. 2022;13:20420188221132136.
Eslam M, Valenti L, Romeo S. Genetics and epigenetics of NAFLD and NASH: clinical impact. J Hepatol. 2018;68:268–79.
Ghodsian N, Abner E, Emdin CA, Gobeil É, Taba N, Haas ME, et al. Electronic health record-based genome-wide meta-analysis provides insights on the genetic architecture of non-alcoholic fatty liver disease. Cell Rep Med. 2021;2: 100437.
Speliotes EK, Yerges-Armstrong LM, Wu J, Hernaez R, Kim LJ, Palmer CD, et al. Genome-wide association analysis identifies variants associated with nonalcoholic fatty liver disease that have distinct effects on metabolic traits. PLoS Genet. 2011;7: e1001324.
Parisinos CA, Wilman HR, Thomas EL, Kelly M, Nicholls RC, McGonigle J, et al. Genome-wide and Mendelian randomisation studies of liver MRI yield insights into the pathogenesis of steatohepatitis. J Hepatol. 2020;73:241–51.
Anstee QM, Darlay R, Cockell S, Meroni M, Govaere O, Tiniakos D, et al. Genome-wide association study of non-alcoholic fatty liver and steatohepatitis in a histologically characterised cohort. J Hepatol. 2020;73:505–15.
Alcohol Genome-wide Association (AlcGen) Consortium, Diabetes Genetics Replication and Meta-analyses (DIAGRAM+) Study, Genetic Investigation of Anthropometric Traits (GIANT) Consortium, Global Lipids Genetics Consortium, Genetics of Liver Disease (GOLD) Consortium, International Consortium for Blood Pressure (ICBP-GWAS), et al. Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma. Nat Genet. 2011;43:1131–8.
Bedogni G, Bellentani S, Miglioli L, Masutti F, Passalacqua M, Castiglione A, et al. The Fatty Liver Index: a simple and accurate predictor of hepatic steatosis in the general population. BMC Gastroenterol. 2006;6:33.
Eslam M, Newsome PN, Sarin SK, Anstee QM, Targher G, Romero-Gomez M, et al. A new definition for metabolic dysfunction-associated fatty liver disease: an international expert consensus statement. J Hepatol. 2020;73:202–9.
Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–9.
Jiang L, Zheng Z, Fang H, Yang J. A generalized linear mixed model association tool for biobank-scale data. Nat Genet. 2021;53:1616–21.
Lu Y, Quan C, Chen H, Bo X, Zhang C. 3DSNP: a database for linking human noncoding SNPs to their three-dimensional interacting genes. Nucleic Acids Res. 2017;45:D643–9.
Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh P-R, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet. 2015;47:1228–35.
Addendum. 2. Classification and diagnosis of diabetes: standards of medical care in diabetes-2021. Diabetes Care 2021;44(Suppl. 1):S15–S33. Diabetes Care. 2021;44:2182.
World Health Organization. Guideline for the pharmacological treatment of hypertension in adults. Geneva: World Health Organization; 2021.
Namjou B, Lingren T, Huang Y, Parameswaran S, Cobb BL, Stanaway IB, et al. GWAS and enrichment analyses of non-alcoholic fatty liver disease identify new trait-associated genes and pathways across eMERGE Network. BMC Med. 2019;17:135.
Eslam M, Sanyal AJ, George J, Sanyal A, Neuschwander-Tetri B, Tiribelli C, et al. MAFLD: a consensus-driven proposed nomenclature for metabolic associated fatty liver disease. Gastroenterology. 2020;158:1999-2014.e1.
He H, Liao S, Zeng Y, Liang L, Chen J, Tao C. Causal relationships between metabolic-associated fatty liver disease and iron status: two-sample Mendelian randomization. Liver Int. 2022;42:2759–68.
Hehl L, Creasy KT, Vitali C, Scorletti E, Seeling KS, Vell MS, et al. A genome-first approach to variants in MLXIPL and their association with hepatic steatosis and plasma lipids. Hepatol Commun. 2024;8: e0427.
Yonezawa Y, Takahashi I, Ohseto H, Ueno F, Onuma T, Noda A, et al. Genome-wide association study of nausea and vomiting during pregnancy in Japan: the TMM BirThree Cohort Study. BMC Pregnancy Childbirth. 2024;24:209.
Orho-Melander M, Melander O, Guiducci C, Perez-Martinez P, Corella D, Roos C, et al. Common missense variant in the glucokinase regulatory protein gene is associated with increased plasma triglyceride and C-reactive protein but lower fasting glucose concentrations. Diabetes. 2008;57:3112–21.
Onuma H, Tabara Y, Kawamoto R, Shimizu I, Kawamura R, Takata Y, et al. The GCKR rs780094 polymorphism is associated with susceptibility of type 2 diabetes, reduced fasting plasma glucose levels, increased triglycerides levels and lower HOMA-IR in Japanese population. J Hum Genet. 2010;55:600–4.
Akbarzadeh M, Alipour N, Moheimani H, Zahedi AS, Hosseini-Esfahani F, Lanjanian H, et al. Evaluating machine learning-powered classification algorithms which utilize variants in the GCKR gene to predict metabolic syndrome: Tehran cardio-metabolic genetics study. J Transl Med. 2022;20:164.
Shen H, Pollin TI, Damcott CM, McLenithan JC, Mitchell BD, Shuldiner AR. Glucokinase regulatory protein gene polymorphism affects postprandial lipemic response in a dietary intervention study. Hum Genet. 2009;126:567–74.
Maldonado-González M, Hernández-Nazara ZH, Torres-Castillo N, Martínez-López E, de la Cruz-Color L, Ruíz-Madrigal B. Association between the rs3812316 single nucleotide variant of the MLXIPL gene and alpha-linolenic acid intake with triglycerides in Mexican mestizo women. Nutrients. 2022;14:4726.
Ference BA, Kastelein JJP, Ray KK, Ginsberg HN, Chapman MJ, Packard CJ, et al. Association of triglyceride-lowering LPL variants and LDL-C-lowering LDLR variants with risk of coronary heart disease. JAMA. 2019;321:364–73.
Dron JS, Hegele RA. Genetics of hypertriglyceridemia. Front Endocrinol. 2020;11:455.
Hsu C-S, Chang S-T, Nfor ON, Lee K-J, Ho C-C, Liu C-C, et al. Association of metabolic syndrome with aerobic exercise and LPL rs3779788 polymorphism in Taiwan biobank individuals. Diabetes Metab Syndr Obes. 2021;14:3997–4004.
Darci-Maher N, Alvarez M, Arasu UT, Selvarajan I, Lee SHT, Pan DZ, et al. Cross-tissue omics analysis discovers ten adipose genes encoding secreted proteins in obesity-related non-alcoholic fatty liver disease. EBioMedicine. 2023;92: 104620.
Valente-Frossard TNS, Cruz NRC, Ferreira FO, Belisario AR, Pereira BM, Gomides AFDF, et al. Polymorphisms in genes that affect the variation of lipid levels in a Brazilian pediatric population with sickle cell disease: rs662799 APOA5 and rs964184 ZPR1. Blood Cells Mol Dis. 2020;80: 102376.
Esteve-Luque V, Padró-Miquel A, Fanlo-Maresma M, Corbella E, Corbella X, Pintó X, et al. Implication between genetic variants from APOA5 and ZPR1 and NAFLD severity in patients with hypertriglyceridemia. Nutrients. 2021;13:552.
Guan F, Niu Y, Zhang T, Liu S, Ma L, Qi T, et al. Two-stage association study to identify the genetic susceptibility of a novel common variant of rs2075290 in ZPR1 to type 2 diabetes. Sci Rep. 2016;6:29586.
Paquette M, Fantino M, Bernard S, Baass A. The ZPR1 genotype predicts myocardial infarction in patients with familial hypercholesterolemia. J Clin Lipidol. 2020;14:660–6.
Wu L, Zhao X, Shen Y, Zhang M-X, Yan Y, Hou D, et al. Promoter methylation of fas apoptotic inhibitory molecule 2 gene is associated with obesity and dyslipidaemia in Chinese children. Diabetes Vasc Dis Res. 2015;12:217–20.
Voisin S, Almén MS, Zheleznyakova GY, Lundberg L, Zarei S, Castillo S, et al. Many obesity-associated SNPs strongly associate with DNA methylation changes at proximal promoters and enhancers. Genome Med. 2015;7:103.
Loos RJF, Yeo GSH. The bigger picture of FTO: the first GWAS-identified obesity gene. Nat Rev Endocrinol. 2014;10:51–61.
Han X, Luo Y, Zhang X, Lv C, Sun X, Zhang X, et al. Rs4074134 near BDNF gene is associated with type 2 diabetes mellitus in Chinese Han population independently of body mass index. PLoS ONE. 2013;8: e56898.
Thorleifsson G, Walters GB, Gudbjartsson DF, Steinthorsdottir V, Sulem P, Helgadottir A, et al. Genome-wide association yields new sequence variants at seven loci that associate with measures of obesity. Nat Genet. 2009;41:18–24.
McCreedy RA, Fleet JC. Forward genetics used to identify new gene Mon1a with critical role in controlling macrophage iron metabolism and iron recycling from erythrocytes. Nutr Rev. 2009;67:607–10.
Weisberg SP, McCann D, Desai M, Rosenbaum M, Leibel RL, Ferrante AW. Obesity is associated with macrophage accumulation in adipose tissue. J Clin Invest. 2003;112:1796–808.
Christou KA, Christou GA, Karamoutsios A, Vartholomatos G, Gartzonika K, Tsatsoulis A, et al. Metabolically healthy obesity is characterized by a proinflammatory phenotype of circulating monocyte subsets. Metab Syndr Relat Disord. 2019;17:259–65.
Kim KY, Kim JO, Kim Y-S, Choi J-E, Park J-M, Han K, et al. Genome-wide association of individual vulnerability with alcohol-associated liver disease: a Korean genome and epidemiology study. Hepatology. 2022;75:391–402.
Yuan X, Waterworth D, Perry JRB, Lim N, Song K, Chambers JC, et al. Population-based genome-wide association studies reveal six loci influencing plasma levels of liver enzymes. Am J Hum Genet. 2008;83:520–8.
Arneson D, Majid S, Ahn IS, Kurt Z, Diamante G, Zang G, et al. 1812-P: multitissue single-cell analysis reveals differential tissue, cellular, and molecular sensitivity between fructose and high-fat diets. Diabetes. 2019;68(Supplement_1):1812-P.
Younossi ZM, Koenig AB, Abdelatif D, Fazel Y, Henry L, Wymer M. Global epidemiology of nonalcoholic fatty liver disease-meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology. 2016;64:73–84.
Zeng L, Li Y, Hong C, Wang J, Zhu H, Li Q, et al. Association between fatty liver index and controlled attenuation parameters as markers of metabolic dysfunction-associated fatty liver disease and bone mineral density: observational and two-sample Mendelian randomization studies. Osteoporos Int. 2024;35:679–89.
Pashos EE, Park Y, Wang X, Raghavan A, Yang W, Abbey D, et al. Large, diverse population cohorts of hiPSCs and derived hepatocyte-like cells reveal functional genetic variation at blood lipid-associated loci. Cell Stem Cell. 2017;20:558-570.e10.
Støy J, Kampmann U, Mengel A, Magnusson NE, Jessen N, Grarup N, et al. Reduced CD300LG mRNA tissue expression, increased intramyocellular lipid content and impaired glucose metabolism in healthy male carriers of Arg82Cys in CD300LG: a novel genometabolic cross-link between CD300LG and common metabolic phenotypes. BMJ Open Diabetes Res Care. 2015;3: e000095.
Dai Z, Wang Y, Wu Z, Feng Z, Sun X, Qiu Y, et al. Female-specific susceptibility locus in BOC and SEC16B are associated with adolescent idiopathic scoliosis. Spine. 2021;46:E1178–84.
Unalp-Arida A, Ruhl CE. Patatin-like phospholipase domain-containing protein 3 I148M and liver fat and fibrosis scores predict liver disease mortality in the US population. Hepatology. 2020;71:820.
Gellert-Kristensen H, Richardson TG, Davey Smith G, Nordestgaard BG, Tybjaerg-Hansen A, Stender S. Combined effect of PNPLA3, TM6SF2, and HSD17B13 variants on risk of cirrhosis and hepatocellular carcinoma in the general population. Hepatology. 2020;72:845–56.
Ismaiel A, Dumitrascu DL. Genetic predisposition in metabolic-dysfunction-associated fatty liver disease and cardiovascular outcomes—systematic review. Eur J Clin Invest. 2020;50: e13331.
Acknowledgements
This research has been conducted using the UK Biobank resource under Application Number 92668. The authors would like to thank all participants recorded in the UK Biobank resource for their selfless contribution to the study.
Funding
This study was supported by the National Key R&D Program of China (No. 2020YFC2006400), National Nature Science Foundation of China (No. 81972897, 82172751), Guangzhou Science and Technology Project (No. 202201011183), and Guangdong Natural Science Foundation (No. 2022A1515110656).
Author information
Authors and Affiliations
Contributions
LX, YL, and LZ conceptualized this study. LX, and LL gathered resources. LZ, YL, and CH designed the methodology and conducted investigation. LZ, HBZ, YL, PM, XZ, and CH curated the research data. LZ analyzed and presented the data. HBZ, RL, and JH validated the research. YL and LZ wrote the original draft of the manuscript. LX, JW, HBZ, and HC reviewed and edited the manuscript. LX, and LL were responsible for funding acquisition. LL supervised the study. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Use of the ‘UK Biobank’ in this study was approved by the North West-Haydock Research Ethics Committee (REC reference: 16/NW/0274). The study protocol conformed to the ethical guidelines of Declaration of Helsinki (1975). All study participants provided informed consent via electronic signature during the baseline assessment.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
12967_2024_5478_MOESM1_ESM.pdf
Supplementary Material 1: Fig. S1. Study flow chart. MASLD: metabolic dysfunction-associated steatotic liver disease; GWAS: genome-wide association study.
12967_2024_5478_MOESM2_ESM.pdf
Supplementary Material 2: Fig. S2. Results of GWAS case–control analysis within subgroups based on sex. Manhattan plot (A) and Q–Q plot (B) of genome-wide markers for MASLD in the discovery cohort among male participants. Manhattan plot (C) and Q–Q plot (D) of genome-wide markers for MASLD in the discovery cohort among female participants. MASLD: metabolic dysfunction-associated steatotic liver disease; GWAS: genome-wide association study.
12967_2024_5478_MOESM3_ESM.pdf
Supplementary Material 3: Fig. S3. Results of GWAS case–control analysis within subgroups based on BMI categories. Manhattan plot (A) and Q–Q plot (B) of genome-wide markers for MASLD in the discovery cohort among the participants with normal BMI. Manhattan plot (C) and Q–Q plot (D) of genome-wide markers for MASLD in the discovery cohort among overweight participants. Manhattan plot (E) and Q–Q plot (F) of genome-wide markers for MASLD in the discovery cohort among obese participants. MASLD: metabolic dysfunction-associated steatotic liver disease; GWAS: genome-wide association study; Normal BMI: BMI < 25 kg/m2; overweight: 25 kg/m2 ≤ BMI < 30 kg/m2; obese: BMI ≥ 30 kg/m2.
12967_2024_5478_MOESM4_ESM.pdf
Supplementary Material 4: Fig. S4. Comparison of GWAS and PRS results between MASLD and NAFLD. (A) The intersection of significant SNPs associated with MASLD and published SNPs associated with NAFLD. (B) Comparison of MASLD and NAFLD-related genes. (C) and (D) ROC graphical plot the diagnostic ability of different sources of PRS for MASLD in the replication cohort. NAFLD Biopsy: the summary statistics of GWAS for NAFLD diagnosed with biopsy; NAFLD Image: the summary statistics of GWAS for NAFLD diagnosed with image; NAFLD UKBB: Summary statistics of GWAS for NAFLD in the UKBB discovery cohort; MASLD UKBB: Summary statistics of the GWAS for MASLD in the discovery cohort of UKBB; MASLD: metabolic dysfunction-associated steatotic liver disease; GWAS: genome-wide association study; NAFLD: non-alcoholic fatty liver disease; UKBB: UK Biobank.
12967_2024_5478_MOESM5_ESM.pdf
Supplementary Material 5: Fig. S5. Cell type-specific expression of the genes identified in MASLD GWAS based on the single-cell RNA sequencing data from liver cells in metabolic syndrome mouse models. (A) Dot plots illustrating the expression of genes in each cell cluster. (B) UMAP plot of 11 cell clusters. (C–L) UMAP plots showing the expression of specific genes (GCKR, MON1A, MLXIPL, LPL, BDNF, FAIM2, EXOC3L4, FTO, APOBR, and GGT1) across the cell clusters. UMAP: Uniform Manifold Approximation and Projection. This analysis was conducted using online tools available at the Single Cell Portal. (https://singlecell.broadinstitute.org/single_cell/study/SCP1404/multitissue-single-cell-analysis-reveals-differential-tissue-cellular-and-molecular-sensitivity-between-fructose-and-high-fat-high-sucrose-diets-liver).
12967_2024_5478_MOESM7_ESM.docx
Supplementary Material 7: Table S2. The associations of MASLD in different PRS group with SLD and extrahepatic outcomes.
12967_2024_5478_MOESM9_ESM.docx
Supplementary Material 9: Table S4. The associations of top conditionally independent SNPs with overall survival in the whole cohort.
12967_2024_5478_MOESM10_ESM.docx
Supplementary Material 10: Table S5. The associations of top conditionally independent SNPs with overall survival in the MASLD group.
12967_2024_5478_MOESM11_ESM.docx
Supplementary Material 11: Table S6. The associations of top conditionally independent SNPs with overall survival in the non-MASLD group.
12967_2024_5478_MOESM12_ESM.docx
Supplementary Material 12: Table S7. Summary of conditionally independent SNPs in the MASLD case–control analysis among male participants.
12967_2024_5478_MOESM13_ESM.docx
Supplementary Material 13: Table S8. Summary of conditionally independent SNPs in the MASLD case–control analysis among female participants.
12967_2024_5478_MOESM14_ESM.docx
Supplementary Material 14: Table S9. Summary of conditionally independent SNPs in the MASLD case–control analysis among participants with normal BMI.
12967_2024_5478_MOESM15_ESM.docx
Supplementary Material 15: Table S10. Summary of conditionally independent SNPs in the MASLD case–control analysis among overweight participants.
12967_2024_5478_MOESM16_ESM.docx
Supplementary Material 16: Table S11. Summary of conditionally independent SNPs in the MASLD case–control analysis among obese participants.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Xiao, L., Li, Y., Hong, C. et al. Polygenic risk score of metabolic dysfunction-associated steatotic liver disease amplifies the health impact on severe liver disease and metabolism-related outcomes. J Transl Med 22, 650 (2024). https://doi.org/10.1186/s12967-024-05478-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12967-024-05478-z