Genetic variability of the core protein in hepatitis C virus genotype 4 in Saudi Arabian patients and its implication on pegylated interferon and ribavirin therapy
Journal of Translational Medicine volume 12, Article number: 91 (2014)
Hepatitis C virus (HCV) shows a remarkable genetic diversity, contributing to its high persistence and varied susceptibilities to antiviral treatment. Previous studies have reported that the substitution of amino acids in the HCV subgenotype 1b core protein in infected patients is associated with a poor response to pegylated interferon and ribavirin (PEG-IFN/RBV) combined therapy.
Because the role of the core protein in HCV genotype 4 infections is unclear, we aimed in this study to compare the full-length core protein sequences of HCV genotype 4 between Saudi patients who responded (SVR) and did not respond (non-SVR) to PEG-IFN/RBV therapy.
Direct sequencing of the full-length core protein and bioinformatics sequence analysis were utilized.
Our data revealed that there is a significant association between core protein mutations, particularly at position 70 (Arg70Gln), and treatment outcome in HCV subgenotype 4d patients. However, HCV subgenotype 4a showed no significant association between core protein mutations and treatment outcome. In addition, amino acid residue at position 91 was well-conserved among studied patients where Cys91 is the dominant amino acid residue.
These findings provide a new insight into HCV genotype 4 among affected Saudi population where the knowledge of HCV core gene polymorphisms is inadequate.
Hepatitis C virus (HCV) infects more than 170 million people worldwide leading to chronic hepatitis, cirrhosis and hepatocellular carcinoma . HCV belongs to the family Flaviviridae and is a member of hepacivirus genus. It is classified into seven genotypes and numerous subtypes [2, 3]. HCV has a single-stranded RNA that encodes a polyprotein which subsequently gets cleaved into number of structural and non-structural proteins. Although the function of each protein has been intensively studied, the point mutations that occur in various positions and cause antiviral drug resistance are largely unknown. Therefore, the study of variation at the nucleotide sequence of HCV, core protein in particular, from different geographical region is important to understand its prevalence in the world as well as its clinical management.
Recently, advances in HCV treatment have led to the development of many direct-acting antiviral (DAA) agents. Early this year, the U.S. Food and Drug Administration (FDA) has approved a new therapy (simeprevir) to treat chronic HCV infection . However, the standard treatment for chronic hepatitis C infection in the developing countries is pegylated interferon (PEG-IFN) plus ribavirin (RBV) where the expected outcome of the treatment is to attain a sustained virological response (SVR) . There are serious side-effects and high medical cost that are associated with PEG-IFN/RBV treatment. As a result, it is important to predict the response to therapy for each individual patient beforehand. Previous studies have shown that the sequence polymorphisms within viral proteins, such as core protein, correlate with IFN-based treatment outcome. For example, substitutions of amino acid 70 and/or 91 in HCV subgenotype 1b core protein are predictors of poor response to PEG-IFN/RBV treatment [6, 7]. The clinical advantage of predicting SVR to PEG-IFN/RBV in patients is that patients with Arg70/Lue91 residues ought to continue the treatment course with predicted positive response. However, in patients who have mutated residues in the core region (Gln70/Met91) would be advised to withdraw from the treatment to avoid unnecessary side-effects. Indeed, if a correlation between HCV core gene mutation(s) and treatment outcome is established, then HCV sequencing can become a noninvasive and economical tool to assess an individual status and response to a treatment.
Although HCV genotype 4 is the cause of approximately 20% of HCV infection worldwide, it is poorly studied . Furthermore, there are limited studies and low informative data from patients in Saudi Arabia who are infected with HCV genotype 4. The aim of this study is to analyze the core protein of HCV genotype 4 from Saudi patient isolates and investigate the association between core protein sequence variations and treatment outcome.
Study patients and treatment regimens
The study protocol was approved by the local ethics committee at King Faisal Specialist and Research Center and written informed consent was obtained from each patient. A total of 115 baseline (i.e., treatment-naïve) patients from three different hospitals (King Khalid University Hospital, King Faisal Specialist Hospital and Research Center, and Riyadh Military Hospital) in Riyadh, Saudi Arabia, were used in this study. Exclusion criteria included co-infection with hepatitis B or human immunodeficiency virus, co-existent autoimmune or metabolic liver disease, active drug-induced hepatitis, decompensated cirrhosis, evidence of severe retinopathy, neoplastic disease, coronary artery or cerebrovascular disease, history of clinically relevant psychiatric disease. The complete treatment protocol used for these patients was previously published . HCV RNA extraction, genotyping and subgenotyping were determined using previously described methods . Herein, we presented the most dominant subgenotypes of HCV genotype 4 that are HCV-4d and HCV-4a in each group (SVR and non-SVR). Due to limited sample size, we excluded 4r, 4n and 4o from data analysis.
HCV sequence alignment and primer design
Complete genome sequences of HCV from different geographical regions were retrieved from the GenBank database (http://blast.ncbi.nlm.nih.gov/Blast.cgi). Multiple sequence alignment of the retrieved sequences was performed using ClustalW module of MegAlign software (DNASTAR, Inc.,) and the consensus sequence was used to design degenerate primers for the core region. Primer sequences and positions are as follows: Forward: 5' TGCTAGCCGAGTAGTGTTGG 3' (positions 246–268) Reverse: 5' CCARTTCATCATCATRTCCCA 3' (position 1298–1318) and the amplicon size is 1045 bp.
Polymerase chain reaction (PCR)
All PCR mixtures had a total volume of 25 μl that contained 1 μl of HCV cDNA, 12.5 μl of GoTaq® Green Master Mix (Promega, Madison, USA), 1 μM of forward and reverse primers, and sterile nuclease-free water. In addition, appropriate positive and negative controls were employed. PCR conditions were as follows: 2 min an initial denaturing step at 95°C, followed by 35 cycles of 30 sec denaturing step at 95°C, 1 min of annealing step at 56°C, and 1 min of extending step at 72°C. A final extension at 72°C for 5 min was performed. PCR amplicons were visualized on a 1.5% agarose gel and stained with ethidium bromide. The positive amplicons were processed further for PCR sequencing using ABI3730XL sequencer (Applied Biosystems, Foster City, CA). To confirm positive results, nucleotide sequences were blasted against NCBI database.
Data analysis and statistics
Sequence chromatograms of 115 full-length core gene sequences were aligned and edited using the Lasergene suite for sequence analysis (DNASTAR, Inc.,) . Nucleotide (573 bp) and amino acid (191 aa) sequences from different patient isolates were aligned using ClustalX module (MegAlign, DNASTAR, Inc.). Full-length core gene sequences of HCV genotype 4 were retrieved from GenBank and used in this study as references. BioEdit program was used to visually display the full-length core protein with genotype corresponding references . In addition, phylogenetic tree was constructed using HCV genotype 4 patient sequences (all subgenotypes were included) and 20 random sequence references. The neighbor-joining method with a bootstrap value of 1,000 replications was employed in constructing the tree using Mega 5.0 software .
Further, detecting the most statistically significant differences between the responders and non-responders groups was done using the Viral Epidemiology Signature Pattern Analysis (VESPA) tool, provided by HCV sequence database . Numerical data were analyzed by Student’s t test using STATA IC/13 software (StataCorpLP, Houston, USA) where a P value of <0.05 was considered statistically significant.
Response to PEG-IFN/RBV therapy
One hundred and fifteen (115) patients with chronic HCV genotype 4 were enrolled in this study. The patients’ clinical characteristics are presented in Table 1. Notably, there was no significant association between the response to treatment and age, gender, weight, liver enzymes, HCV viral load, disease stage, and grade. However, there is a significant association between subgenotypes and treatment response. Indeed, SVR rate in HCV-4a is 58% while the SVR rate in HCV-4d is lower (35%) (P value = 0.02). Twenty four weeks after the completion of 48 weeks of PEG-IFN/RBV combined treatment; patients were tested and then divided to responders (i.e., SVR (48%)) and non-responders (i.e., non-SVR (51%)). HCV genotype 4 mean genetic distance was calculated between SVR and non-SVR patients (Table 2). All reported sequences in this study were deposited in the GenBank and were assigned the following accession numbers (KC143812 - KC143908).
Phylogenetic analysis of SVR and non-SVR patients
Phylogenetic analysis of core sequences provides information about overall relatedness of core gene among HCV genotype 4 isolates. One hundred fifteen sequences of HCV-4 core gene from SVR and non-SVR groups were used to construct the tree (Figure 1). HCV-4 sequences showed no clustering based on response to the treatment but rather they clustered to the respective subgenotypes correctly (i.e. HCV-4a and -4d).
Multiple sequence alignment of the core protein
Figures 2 and 3 showed aligned amino acid residues from each group/subgenotype. Figure 2A represents HCV-4a in SVR patients and revealed that the residue at position 70 (Arg70) and 91 (Cys91) are well conserved among these isolates. On the other hand, our data revealed different point mutations in the core region of HCV-4a in SVR group. At position 60, for example, G (Gly60) mutated to E (Glu60) in 12% of the clinical samples (Figure 2A). Position 71 also has S (Ser71) in 80% of the samples. Position 158 showed substitution of L (Lue158) to V (Val158) in 82% of the samples. In HCV subgenotype 4a non-SVR patients, however, the amino acid alignment figure revealed that position 70 (Arg70) is mutated to (Gln70) in only 18.5% of the clinical samples. However, position 71 showed a higher mutation rate were 70% of the clinical samples have S (Ser71) instead of P (Pro71). Position 60 showed another point mutation that was calculated to be 26% of the clinical samples where E (Glu60) substitutes G (Gly60). There was no significant correlation between core protein sequences and treatment outcome. Notably, our patient sequences showed a 100% mismatch with the reference sequence in positions such as 114 (G114R) and 146 (S146G) in HCV-4a (Figure 2).
Figure 3A showed HCV subgenotype 4d in SVR patients where the amino acid alignment revealed that the residue at position 70 (Arg70) is mutated to (Gln70) in 29% of the clinical samples. Position 71 has a point mutation where P (Pro71) is substituted with S (Ser71) in only 2% while position 157 has a mutation of L (Leu157) to A (Ala157) in 23% of the clinical samples. Moreover, position 162 has mutation of V (Val162) to I (Iso162) in 23%. In non-SVR patients, however, 58% of the clinical samples have mutation at position 70 whereas (Arg70) is mutated to (Gln70). Moreover, at position 157, 29% of the clinical samples showed mutation of L (Leu157) to V (Val157), however, this amino acid substitution is different than the mutated amino acid in SVR group (A vs. V). There was a significant correlation between HCV-4d core protein sequence at position 70 and treatment outcome (P value = 0.02). Moreover, our patient sequences showed a 100% mismatch with the reference sequence in positions such as 12 (Q12K), 20 (T20M) and 74 (K74R) in HCV-4d (Figure 3).
Patterns discovery and recognition
Positional variations of the core protein were compared using Viral Epidemiology Signature Pattern Analysis (VESPA). Results revealed that the variations in HCV 4a SVR and non-SVR patients are not statistically significant (Figure 4A), while, the signature pattern analysis of HCV 4d SVR and non-SVR was statistically significant at position 70 (Arg70Gln) (P value < 0.05) (Figure 4B).
The HCV core gene is the genetic region that encodes for the viral nucleocapsid protein. It consists of 191 amino acid residues that are divided into three domains, an N-terminal hydrophilic domain (D1, residues 1–117), a C-terminal hydrophobic domain (D2, residues 118–170), and the last 21 amino acids that serve as signal peptide for the downstream envelope protein E1 [15, 16]. It has been shown that the core protein is associated with number of cellular proteins and pathways that have direct effect on HCV lifecycle and biology . Also, HCV core protein has been suggested to have a role on antiviral activity of IFN inhibition through interaction with the cellular protein, STAT1 . Therefore, mutations in this protein have the potential to alter the viral structure leading to unexpected functions such as poor response to PEG-IFN/RBV therapy. Previous studies have shown that there is a significant correlation between mutations in the core protein and poor treatment outcome. In particular, patients who had substitutions of Arg70 to Gln70 and/or Leu91 to Met91 showed lower response to PEG-IFN/RBV combined therapy [19, 20]. However, most of these studies have been conducted on Asian populations, especially Japanese patients, who were diagnosed with HCV genotype 1b. Herein, we hypothesized that the amino acid substitutions in HCV genotype 4 (subgenotypes 4a and 4d) core region could correlate with treatment outcome. HCV subgenotype 4d showed that there is a significant association between core protein mutations, particularly at position 70 (Arg70Gln), and treatment outcome. However, amino acid substitutions in HCV-4a showed no associations with treatment outcome. The residue at position 70 of the core protein was Arg70 in most of HCV-4a SVR patient isolates and only 17% of HCV-4a non-SVR patient isolates were mutated to Gln70. Moreover, the residue at position 91 of the core protein was well-conserved among HCV genotype 4.
There are several factors (predictors) that could control the effectiveness of the treatment and such factors can be classified into host and/or viral factors. Host factors include age, gender, patient body weight, ethnicity, alcohol consumption and host genetic variations. Several recent studies have shown that single nucleotide polymorphisms (SNPs) in IL-28B gene region are associated with response to combination therapy with pegylated IFN-α and ribavirin . On the other hand, virus genotypes and viral load have been shown to modulate treatment outcome [6, 22]. Based on previous studies, HCV genotype is the most significant factor affecting treatment responses . While HCV genotype 2 and 3 have the highest rate of SVR to PEG-IFN/RBV treatment (80%), HCV genotype 1 and 4 are showing more resistance to treatment (50-60%) [24, 25]. Notably, the present study revealed that the SVR rate in HCV-4a is higher (58%) than HCV-4d (35%) indicating a role of the subgenotyping in treatment response. The differences in responding to the treatment among different genotypes and subgenotypes suggest a role of the viral sequence variations. It is noteworthy that most previous studies were conducted on Asian population. Thus, further investigations are needed to explore this phenomenon in different ethnic populations.
In recent studies, El-Shamy et al. has investigated 43 Egyptian patients who were infected with HCV genotype 4 (mostly subgenotype 4a) and revealed that no significant correlation between core protein amino acid substitutions at position 70 and/or 91 and treatment outcome . Our finding in regard to HCV-4a is in agreement with the aforementioned report that the substitutions at positions 70 and/or 91 are not associated with antiviral resistance. However, in HCV-4d patient isolates, our data revealed that there is a significant association between core amino acid substitutions, particularly at position 70 and treatment outcome. Phylogenetic analysis and sequence comparison showed that no clustering was observed based on treatment response but rather they grouped to the corresponding subgenotypes correctly (i.e. HCV-4a, −4d).
The present study revealed that HCV-4d has a point mutation at position 70 (Arg70Gln) that is statistically significant. However, no evidence was found in HCV-4a for the effect of core protein polymorphisms, either at position 70 and/or 91, and treatment outcome. Instead, mutations were scattered over the full-length core region with no specific association with drug resistance. Although several possibilities have been proposed to explain the effect of amino acid substitutions of core protein on treatment outcome, the exact mechanism has not been determined. Nonetheless, this study emphasizes the fact that single nucleotide mutations in the core gene could prove helpful in predicting the treatment outcome, at least in sub-genotype 4d-infcted patients.
Chevaliez S, Pawlotsky JM: Hepatitis C virus serologic and virological tests and clinical diagnosis of HCV-related liver disease. Int J Med Sci. 2006, 3: 35-40.
Murphy DG, Willems B, Deschenes M, Hilzenrat N, Mousseau R, Sabbah S: Use of sequence analysis of the NS5b region for routine genotyping of hepatitis C virus with reference to C/E1 and 50 untranslated region sequences. J Clin Micro. 2007, 45: 1102-1112. 10.1128/JCM.02366-06.
Simmonds P, Bukh J, Combet C, Deleage G, Enomoto N, Feinstone S, Halfon P, Inchauspe G, Kuiken C, Maertens G, Mizokami M, Murphy DG, Okamoto H, Pawlotsky JM, Penin F, Sablon E, Shin-I T, Stuyver LJ, Thiel HJ, Viazov S, Weiner AJ, Widell A: Consensus proposals for a unified system of nomenclature of hepatitis C virus genotypes. Hepatology. 2005, 42: 962-973. 10.1002/hep.20819.
Chae HB, Park SM, Youn SJ: Direct-acting antivirals for the treatment of chronic hepatitis C: open issues and future perspectives. Sci World J. 2013, 704912-
Strader DB, Wright T, Thomas DL, Seeff LB: Diagnosis, management, and treatment of hepatitis C. Hepatology. 2004, 39: 1147-1171. 10.1002/hep.20119.
Akuta N, Suzuki F, Kawamura Y, Yatsuji H, Sezaki H, Suzuki Y, Hosaka T, Kobayashi M, Arase Y, Ikeda K, Kumada H: Predictive factors of early and sustained responses to peginterferon plus ribavirin combination therapy in Japanese patients infected with hepatitis C virus genotype 1b: amino acid substitutions in the core region and low-density lipoprotein cholesterol levels. J Hepatol. 2007, 46: 403-410. 10.1016/j.jhep.2006.09.019.
Akuta N, Suzuki F, Hirakawa M, Kawamura Y, Yatsuji H, Sezaki H, Suzuki Y, Hosaka T, Kobayashi M, Saitoh S, Arase Y, Ikeda K, Kumada H: A matched case-controlled study of 48 and 72 weeks of peginterferon plus ribavirin combination therapy in patients infected with HCV genotype 1b in Japan: amino acid substitutions in HCV core region as predictor of sustained virological response. J Med Virol. 2009, 81: 452-458. 10.1002/jmv.21400.
Khattab MA, Ferenci P, Hadziyannis SJ, Colombo M, Manns MP, Almasio PL, Esteban R, Abdo AA, Harrison SA, Ibrahim N, Cacoub P, Eslam M, Lee SS: Management of hepatitis C virus genotype 4: recommendations of an international expert panel. J Hepatol. 2011, 54: 1250-1262. 10.1016/j.jhep.2010.11.016.
Abdo AA, Al-Ahdal MN, Khalid SS, Helmy A, Sanai FM, Alswat K, Al-Hamoudi W, Ali SM, Al-Ashgar HI, Al-Mdani A, Albenmousa A, Al Faleh FZ, Al-Anazi M, Khalaf N, Al-Qahtani A: IL28B polymorphisms predict the virological response to standard therapy in patients with chronic hepatitis C virus genotype 4 infection. Hepatol Int. 2013, 7: 533-538. 10.1007/s12072-013-9421-8.
Sandres-Saune K, Deny P, Pasquier C, Thibaut V, Duverlie G, Izopet J: Determining hepatitis C genotype by analyzing the sequence of the NS5b region. J Virol Methods. 2003, 109: 187-193. 10.1016/S0166-0934(03)00070-3.
Burland TG: DNASTAR's Lasergene sequence analysis software. Methods Mol Biol. 2000, 132: 71-91.
Hall T: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucl Acids Symp Ser. 1999, 41: 95-98.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Korber B, Myers G: Signature pattern analysis: a method for assessingviral sequence relatedness. AIDS Res Hum Retroviruses. 1992, 8: 1549-1560. 10.1089/aid.1992.8.1549.
Maekawa S, Enomoto N: Viral factors influencing the response to the combination therapy of peginterferon plus ribavirin in chronic hepatitis C. J Gastroenterol. 2009, 44: 1009-1015. 10.1007/s00535-009-0126-7.
Harada S, Watanabe Y, Takeuchi K, Suzuki T, Katayama T, Takebe Y, Saito I, Miyamura T: Expression of processed core protein of hepatitis C virus in mammalian cells. J Virol. 1991, 65: 3015-3021.
McLauchlan J: Properties of the hepatitis C virus core protein: a structural protein that modulates cellular processes. J Viral Hepat. 2000, 7: 2-14. 10.1046/j.1365-2893.2000.00201.x.
Lin W, Kim SS, Yeung E, Kamegaya Y, Blackard JT, Kim KA, Holtzman MJ, Chung RT: Hepatitis C virus core protein blocks interferon signaling by interaction with the STAT1 SH2 domain. J Virol. 2006, 80: 9226-9235. 10.1128/JVI.00459-06.
Akuta N, Suzuki F, Sezaki H, Suzuki Y, Hosaka T, Someya T, Kobayashi M, Saitoh S, Watahiki S, Sato J, Matsuda M, Kobayashi M, Arase Y, Ikeda K, Kumada H: Association of amino acid substitution pattern in core protein of hepatitis C virus genotype 1b high viral load and non-virological response to interferon-ribavirin combination therapy. Intervirology. 2005, 48: 372-380. 10.1159/000086064.
El-Shamy A, Kim SR, Ide YH, Sasase N, Imoto S, Deng L, Shoji I, Hotta H: Polymorphisms of hepatitis C virus non-structural protein 5A and core protein and clinical outcome of pegylated-interferon/ribavirin combination therapy. Intervirology. 2011, 55: 1-11.
Schaefer EA, Chung RT: The impact of human gene polymorphisms on HCV infection and disease outcome. Semin Liver Dis. 2011, 31: 375-386. 10.1055/s-0031-1297926.
Reddy KR, Hoofnagle JH, Tong MJ, Lee WM, Pockros P, Heathcote EJ, Albert D, Joh T: Racial differences in responses to therapy with interferon in chronic hepatitis C Consensus Interferon Study Group. Hepatology. 1999, 30: 787-793. 10.1002/hep.510300319.
Simmonds P: Clinical relevance of hepatitis C virus genotypes. Gut. 1997, 40: 291-293.
Fried MW, Shiffman ML, Reddy KR, Smith C, Marinos G, Goncales FL, Haussinger D, Diago M, Carosi G, Dhumeaux D, Craxi A, Lin A, Hoffman J, Yu J: Peginterferon alfa-2a plus ribavirin for chronic hepatitis C virus infection. N Engl J Med. 2002, 347: 975-982. 10.1056/NEJMoa020047.
Sarasin-Filipowicz M: Interferon therapy of hepatitis C: molecular insights into success and failure. Swiss Med Wkly. 2009, 140: 3-11.
Ikeda F, Dansako H, Nishimura G, Mori K, Kawai Y, Ariumi Y, Miyake Y, Takaki A, Nouso K, Iwasaki Y, Ikeda M, Kato N, Yamamoto K: Amino acid substitutions of hepatitis C virus core protein are not associated with intracellular antiviral response to interferon-alpha in vitro. Liver Int. 2010, 30: 1324-1331. 10.1111/j.1478-3231.2010.02299.x.
We would like to extend our gratitude to the Sequencing Core Facility, Department of Genetics, King Faisal Specialist Hospital and Research Center. This study was supported in part by a grant from the King Abdulaziz City for Science and Technology [ARP-30-38]. The Research Advisory Council at the King Faisal Specialist Hospital and Research Center has approved this study RAC # 2090001.
The authors declare that they have no competing interests.
FA carried out the molecular techniques and wrote the manuscript. MA conceived and designed the study. NK carried out the gene sequencing and sequence alignments. AA, FS, HA has contributed substantially by providing the patient specimens and clinical data analysis. ME and AZ have contributed in the bioinformatics analysis. AA conceived and designed this study, interpreted the data, and edited the manuscript. All authors have read and approved the final manuscript.
About this article
Cite this article
Alhamlan, F.S., Al-Ahdal, M.N., Khalaf, N.Z. et al. Genetic variability of the core protein in hepatitis C virus genotype 4 in Saudi Arabian patients and its implication on pegylated interferon and ribavirin therapy. J Transl Med 12, 91 (2014). https://doi.org/10.1186/1479-5876-12-91