Genetic variability of the core protein in hepatitis C virus genotype 4 in Saudi Arabian patients and its implication on pegylated interferon and ribavirin therapy
© Alhamlan et al.; licensee BioMed Central Ltd. 2014
Received: 2 February 2014
Accepted: 20 March 2014
Published: 6 April 2014
Hepatitis C virus (HCV) shows a remarkable genetic diversity, contributing to its high persistence and varied susceptibilities to antiviral treatment. Previous studies have reported that the substitution of amino acids in the HCV subgenotype 1b core protein in infected patients is associated with a poor response to pegylated interferon and ribavirin (PEG-IFN/RBV) combined therapy.
Because the role of the core protein in HCV genotype 4 infections is unclear, we aimed in this study to compare the full-length core protein sequences of HCV genotype 4 between Saudi patients who responded (SVR) and did not respond (non-SVR) to PEG-IFN/RBV therapy.
Direct sequencing of the full-length core protein and bioinformatics sequence analysis were utilized.
Our data revealed that there is a significant association between core protein mutations, particularly at position 70 (Arg70Gln), and treatment outcome in HCV subgenotype 4d patients. However, HCV subgenotype 4a showed no significant association between core protein mutations and treatment outcome. In addition, amino acid residue at position 91 was well-conserved among studied patients where Cys91 is the dominant amino acid residue.
These findings provide a new insight into HCV genotype 4 among affected Saudi population where the knowledge of HCV core gene polymorphisms is inadequate.
Hepatitis C virus (HCV) infects more than 170 million people worldwide leading to chronic hepatitis, cirrhosis and hepatocellular carcinoma . HCV belongs to the family Flaviviridae and is a member of hepacivirus genus. It is classified into seven genotypes and numerous subtypes [2, 3]. HCV has a single-stranded RNA that encodes a polyprotein which subsequently gets cleaved into number of structural and non-structural proteins. Although the function of each protein has been intensively studied, the point mutations that occur in various positions and cause antiviral drug resistance are largely unknown. Therefore, the study of variation at the nucleotide sequence of HCV, core protein in particular, from different geographical region is important to understand its prevalence in the world as well as its clinical management.
Recently, advances in HCV treatment have led to the development of many direct-acting antiviral (DAA) agents. Early this year, the U.S. Food and Drug Administration (FDA) has approved a new therapy (simeprevir) to treat chronic HCV infection . However, the standard treatment for chronic hepatitis C infection in the developing countries is pegylated interferon (PEG-IFN) plus ribavirin (RBV) where the expected outcome of the treatment is to attain a sustained virological response (SVR) . There are serious side-effects and high medical cost that are associated with PEG-IFN/RBV treatment. As a result, it is important to predict the response to therapy for each individual patient beforehand. Previous studies have shown that the sequence polymorphisms within viral proteins, such as core protein, correlate with IFN-based treatment outcome. For example, substitutions of amino acid 70 and/or 91 in HCV subgenotype 1b core protein are predictors of poor response to PEG-IFN/RBV treatment [6, 7]. The clinical advantage of predicting SVR to PEG-IFN/RBV in patients is that patients with Arg70/Lue91 residues ought to continue the treatment course with predicted positive response. However, in patients who have mutated residues in the core region (Gln70/Met91) would be advised to withdraw from the treatment to avoid unnecessary side-effects. Indeed, if a correlation between HCV core gene mutation(s) and treatment outcome is established, then HCV sequencing can become a noninvasive and economical tool to assess an individual status and response to a treatment.
Although HCV genotype 4 is the cause of approximately 20% of HCV infection worldwide, it is poorly studied . Furthermore, there are limited studies and low informative data from patients in Saudi Arabia who are infected with HCV genotype 4. The aim of this study is to analyze the core protein of HCV genotype 4 from Saudi patient isolates and investigate the association between core protein sequence variations and treatment outcome.
Study patients and treatment regimens
The study protocol was approved by the local ethics committee at King Faisal Specialist and Research Center and written informed consent was obtained from each patient. A total of 115 baseline (i.e., treatment-naïve) patients from three different hospitals (King Khalid University Hospital, King Faisal Specialist Hospital and Research Center, and Riyadh Military Hospital) in Riyadh, Saudi Arabia, were used in this study. Exclusion criteria included co-infection with hepatitis B or human immunodeficiency virus, co-existent autoimmune or metabolic liver disease, active drug-induced hepatitis, decompensated cirrhosis, evidence of severe retinopathy, neoplastic disease, coronary artery or cerebrovascular disease, history of clinically relevant psychiatric disease. The complete treatment protocol used for these patients was previously published . HCV RNA extraction, genotyping and subgenotyping were determined using previously described methods . Herein, we presented the most dominant subgenotypes of HCV genotype 4 that are HCV-4d and HCV-4a in each group (SVR and non-SVR). Due to limited sample size, we excluded 4r, 4n and 4o from data analysis.
HCV sequence alignment and primer design
Complete genome sequences of HCV from different geographical regions were retrieved from the GenBank database (http://blast.ncbi.nlm.nih.gov/Blast.cgi). Multiple sequence alignment of the retrieved sequences was performed using ClustalW module of MegAlign software (DNASTAR, Inc.,) and the consensus sequence was used to design degenerate primers for the core region. Primer sequences and positions are as follows: Forward: 5' TGCTAGCCGAGTAGTGTTGG 3' (positions 246–268) Reverse: 5' CCARTTCATCATCATRTCCCA 3' (position 1298–1318) and the amplicon size is 1045 bp.
Polymerase chain reaction (PCR)
All PCR mixtures had a total volume of 25 μl that contained 1 μl of HCV cDNA, 12.5 μl of GoTaq® Green Master Mix (Promega, Madison, USA), 1 μM of forward and reverse primers, and sterile nuclease-free water. In addition, appropriate positive and negative controls were employed. PCR conditions were as follows: 2 min an initial denaturing step at 95°C, followed by 35 cycles of 30 sec denaturing step at 95°C, 1 min of annealing step at 56°C, and 1 min of extending step at 72°C. A final extension at 72°C for 5 min was performed. PCR amplicons were visualized on a 1.5% agarose gel and stained with ethidium bromide. The positive amplicons were processed further for PCR sequencing using ABI3730XL sequencer (Applied Biosystems, Foster City, CA). To confirm positive results, nucleotide sequences were blasted against NCBI database.
Data analysis and statistics
Sequence chromatograms of 115 full-length core gene sequences were aligned and edited using the Lasergene suite for sequence analysis (DNASTAR, Inc.,) . Nucleotide (573 bp) and amino acid (191 aa) sequences from different patient isolates were aligned using ClustalX module (MegAlign, DNASTAR, Inc.). Full-length core gene sequences of HCV genotype 4 were retrieved from GenBank and used in this study as references. BioEdit program was used to visually display the full-length core protein with genotype corresponding references . In addition, phylogenetic tree was constructed using HCV genotype 4 patient sequences (all subgenotypes were included) and 20 random sequence references. The neighbor-joining method with a bootstrap value of 1,000 replications was employed in constructing the tree using Mega 5.0 software .
Further, detecting the most statistically significant differences between the responders and non-responders groups was done using the Viral Epidemiology Signature Pattern Analysis (VESPA) tool, provided by HCV sequence database . Numerical data were analyzed by Student’s t test using STATA IC/13 software (StataCorpLP, Houston, USA) where a P value of <0.05 was considered statistically significant.
Response to PEG-IFN/RBV therapy
Patient characteristics of all patients enrolled in this study
Mean ± SD*
45.82 ± 14.67
49.19 ± 15.26
Male count (%)
Female count (%)
Weight ( kg) *
74.54 ± 28.01
76.04 ± 18.81
Bil (mg/dL) *
10.54 ± 4.89
ALT (IU/L) *
82.2 ± 76.2
AST (IU/L) *
59.27 ± 43.37
53.63 ± 55.81
ALP (IU/L) *
98.4 ± 67.8
116.13 ± 60.98
AFP (ng/mL) *
7.76 ± 18.18.47
HCV load log10 ¶
≤2 count (%)
≥3 count (%)
≥1 count (%)
3 count (%)
Summary of sequence analyses of HCV-4 and mean genetic distance
No. of sequence
No. of ref seq
Mean genetic distance within groups
Mean genetic distance between groups
Phylogenetic analysis of SVR and non-SVR patients
Multiple sequence alignment of the core protein
Figure 3A showed HCV subgenotype 4d in SVR patients where the amino acid alignment revealed that the residue at position 70 (Arg70) is mutated to (Gln70) in 29% of the clinical samples. Position 71 has a point mutation where P (Pro71) is substituted with S (Ser71) in only 2% while position 157 has a mutation of L (Leu157) to A (Ala157) in 23% of the clinical samples. Moreover, position 162 has mutation of V (Val162) to I (Iso162) in 23%. In non-SVR patients, however, 58% of the clinical samples have mutation at position 70 whereas (Arg70) is mutated to (Gln70). Moreover, at position 157, 29% of the clinical samples showed mutation of L (Leu157) to V (Val157), however, this amino acid substitution is different than the mutated amino acid in SVR group (A vs. V). There was a significant correlation between HCV-4d core protein sequence at position 70 and treatment outcome (P value = 0.02). Moreover, our patient sequences showed a 100% mismatch with the reference sequence in positions such as 12 (Q12K), 20 (T20M) and 74 (K74R) in HCV-4d (Figure 3).
Patterns discovery and recognition
The HCV core gene is the genetic region that encodes for the viral nucleocapsid protein. It consists of 191 amino acid residues that are divided into three domains, an N-terminal hydrophilic domain (D1, residues 1–117), a C-terminal hydrophobic domain (D2, residues 118–170), and the last 21 amino acids that serve as signal peptide for the downstream envelope protein E1 [15, 16]. It has been shown that the core protein is associated with number of cellular proteins and pathways that have direct effect on HCV lifecycle and biology . Also, HCV core protein has been suggested to have a role on antiviral activity of IFN inhibition through interaction with the cellular protein, STAT1 . Therefore, mutations in this protein have the potential to alter the viral structure leading to unexpected functions such as poor response to PEG-IFN/RBV therapy. Previous studies have shown that there is a significant correlation between mutations in the core protein and poor treatment outcome. In particular, patients who had substitutions of Arg70 to Gln70 and/or Leu91 to Met91 showed lower response to PEG-IFN/RBV combined therapy [19, 20]. However, most of these studies have been conducted on Asian populations, especially Japanese patients, who were diagnosed with HCV genotype 1b. Herein, we hypothesized that the amino acid substitutions in HCV genotype 4 (subgenotypes 4a and 4d) core region could correlate with treatment outcome. HCV subgenotype 4d showed that there is a significant association between core protein mutations, particularly at position 70 (Arg70Gln), and treatment outcome. However, amino acid substitutions in HCV-4a showed no associations with treatment outcome. The residue at position 70 of the core protein was Arg70 in most of HCV-4a SVR patient isolates and only 17% of HCV-4a non-SVR patient isolates were mutated to Gln70. Moreover, the residue at position 91 of the core protein was well-conserved among HCV genotype 4.
There are several factors (predictors) that could control the effectiveness of the treatment and such factors can be classified into host and/or viral factors. Host factors include age, gender, patient body weight, ethnicity, alcohol consumption and host genetic variations. Several recent studies have shown that single nucleotide polymorphisms (SNPs) in IL-28B gene region are associated with response to combination therapy with pegylated IFN-α and ribavirin . On the other hand, virus genotypes and viral load have been shown to modulate treatment outcome [6, 22]. Based on previous studies, HCV genotype is the most significant factor affecting treatment responses . While HCV genotype 2 and 3 have the highest rate of SVR to PEG-IFN/RBV treatment (80%), HCV genotype 1 and 4 are showing more resistance to treatment (50-60%) [24, 25]. Notably, the present study revealed that the SVR rate in HCV-4a is higher (58%) than HCV-4d (35%) indicating a role of the subgenotyping in treatment response. The differences in responding to the treatment among different genotypes and subgenotypes suggest a role of the viral sequence variations. It is noteworthy that most previous studies were conducted on Asian population. Thus, further investigations are needed to explore this phenomenon in different ethnic populations.
In recent studies, El-Shamy et al. has investigated 43 Egyptian patients who were infected with HCV genotype 4 (mostly subgenotype 4a) and revealed that no significant correlation between core protein amino acid substitutions at position 70 and/or 91 and treatment outcome . Our finding in regard to HCV-4a is in agreement with the aforementioned report that the substitutions at positions 70 and/or 91 are not associated with antiviral resistance. However, in HCV-4d patient isolates, our data revealed that there is a significant association between core amino acid substitutions, particularly at position 70 and treatment outcome. Phylogenetic analysis and sequence comparison showed that no clustering was observed based on treatment response but rather they grouped to the corresponding subgenotypes correctly (i.e. HCV-4a, −4d).
The present study revealed that HCV-4d has a point mutation at position 70 (Arg70Gln) that is statistically significant. However, no evidence was found in HCV-4a for the effect of core protein polymorphisms, either at position 70 and/or 91, and treatment outcome. Instead, mutations were scattered over the full-length core region with no specific association with drug resistance. Although several possibilities have been proposed to explain the effect of amino acid substitutions of core protein on treatment outcome, the exact mechanism has not been determined. Nonetheless, this study emphasizes the fact that single nucleotide mutations in the core gene could prove helpful in predicting the treatment outcome, at least in sub-genotype 4d-infcted patients.
We would like to extend our gratitude to the Sequencing Core Facility, Department of Genetics, King Faisal Specialist Hospital and Research Center. This study was supported in part by a grant from the King Abdulaziz City for Science and Technology [ARP-30-38]. The Research Advisory Council at the King Faisal Specialist Hospital and Research Center has approved this study RAC # 2090001.
- Chevaliez S, Pawlotsky JM: Hepatitis C virus serologic and virological tests and clinical diagnosis of HCV-related liver disease. Int J Med Sci. 2006, 3: 35-40.PubMed CentralView ArticlePubMedGoogle Scholar
- Murphy DG, Willems B, Deschenes M, Hilzenrat N, Mousseau R, Sabbah S: Use of sequence analysis of the NS5b region for routine genotyping of hepatitis C virus with reference to C/E1 and 50 untranslated region sequences. J Clin Micro. 2007, 45: 1102-1112. 10.1128/JCM.02366-06.View ArticleGoogle Scholar
- Simmonds P, Bukh J, Combet C, Deleage G, Enomoto N, Feinstone S, Halfon P, Inchauspe G, Kuiken C, Maertens G, Mizokami M, Murphy DG, Okamoto H, Pawlotsky JM, Penin F, Sablon E, Shin-I T, Stuyver LJ, Thiel HJ, Viazov S, Weiner AJ, Widell A: Consensus proposals for a unified system of nomenclature of hepatitis C virus genotypes. Hepatology. 2005, 42: 962-973. 10.1002/hep.20819.View ArticlePubMedGoogle Scholar
- Chae HB, Park SM, Youn SJ: Direct-acting antivirals for the treatment of chronic hepatitis C: open issues and future perspectives. Sci World J. 2013, 704912-Google Scholar
- Strader DB, Wright T, Thomas DL, Seeff LB: Diagnosis, management, and treatment of hepatitis C. Hepatology. 2004, 39: 1147-1171. 10.1002/hep.20119.View ArticlePubMedGoogle Scholar
- Akuta N, Suzuki F, Kawamura Y, Yatsuji H, Sezaki H, Suzuki Y, Hosaka T, Kobayashi M, Arase Y, Ikeda K, Kumada H: Predictive factors of early and sustained responses to peginterferon plus ribavirin combination therapy in Japanese patients infected with hepatitis C virus genotype 1b: amino acid substitutions in the core region and low-density lipoprotein cholesterol levels. J Hepatol. 2007, 46: 403-410. 10.1016/j.jhep.2006.09.019.View ArticlePubMedGoogle Scholar
- Akuta N, Suzuki F, Hirakawa M, Kawamura Y, Yatsuji H, Sezaki H, Suzuki Y, Hosaka T, Kobayashi M, Saitoh S, Arase Y, Ikeda K, Kumada H: A matched case-controlled study of 48 and 72 weeks of peginterferon plus ribavirin combination therapy in patients infected with HCV genotype 1b in Japan: amino acid substitutions in HCV core region as predictor of sustained virological response. J Med Virol. 2009, 81: 452-458. 10.1002/jmv.21400.View ArticlePubMedGoogle Scholar
- Khattab MA, Ferenci P, Hadziyannis SJ, Colombo M, Manns MP, Almasio PL, Esteban R, Abdo AA, Harrison SA, Ibrahim N, Cacoub P, Eslam M, Lee SS: Management of hepatitis C virus genotype 4: recommendations of an international expert panel. J Hepatol. 2011, 54: 1250-1262. 10.1016/j.jhep.2010.11.016.View ArticlePubMedGoogle Scholar
- Abdo AA, Al-Ahdal MN, Khalid SS, Helmy A, Sanai FM, Alswat K, Al-Hamoudi W, Ali SM, Al-Ashgar HI, Al-Mdani A, Albenmousa A, Al Faleh FZ, Al-Anazi M, Khalaf N, Al-Qahtani A: IL28B polymorphisms predict the virological response to standard therapy in patients with chronic hepatitis C virus genotype 4 infection. Hepatol Int. 2013, 7: 533-538. 10.1007/s12072-013-9421-8.PubMed CentralView ArticlePubMedGoogle Scholar
- Sandres-Saune K, Deny P, Pasquier C, Thibaut V, Duverlie G, Izopet J: Determining hepatitis C genotype by analyzing the sequence of the NS5b region. J Virol Methods. 2003, 109: 187-193. 10.1016/S0166-0934(03)00070-3.View ArticlePubMedGoogle Scholar
- Burland TG: DNASTAR's Lasergene sequence analysis software. Methods Mol Biol. 2000, 132: 71-91.PubMedGoogle Scholar
- Hall T: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucl Acids Symp Ser. 1999, 41: 95-98.Google Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.PubMed CentralView ArticlePubMedGoogle Scholar
- Korber B, Myers G: Signature pattern analysis: a method for assessingviral sequence relatedness. AIDS Res Hum Retroviruses. 1992, 8: 1549-1560. 10.1089/aid.1992.8.1549.View ArticlePubMedGoogle Scholar
- Maekawa S, Enomoto N: Viral factors influencing the response to the combination therapy of peginterferon plus ribavirin in chronic hepatitis C. J Gastroenterol. 2009, 44: 1009-1015. 10.1007/s00535-009-0126-7.View ArticlePubMedGoogle Scholar
- Harada S, Watanabe Y, Takeuchi K, Suzuki T, Katayama T, Takebe Y, Saito I, Miyamura T: Expression of processed core protein of hepatitis C virus in mammalian cells. J Virol. 1991, 65: 3015-3021.PubMed CentralPubMedGoogle Scholar
- McLauchlan J: Properties of the hepatitis C virus core protein: a structural protein that modulates cellular processes. J Viral Hepat. 2000, 7: 2-14. 10.1046/j.1365-2893.2000.00201.x.View ArticlePubMedGoogle Scholar
- Lin W, Kim SS, Yeung E, Kamegaya Y, Blackard JT, Kim KA, Holtzman MJ, Chung RT: Hepatitis C virus core protein blocks interferon signaling by interaction with the STAT1 SH2 domain. J Virol. 2006, 80: 9226-9235. 10.1128/JVI.00459-06.PubMed CentralView ArticlePubMedGoogle Scholar
- Akuta N, Suzuki F, Sezaki H, Suzuki Y, Hosaka T, Someya T, Kobayashi M, Saitoh S, Watahiki S, Sato J, Matsuda M, Kobayashi M, Arase Y, Ikeda K, Kumada H: Association of amino acid substitution pattern in core protein of hepatitis C virus genotype 1b high viral load and non-virological response to interferon-ribavirin combination therapy. Intervirology. 2005, 48: 372-380. 10.1159/000086064.View ArticlePubMedGoogle Scholar
- El-Shamy A, Kim SR, Ide YH, Sasase N, Imoto S, Deng L, Shoji I, Hotta H: Polymorphisms of hepatitis C virus non-structural protein 5A and core protein and clinical outcome of pegylated-interferon/ribavirin combination therapy. Intervirology. 2011, 55: 1-11.View ArticlePubMedGoogle Scholar
- Schaefer EA, Chung RT: The impact of human gene polymorphisms on HCV infection and disease outcome. Semin Liver Dis. 2011, 31: 375-386. 10.1055/s-0031-1297926.View ArticlePubMedGoogle Scholar
- Reddy KR, Hoofnagle JH, Tong MJ, Lee WM, Pockros P, Heathcote EJ, Albert D, Joh T: Racial differences in responses to therapy with interferon in chronic hepatitis C Consensus Interferon Study Group. Hepatology. 1999, 30: 787-793. 10.1002/hep.510300319.View ArticlePubMedGoogle Scholar
- Simmonds P: Clinical relevance of hepatitis C virus genotypes. Gut. 1997, 40: 291-293.PubMed CentralView ArticlePubMedGoogle Scholar
- Fried MW, Shiffman ML, Reddy KR, Smith C, Marinos G, Goncales FL, Haussinger D, Diago M, Carosi G, Dhumeaux D, Craxi A, Lin A, Hoffman J, Yu J: Peginterferon alfa-2a plus ribavirin for chronic hepatitis C virus infection. N Engl J Med. 2002, 347: 975-982. 10.1056/NEJMoa020047.View ArticlePubMedGoogle Scholar
- Sarasin-Filipowicz M: Interferon therapy of hepatitis C: molecular insights into success and failure. Swiss Med Wkly. 2009, 140: 3-11.Google Scholar
- Ikeda F, Dansako H, Nishimura G, Mori K, Kawai Y, Ariumi Y, Miyake Y, Takaki A, Nouso K, Iwasaki Y, Ikeda M, Kato N, Yamamoto K: Amino acid substitutions of hepatitis C virus core protein are not associated with intracellular antiviral response to interferon-alpha in vitro. Liver Int. 2010, 30: 1324-1331. 10.1111/j.1478-3231.2010.02299.x.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.