HCDP: Hepatitis C Data Bank of Pakistan

  • Muhammad Shahzad, Hazara University, Mansehra, Pakistan
  • Arif Iqbal Umar, Hazara University, Mansehra, Pakistan
  • Syed Hamad Shirazi, Hazara University, Mansehra, Pakistan
  • Muhammad Tariq Pervez, Virtual University of Pakistan
  • Zakir Khan, Hazara University, Mansehra, Pakistan
  • Waqas Yousaf, Hazara University, Mansehra, Pakistan
Keywords: HCDP (Hepatitis C Data Bank of Pakistan), PTM (Post Translational Modification), Genotype, Hepatitis C virus (HCV)


Hepatitis C virus (HCV) is a blood-borne virus with a positive-sense, single-stranded RNA genome and a roughly spherical particle. HCV is a substantial threat to public health, and its frequency is increasing rapidly worldwide. Approximately five online databases related to HCV are available, but all of them mainly cover strains from their own country or a specific demographic region. No HCV database is available for studying the prevalence and distribution of HCV in Pakistan. We propose the Hepatitis C Data Bank of Pakistan (HCDP), an HCV database with an online interface for the Pakistani community. HCDP will be the first database to hold HCV sequences obtained from Pakistani strains together with HCV research publications by Pakistani researchers. In addition to providing annotated HCV sequences of Pakistani strains, HCDP allows the user to submit HCV sequences; find N/O-linked glycosylation, Ser/Tyr/Thr phosphorylation, methylation and ubiquitination sites in protein sequences; search for motif/signature sub-sequence patterns; view protein/nucleotide sequences visually for analysis of different sites; view colour-coded multiple sequence alignments with motif and conserved-region finding; and analyse the graphical structure of a phylogenetic tree. With the Format converter/Fasta generator tool, the user can convert sequences in PhyLip, NEXUS, MSF, CLUSTAL and PIR formats into standard FASTA. It is also observed that genotype 3a (76%) is the most prevalent, followed by genotype 3 (13%). The geographic distribution reveals that the rate of occurrence of HCV is higher in Sindh and KPK than in the other provinces. An annotated database of HCV genome sequences allows researchers to investigate the structural and genetic variability of the sequences efficiently and effectively.
HCDP is a specialized database that focuses mainly on HCV strains from the Pakistani community. It can help virologists in drug design and vaccine development.
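
To make the idea of motif-based site finding on FASTA input concrete, the following is a minimal sketch in Python. It is not HCDP's implementation, which the abstract does not describe. It assumes the widely used N-linked glycosylation sequon Asn-X-Ser/Thr with X not Pro; the function names and the example record are hypothetical.

```python
import re

# Assumed motif: Asn, any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping sequons (e.g. "NNTS") be reported separately.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def parse_fasta(text):
    """Return {header: sequence} from FASTA-formatted text."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line.upper())
    return {h: "".join(parts) for h, parts in records.items()}


def find_n_glyc_sites(sequence):
    """Return (1-based position of Asn, matched tripeptide) for each sequon."""
    return [(m.start() + 1, m.group(1)) for m in SEQUON.finditer(sequence)]


# Hypothetical record for illustration only, not a real HCV sequence.
example = """>hypothetical_envelope_fragment
MNGTAPNPSQ
NNTSK
"""

for name, seq in parse_fasta(example).items():
    print(name, find_n_glyc_sites(seq))
```

A pattern match of this kind only flags candidate positions; whether a site is actually modified depends on structural and cellular context that a sequence scan does not capture.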


Author Biographies

Muhammad Shahzad, Hazara University, Mansehra, Pakistan

Department of Information Technology

Arif Iqbal Umar, Hazara University, Mansehra, Pakistan

Department of Information Technology

Syed Hamad Shirazi, Hazara University, Mansehra, Pakistan

Department of Information Technology

Muhammad Tariq Pervez, Virtual University of Pakistan

Department of Bioinformatics and Computational Biology

Zakir Khan, Hazara University, Mansehra, Pakistan

Department of Information Technology

Waqas Yousaf, Hazara University, Mansehra, Pakistan

Department of Information Technology


How to Cite
Shahzad, M., Umar, A. I., Shirazi, S. H., Pervez, M. T., Khan, Z., & Yousaf, W. (2019, December 31). HCDP: Hepatitis C Data Bank of Pakistan. Journal of Engineering and Applied Sciences, 38(2). https://doi.org/10.25211/jeas.v38i2.3142