An explainable phosphorylation peptide associated with SARS-CoV-2 infection employing a 2D Convolutional Neural Network (2DCNN)

Authors

  • Ali Ghulam Information Technology Centre, Sindh Agriculture University, Tandojam, Sindh, Pakistan, Correspondence Author, garahu@sau.edu.pk
  • Tarique Ali Information Technology Centre, Sindh Agriculture University, Tandojam, Sindh, Pakistan,
  • Taha Hussain Electrical and Electronics Engineering, Uskudar University, Istanbul,Turkey
  • Nida Jabeen School of communications and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  • Taiyaba Qureshi School of Computer Science, University of Science and Technology of China, Hefei, Anhui, China
  • Mujeeb ur Rehman School of Information and Communication Engineering, Guilin University of Electronic Technology, Guilin, China
  • Rahu Sikander Computer Science and Software Engineering Jinnah University for women, Karachi, Pakistan
  • Sultan Ahmed Changshu Institute of Technology, Suzhou. P.R China

DOI:

https://doi.org/10.63163/jpehss.v3i1.157

Keywords:

Phosphorylation sites, SARS-COV-2, 2DCNN, DDE, 2DEEP_IPs.

Abstract

Phosphorylation is a post-translational modification process plays a critical role in the regulation of many cellular processes, including viral infection for example SARS-CoV-2. The SARS-CoV-2 is the virus responsible for causing the COVID-19 pandemic. The identification and characterization of phosphorylation sites on SARS-CoV-2 proteins could provide valuable insights into the mechanisms underlying the virus's pathogenesis and may lead to the development of new therapeutic strategies for COVID-19. The development of computational predictors for phosphorylation site identification has received remarkable attention recently, however these methods limited to find phosphorylation sites in SARS-CoV-2-infected host cells. Viral-host protein-protein interactions cause alterations in phosphorylation and may influence host protein subcellular localization. In this work we proposed a predictor called 2Deep-IPs using two-dimensional convolutional deep neural network (2D-CNN) for identification of particular phosphorylation sites. We extracted the amino acid composition-based features from protein sequence by using dipeptide deviation from expected mean (DDE) descriptor. Further, we used shapely additive explanation’s (SHAP's) algorithm to rank the effective attributes that adequately contain crucial biological information. The proposed model outperformed on top 15 high ranked features. The empirical outcomes of 2Deep-IPs based on 10- fold cross-validation achieved accuracy score 96.71, Sen score obtained 94.46 and Spec score obtain is 99.69 and MCC score obtain 0.939. The results analysis based on independent datasets achieved accuracy score 95.70, Sen score obtained 97.83 and Spec score obtain is 91.89 and MCC score obtain 0.782, respectively. Thus, the anticipated results reveal that 2Deep-IPs outperforms other phosphorylation sites predictors both on cross-validation and independent test respectively. We hope that the proposed Deep-IPs will provide in-depth knowledge to other methods that can be used to predict general phosphorylation sites.

Downloads

Published

2025-03-05

How to Cite

Ali Ghulam, Tarique Ali, Taha Hussain, Nida Jabeen, Taiyaba Qureshi, Mujeeb ur Rehman, Rahu Sikander, & Sultan Ahmed. (2025). An explainable phosphorylation peptide associated with SARS-CoV-2 infection employing a 2D Convolutional Neural Network (2DCNN). Physical Education, Health and Social Sciences, 3(1), 23–46. https://doi.org/10.63163/jpehss.v3i1.157