Identification of cancerlectin proteins using hyperparameter optimization in deep learning and DDE profiles

  • Rahu Sikander Center for Computing Research, Department of Computer Science and Software Engineering, Jinnah University for Women, Karachi Pakistan
  • Ali Ghulam Information Technology Centre, Sindh Agriculture University, Tandojam, Sindh Pakistan
  • Jawad Hassan Department of Computer Science and Engineering, Air University Multan Campus, Multan, Punjab Pakistan
  • Laiba Rehman Women university Multan
  • Nida Jabeen College of information and compute, Taiyuan university of Technology 030024, Shanxi Taiyuan China
  • Natasha Iqbal Department of Botany, Government College University of Faisalabad, 37000, Faisalabad Pakistan

Abstract

This study focuses on the development, metastasis, and spread of cancer diseases. It is therefore very desirable to establish deep learning method that classify cancerlectin proteins function efficiently and effectively. We used feature extraction model for physicochemical properties, such as Cancerlectins protein structure, functions, and other compounds. We propose a computational method, namely, cancerlectin two-dimensional convolutional neural networks (Lectin2D-CNN), for predicting cancerlectin proteins. Additionally, we conduct the cross-validation experiments. In addition to this approach, our paper proposes using cancerlectin two-dimensional convolutional neural networks (Lectin2D-CNN) to do image-based classification. The results indicate the proposed method Lectin2D-CNN achieved high accuracy and satisfactory specificity for comparison data sets and was superior to the compared methods. Various classifiers were used to predict cancerlectin protein functions. We developed a prediction model based on the 2D-CNN architecture to increase the recognition sensitivity and accuracy for cancerlectins. Results provide a basis for the estimation of cancer lectins and demonstrate deep learning approaches in in computational proteomics. When the Cross-validation using 2D-CNN random number generator has produced accuracy score obtain 0.7169%, Sensitivity score obtain 0.7012%, Specificity score obtain 0.7326%, MCC score obtain 0.4428%, ROC-AUC score is 0.76%, respectively, then we know we've attained a reliable result. When the Independent datasets using 2D-CNN random number generator has produced accuracy score obtain 0.6375%, Sensitivity score obtain 0.6160%, Specificity score obtain 0.6589%, MCC score obtain 0.2851%, and ROC (auc) score is 0.76%, respectively.

Published
Sep 29, 2023
How to Cite
SIKANDER, Rahu et al. Identification of cancerlectin proteins using hyperparameter optimization in deep learning and DDE profiles. Mehran University Research Journal of Engineering and Technology, [S.l.], v. 42, n. 4, p. 28-40, sep. 2023. ISSN 2413-7219. Available at: <https://publications.muet.edu.pk/index.php/muetrj/article/view/2793>. Date accessed: 15 may 2024. doi: http://dx.doi.org/10.22581/muet1982.2304.2793.
This is an open Access Article published by Mehran University of Engineering and Technolgy, Jamshoro under CCBY 4.0 International License