Loading...
Predicting primary sequence-based protein-protein interactions using a Mercer series representation of nonlinear support vector machine
Chatrabgoun, Omid ; Daneshkhah, Alireza ; Esmaeilbeigi, Mohsen ; ; Alenezi, Ali H. ;
Chatrabgoun, Omid
Daneshkhah, Alireza
Esmaeilbeigi, Mohsen
Alenezi, Ali H.
Editors
Other contributors
Affiliation
Epub Date
Issue Date
2022-11-21
Submitted date
Alternative
Abstract
The prediction of protein-protein interactions (PPIs) is essential to understand the cellular processes from a medical perspective. Among the various machine learning techniques, kernel-based Support Vector Machine (SVM) has been commonly employed to discriminate between interacting and non-interacting protein pairs. The main drawback of employing the kernel-based SVM to datasets with many features, such as the primary sequence-based protein-protein dataset, is the significant increase in computational time of training stage. This increase in computational time is mainly due to the presence of the kernel in solving the quadratic optimisation problem (QOP) involved in nonlinear SVM. In order to fix this issue, we propose a novel and efficient computational algorithm by approximating the kernel-based SVM using a low-rank truncated Mercer series as well as desired. As a result, the QOP for the approximated kernel-based SVM will be very tractable in the sense that there is a significant reduction in computational time of training and validating stages. We illustrate the novelty of the proposed method by predicting the PPIs of “S. Cerevisiae” where the protein features extracted using the multiscale local descriptor (MLD), and then we compare the predictive performance of the proposed low-rank approximation with the existing methods. Finally, the new method results in significant reduction in computational time for predicting PPIs with almost as accuracy as kernel-based SVM.
Citation
Chatrabgoun, O., Daneshkhah, A., Esmaeilbeigi, M., Sohrabi Safa, N., Alenezi, A.H. and Rahman, M.A. (2022) Predicting primary sequence-based protein-protein interactions using a Mercer series representation of nonlinear support vector machine. IEEE Access, 10, pp. 124345-124354.
Publisher
Journal
Research Unit
PubMed ID
PubMed Central ID
Embedded videos
Additional Links
Type
Journal article
Language
en
Description
© 2022 The Authors. Published by IEEE. This is an open access article available under a Creative Commons licence. The published version can be accessed at the following link on the publisher’s website: https://ieeexplore.ieee.org/document/9956991
Series/Report no.
ISSN
2169-3536
EISSN
2169-3536
ISBN
ISMN
Gov't Doc #
Sponsors
The authors extend their appreciation to the Deputyship for Research & Innovation, Ministry of Education in Saudi Arabia for funding this research work through the project number IF-2020-NBU-412.