Large-scale comparative review and assessment of computational methods for phage virion proteins identification

Muhammad Kabir; Chanin Nantasenamat; Sakawrat Kanthawong; Phasit Charoenkwan; Watshara Shoombuatong

doi:10.17179/excli2021-4411

Large-scale comparative review and assessment of computational methods for phage virion proteins identification

Authors

Muhammad Kabir School of Systems and Technology, Department of Computer Science, University of Management and Technology, Lahore, Pakistan, 54770 https://orcid.org/0000-0002-2488-1653
Chanin Nantasenamat Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand, 10700 https://orcid.org/0000-0003-1040-663X
Sakawrat Kanthawong Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand, 40002 https://orcid.org/0000-0003-4068-3646
Phasit Charoenkwan Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, Thailand, 50200 https://orcid.org/0000-0002-8161-6856
Watshara Shoombuatong Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand, 10700. Phone: +66 2 441 4371; Fax: +66 2 441 4380; E-mail: watshara.sho@mahidol.ac.th https://orcid.org/0000-0002-3394-8709

DOI:

https://doi.org/10.17179/excli2021-4411

Keywords:

phage virion protein, bioinformatics, classification, machine learning, feature representation, feature select

Abstract

Phage virion proteins (PVPs) are effective at recognizing and binding to host cell receptors while having no deleterious effects on human or animal cells. Understanding their functional mechanisms is regarded as a critical goal that will aid in rational antibacterial drug discovery and development. Although high-throughput experimental methods for identifying PVPs are considered the gold standard for exploring crucial PVP features, these procedures are frequently time-consuming and labor-intensive. Thusfar, more than ten sequence-based predictors have been established for the in silico identification of PVPs in conjunction with traditional experimental approaches. As a result, a revised and more thorough assessment is extremely desirable. With this purpose in mind, we first conduct a thorough survey and evaluation of a vast array of 13 state-of-the-art PVP predictors. Among these PVP predictors, they can be classified into three groups according to the types of machine learning (ML) algorithms employed (i.e. traditional ML-based methods, ensemble-based methods and deep learning-based methods). Subsequently, we explored which factors are important for building more accurate and stable predictors and this included training/independent datasets, feature encoding algorithms, feature selection methods, core algorithms, performance evaluation metrics/strategies and web servers. Finally, we provide insights and future perspectives for the design and development of new and more effective computational approaches for the detection and characterization of PVPs.

Downloads

Published

2022-01-03

How to Cite

Kabir, M., Nantasenamat, C., Kanthawong, S., Charoenkwan, P., & Shoombuatong, W. (2022). Large-scale comparative review and assessment of computational methods for phage virion proteins identification. EXCLI Journal, 21, 11–29. https://doi.org/10.17179/excli2021-4411

Download Citation

Issue

Vol. 21 (2022)

Section

Review articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish in this journal agree to the following terms:

The authors keep the copyright and grant the journal the right of first publication under the terms of the Creative Commons Attribution license, CC BY 4.0. This licencse permits unrestricted use, distribution and reproduction in any medium, provided that the original work is properly cited.
The use of general descriptive names, trade names, trademarks, and so forth in this publication, even if not specifically identified, does not imply that these names are not protected by the relevant laws and regulations.
Because the advice and information in this journal are believed to be true and accurate at the time of publication, neither the authors, the editors, nor the publisher accept any legal responsibility for any errors or omissions presented in the publication. The publisher makes no guarantee, express or implied, with respect to the material contained herein.
The authors can enter into additional contracts for the non-exclusive distribution of the journal's published version by citing the initial publication in this journal (e.g. publishing in an institutional repository or in a book).

Large-scale comparative review and assessment of computational methods for phage virion proteins identification

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Categories

License

Most read articles by the same author(s)

Make a Submission

Categories from 2022 onwards

ifado

Current Issue

User Login

impactfactor

EXCLI Journal has been added to

Impact Factor