Harnessing the evolutionary information on oxygen binding proteins through Support Vector Machines based modules

Loading...
Thumbnail Image

Date

2018-12-01

DOI

Open Access Location

Journal Title

Journal ISSN

Volume Title

Publisher

BioMed Central Ltd

Rights

CC BY 4.0
(c) 2018 The Author/s

Abstract

Objectives: The arrival of free oxygen on the globe, aerobic life is becoming possible. However, it has become very clear that the oxygen binding proteins are widespread in the biosphere and are found in all groups of organisms, including prokaryotes, eukaryotes as well as in fungi, plants, and animals. The exponential growth and availability of fresh annotated protein sequences in the databases motivated us to develop an improved version of "Oxypred" for identifying oxygen-binding proteins. Results: In this study, we have proposed a method for identifying oxy-proteins with two different sequence similarity cutoffs 50 and 90%. A different amino acid composition based Support Vector Machines models was developed, including the evolutionary profiles in the form position-specific scoring matrix (PSSM). The fivefold cross-validation techniques were applied to evaluate the prediction performance. Also, we compared with existing methods, which shows nearly 97% recognition, but, our newly developed models were able to recognize almost 99.99 and 100% in both oxy-50 and 90% similarity models respectively. Our result shows that our approaches are faster and achieve a better prediction performance over the existing methods. The web-server Oxypred2 was developed for an alternative method for identifying oxy-proteins with more additional modules including PSSM, available at http://bioinfo.imtech.res.in/servers/muthu/oxypred2/home.html.

Description

Keywords

Oxygen binding proteins, Hemoglobin, Myoglobin, Leghemoglobin, Erythrocruorin, Hemerythrin, Hemocyanin, Support Vector Machines, Confusion matrix, ROC Analysis

Citation

Muthukrishnan S, Puri M. (2018). Harnessing the evolutionary information on oxygen binding proteins through Support Vector Machines based modules. BMC Research Notes. 11. 1.

Collections

Endorsement

Review

Supplemented By

Referenced By

Creative Commons license

Except where otherwised noted, this item's license is described as CC BY 4.0