“School of Biological”

Back to Papers Home
Back to Papers of School of Biological

Paper   IPM / Biological / 13185
School of Biological Sciences
  Title:   A Protein Fold Classifier Formed By Fusing Different Modes Of Pseudo Amino Acid Composition Via PSSM
  Author(s): 
1.  K. Kavousi.
2.  B. Moshiri.
3.  M. Sadeghi.
4.  B. N. Araabi.
5.  A. A. Moosavi-Movahedi.
  Status:   Published
  Journal: Computational Biology and Chemistry
  No.:  1
  Vol.:  35
  Year:  2011
  Pages:   1-9
  Supported by:  IPM
  Abstract:
Protein function is related to its chemical reaction to the surrounding environment including other proteins. On the other hand, this depends on the spatial shape and tertiary structure of protein and folding of its constituent components in space. The correct identification of protein domain fold solely using extracted information from protein sequence is a complicated and controversial task in the current computational biology. In this article a combined classifier based on the information content of extracted features from the primary structure of protein has been introduced to face this challenging problem. In the first stage of our proposed two-tier architecture, there are several classifiers each of which is trained with a different sequence based feature vector. Apart from the application of the predicted secondary structure, hydrophobicity, van der Waals volume, polarity, polarizability, and different dimensions of pseudo-amino acid composition vectors in similar studies, the position specific scoring matrix (PSSM) has also been used to improve the correct classification rate (CCR) in this study. Using K-fold cross validation on training dataset related to 27 famous folds of SCOP, the 28 dimensional probability output vector from each evidence theoretic K-NN classifier is used to determine the information content or expertness of corresponding feature for discrimination in each fold class. In the second stage, the outputs of classifiers for test dataset are fused using Sugeno fuzzy integral operator to make better decision for target fold class. The expertness factor of each classifier in each fold class has been used to calculate the fuzzy integral operator weights. Results make it possible to provide deeper interpretation about the effectiveness of each feature for discrimination in target classes for query proteins. more info: http://www.sciencedirect.com/science/article/pii/S1476927110001106

Download TeX format
back to top
scroll left or right