IPM - Institute for Research in Fundamental Sciences

“School of Biological Sciences”

Back to Papers Home
Back to Papers of School of Biological Sciences

Paper IPM / Biological Sciences / 13170

School of Biological Sciences

Title:

A Modified Hidden Markov Model And Its Application In Protein Secondary Structure Prediction

Author(s):

1.	sima
2.	V. Rezaei.
3.	H.Pezeshk.
4.	D. Matthews.

Status:

Published

Journal:

Proteomics Bioinform

No.:

Vol.:

Year:

2012

Supported by:

IPM

Abstract:

One of the important tools in analyzing and modeling biological data is the Hidden Markov Model (HMM), which is used for gene prediction, protein secondary structure and other essential tasks. An HMM is a stochastic process in which a hidden Markov chain called; the chain of states, emits a sequence of observations. Using this sequence, various questions about the underlying emission generation scheme can be addressed. Applying an HMM to any particular situation is an attempt to infer which state in the chain emits an observation. This is usually called posterior decoding. In general, the emissions are assumed to be conditionally independent from each other. In this work we consider some dependencies among the states and emissions. The aim of our research is to study a certain relationship among emissions, with a focus on the Markov property. We assume that the probability of observing an emission depends not only on the current state but also on the previous state and one of the previous emissions. We also use additional environmental information, and classify amino acids into three groups, using the Relative Solvent Accessibility (RSA). We also investigate how this modification might change the current algorithms for ordinary HMMs, and introduce modified Viterbi and Forward-Backward algorithms for the new model. We apply our proposed model to an actual dataset concerning prediction of the protein secondary structure and demonstrate improved accuracy compared to the ordinary HMM. In particular, the overall accuracy of our modified HMM, which uses the RSA information, is 63.95

more info: http://www.omicsonline.org/0974-276X/JPB-05-024.php

Download TeX format

“School of Biological Sciences”

People

Schools

Centers

Groups

E-Services

Publications