A Bayesian Approach to Learning Hidden Markov Model Topology with Applications to Biological Sequence Analysis

Schliep, Alexander (2001) A Bayesian Approach to Learning Hidden Markov Model Topology with Applications to Biological Sequence Analysis. PhD thesis.

Abstract

Hidden-Markov-Models (HMMs) are a widely and successfully used tool in statistical modeling and statistical pattern recognition. One fundamental problem in the application of HMMs is finding the underlying architecture or topology, particularly when there is no strong evidence from the application domain --- e.g., when doing black box modeling. Topology is important with regard to good parameter estimates and with regard to performance: A model with ''too many'' states --- and hence too many parameters --- requires too much training data while an model with ''not enough'' states impedes the HMM from capturing subtle statistical patterns. We have developed a novel algorithm that, given sequence data originating from an ergodic process, infers an HMM, its topology and its parameters. We introduce a Bayesian approach, where a suitable prior forces generalization while giving the user control with a single prior on one parameter.


Actions:
Download: [img] Postscript - Accepted Version
Download (1583Kb) | Preview
Download: [img] PDF - Accepted Version
Download (905Kb) | Preview
Export as:
Editorial actions: View Item View Item (Login required)
Deposit Information:
ZAIK Number: zaik2002-432
Depositing User: Alexander Schliep
Date Deposited: 24 May 2002 00:00
Last Modified: 19 Dec 2011 09:44
URI: http://e-archive.informatik.uni-koeln.de/id/eprint/432