A Bayesian Approach to Learning Hidden Markov Model Topology with Applications to Biological Sequence Analysis
Schliep, Alexander (2001) A Bayesian Approach to Learning Hidden Markov Model Topology with Applications to Biological Sequence Analysis. PhD thesis.
Hidden Markov models (HMMs) are a widely and successfully used tool in statistical modeling and statistical pattern recognition. One fundamental problem in applying HMMs is finding the underlying architecture, or topology, particularly when the application domain offers no strong evidence for it, e.g., in black-box modeling. Topology matters both for obtaining good parameter estimates and for performance: a model with "too many" states, and hence too many parameters, requires too much training data, while a model with "not enough" states prevents the HMM from capturing subtle statistical patterns. We have developed a novel algorithm that, given sequence data originating from an ergodic process, infers an HMM, its topology, and its parameters. We introduce a Bayesian approach in which a suitable prior forces generalization while giving the user control with a single prior on one parameter.
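To make the link between state count and parameter count concrete, the following minimal sketch counts the free parameters of an HMM. It is an illustration only, not the thesis's algorithm: it assumes a fully connected topology with discrete emissions, and the function name `hmm_free_parameters` and the alphabet size of 4 (as for DNA) are chosen here for the example.

```python
def hmm_free_parameters(n_states: int, n_symbols: int) -> int:
    """Count free parameters of a fully connected, discrete-emission HMM.

    Assumption for illustration: every transition is allowed, so the
    topology imposes no zeros on the transition matrix.
    """
    # Each row of the transition matrix sums to 1, leaving n_states - 1 free entries.
    transitions = n_states * (n_states - 1)
    # Each state's emission distribution sums to 1, leaving n_symbols - 1 free entries.
    emissions = n_states * (n_symbols - 1)
    # The initial state distribution also sums to 1.
    initial = n_states - 1
    return transitions + emissions + initial


if __name__ == "__main__":
    for n in (2, 5, 10, 20):
        print(n, hmm_free_parameters(n, n_symbols=4))
```

Under these assumptions the transition term grows quadratically in the number of states, which is why a sparser topology can reduce the amount of training data needed.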
Depositing User: Alexander Schliep
Date Deposited: 24 May 2002 00:00
Last Modified: 19 Dec 2011 09:44