Home | Issues | Profile | History | Submission | Review
Vol: 57(71) No: 3 / September 2012

A Comprehensive Approach of Multinomial Hidden Markov Models
Marina Cidota
University of Bucharest, Faculty of Mathematics and Computer Science, Department of Computer Science, Str. Academiei No.14, Bucharest, 010014 Romania, e-mail: cidota@fmi.unibuc.ro
Monica Dumitrescu
University of Bucharest, Faculty of Mathematics and Computer Science, Department of Mathematics, Str. Academiei No.14, Bucharest, 010014 Romania, e-mail: mdumi@fmi.unibuc.ro


Keywords: communication channel, hidden Markov model, multinomial response model, nested optimization algorithms

Abstract
The paper deals with stochastic modeling for communication systems that are influenced by an external ”catalyzer” (e.g. environmental or experimental conditions). For an one dimensional catalyzer, a model named the Logistic HMM (LHMM) was already introduced by the authors. A new extension of Hidden Markov models is now considered, and all three aspects of stochastic modeling are addressed: building the model, its training and model validation. In the current model, the catalyzer can have multiple components and its influence over the system is expressed through multinomial link functions. We introduce a complex training algorithm based on the Baum-Welch scheme, including nested algorithms for optimization such as the Newton - Raphson and the Expectation-Maximization technique for updating the parameters of the model. The validation step is achieved by means of a statistical test of significance that allows us to decide how many and which catalyzers should be included in the Multinomial Hidden Markov model (MHMM). The likelihood ratio test that we apply for model validation is based on the fact that the probability distributions belong to the Exponential class, hence the model verifies the usual regularity conditions and the maximum-likelihood estimates are asymptotically normal. In order to explore the convergence of the proposed training procedure and the validation of the model, a simulation study is provided.

References
L. E. Baum and J. Eagon, “An inequality with applications to statistical prediction for functions of markov processes and to a model for ecology,” Bull. Amer. Math. Soc., vol. 73, pp. 360 –363, 1967.
[2] L. E. Baum, T. Petrie, G. Soules and N. Weiss, “A maximization technique occurring in the statistical analysis of probabilistic functions of markov chains,” Ann. Math. Statistic., vol. 41, pp. 164 – 171, 1970.
[3] S. Michalek, H. Lerche, M. Wagner, N. Mitrovic, M. Schiebe, F. Lehmann-Horn and J. Timmer, “On identification of na(+) channel gating schemes using moving-average filtered hidden markov models,” Eur. Biophys. J.,vol. 28, pp. 605 – 609, 1999.
[4] F. Qin, A. Auerbach and F. Sachs, “A direct optimization approach to hidden markov modeling for single channel kinetics,” Biophys. J., vol. 79, pp. 1915 – 1927, 2000.
[5] F. Qin, A. Auerbach and F. Sachs., “Hidden markov modeling for single channel kinetics with filtering and correlated noise,” Biophys. J., vol. 79, pp. 1928 – 1944, 2000.
[6] J. D. Becker, J. Honerkamp, J. Hirsch, U. Fröbe, E. Schlatter and R. Greger, “Analyzing ion channels with hidden markov models,” Pflugers Arch., vol. 426, pp. 328 – 332, 1994.
[7] S. H. Chung, J. Moore, L. Xia, L. S. Premkumar and P. W. Gage, “Characterization of single channel currents using digital signal processing techniques based on hidden markov models,” Phil. Trans. R. Soc. London B., vol. 329, pp. 265 – 285, 1990.
[8] L. Venkataramanan, J. L. Walsh, R. Kuc and F. J. Sigworth, “Identification of hidden markov models for ion channel currents. Part i:colored background noise,” IEEE Trans. Signal Proc., vol. 46, pp. 1901 – 1915, 1998.
[9] L. Venkataramanan and F. J. Sigworth, “Applying hidden markov models to the analysis of single ion channel activity,” Biophys. J., vol. 82, no. 4, pp. 1930 – 1942, 2002.
[10] M. Gollery, Handbook of Hidden Markov Models in Bioinformatics. Boca Raton, FL: Chapman & Hall / CRC Press, 2008.
[11] M. Cidota and M. Dumitrescu, “On a logistic-hidden markov model for communication systems,” Applied Artificial Intelligence, vol. 26, no. 03, pp. 229 – 245, 2012.
[12] A. Agresti, Categorical Data Analysis. New York: Wiley, 1990.
[13] P. McCullagh and J. A. Nelder, Generalized Linear Models. London: Chapman and Hall, 1989.
[14] M. Cidota and M. Dumitrescu, “A multinomial - hidden markov model for communication systems influenced by external factors,” Proc. IEEE 7th International Symposium on Applied Computational Intelligence and Informatics, Timisoara, Romania, 2012.
[15] M. Cidota and M. Dumitrescu, “A multinomial hidden Markov model and its training by a combined iterative procedure,” unpublished, 2012.
[16] X. Huang, A. Acero and H. W. Hon, Spoken Language Processing-A Guide to Theory, Algorithm and System Development. Upper Saddle River, New Jersey: Prentice Hall PTR, 2001.
[17] L. R. Rabiner, “A tutorial on hidden markov models and selected applications in speech recognition,” in Proc. IEEE, 77, 1989, pp. 257–285.