Abstract
We have developed a method to partition a set of data into clusters by use of Hidden Markov Models. Given a number of clusters, each of which is represented by one Hidden Markov Model, an iterative procedure finds the combination of cluster models and an assignment of data points to cluster models which maximizes the joint likelihood of the clustering. To reflect the partially non-Markovian nature of the data we also extend classical Hidden Markov Models to use a non-homogeneous Markov chain, where the non-homogeneity is dependent not on the time of the observation but rather on a quantity derived from previous observations.
We present the method and an evaluation on simulated time-series and large data sets of financial time-series from the Public Saving and Loan Banks in Germany.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BACHEM, A. ET AL. (1997): Analyse großer Datenmengen und Clusteralgorithmen im Bausparwesen. In: C. Hipp, W. Eichhorn, W.-R., W.-R. Heilmann (Eds.), Beiträge zum 7. Symposium Geld, Finanzwirtschaft, Banken und Versicherungen, Dezember 1996, no. 257, 955–961.
BAUM, L. E., PETRIE, T. (1966): Statistical inference for probabilistic functions of finite Markov chains. Ann. Math. Statist., 37, 1554–1563.
BAUM, L. E., PETRIE, T., SOULES, G., WEISS, N. (1970): A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann. Math. Statist., 41, 164–171.
BOCK, H. H. (1974): Automatische Klassifikation. Theoretische und praktische Methoden zur Gruppierung und Strukturierung von Daten. Vandenhoeck & Ruprecht.
BURKE, C. J., ROSENBLATT, M. (1958): A Markovian function of a Markov chain. Ann. math. stat., 29, 1112–1120.
EVERITT, B. S. (1993): Cluster Analysis. Edward Arnold, London.
KNAB, B. (2000): Erweiterungen von Hidden-Markov-Modellen zur Analyse ökonomischer Zeitreihen. Ph.D. thesis.
KNAB, B., SCHLIEP, A., STECKEMETZ, B., WICHERN, B., GÄDKE, A., THORANSDOTTIR, D. (2002): The GNU Hidden Markov Model Library. Available from http://www.zpr.uni-koeln.de/hmm.
KNAB, B., SCHRADER, R., WEBER, I., WEINBRECHT, K., WICHERN, B. (1997): Mesoskopisches Simulationsmodell zur Kollektivfortschreibung. Tech. Rep. ZPR97-295, Mathematisches Institut, Universität zu Köln
MACDONALD, I. L., ZUCCHINI, W. (1997): Hidden Markov and other models for discrete-valued time series. Chapman & Hall, London.
MCLACHLAN, G., BASFORD, K. (1988): Mixture Models: Inference and Applications to Clustering. Marcel Dekker, Inc., New York, Basel.
PETRIE, T. (1969): Probabilistic functions of finite state Markov chains. Ann. Math. Statist, 40, 97–115.
RABINER, L. R. (1989): A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE, 77(2), 257–285.
SJOLANDER, K., KARPLUS, K., BROWN, M., HUGHEY, R., KROGH, A., MIAN, I. S., HAUSSLER, D. (1996): Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput Appl Biosci, 12(4), 327–45.
SMYTH, P. (2000): A general probabilistic framework for clustering individuals. Tech. Rep. TR-00-09, University of California, Irvine.
WICHERN, B. (2001): Hidden-Markov-Modelle zur Analyse und Simulation von Finanzzeitreihen. Ph.D. Thesis.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Knab, B., Schliep, A., Steckemetz, B., Wichern, B. (2003). Model-Based Clustering With Hidden Markov Models and its Application to Financial Time-Series Data. In: Schader, M., Gaul, W., Vichi, M. (eds) Between Data Science and Applied Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18991-3_64
Download citation
DOI: https://doi.org/10.1007/978-3-642-18991-3_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40354-8
Online ISBN: 978-3-642-18991-3
eBook Packages: Springer Book Archive