Model-Based Clustering With Hidden Markov Models and its Application to Financial Time-Series Data

Knab, Bernhard; Schliep, Alexander; Steckemetz, Barthel; Wichern, Bernd

doi:10.1007/978-3-642-18991-3_64

Bernhard Knab⁷,
Alexander Schliep⁸,
Barthel Steckemetz⁹ &
…
Bernd Wichern¹⁰

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

868 Accesses
16 Citations

Abstract

We have developed a method to partition a set of data into clusters by use of Hidden Markov Models. Given a number of clusters, each of which is represented by one Hidden Markov Model, an iterative procedure finds the combination of cluster models and an assignment of data points to cluster models which maximizes the joint likelihood of the clustering. To reflect the partially non-Markovian nature of the data we also extend classical Hidden Markov Models to use a non-homogeneous Markov chain, where the non-homogeneity is dependent not on the time of the observation but rather on a quantity derived from previous observations.

We present the method and an evaluation on simulated time-series and large data sets of financial time-series from the Public Saving and Loan Banks in Germany.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

BACHEM, A. ET AL. (1997): Analyse großer Datenmengen und Clusteralgorithmen im Bausparwesen. In: C. Hipp, W. Eichhorn, W.-R., W.-R. Heilmann (Eds.), Beiträge zum 7. Symposium Geld, Finanzwirtschaft, Banken und Versicherungen, Dezember 1996, no. 257, 955–961.
Google Scholar
BAUM, L. E., PETRIE, T. (1966): Statistical inference for probabilistic functions of finite Markov chains. Ann. Math. Statist., 37, 1554–1563.
Article MathSciNet MATH Google Scholar
BAUM, L. E., PETRIE, T., SOULES, G., WEISS, N. (1970): A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann. Math. Statist., 41, 164–171.
Article MathSciNet MATH Google Scholar
BOCK, H. H. (1974): Automatische Klassifikation. Theoretische und praktische Methoden zur Gruppierung und Strukturierung von Daten. Vandenhoeck & Ruprecht.
Google Scholar
BURKE, C. J., ROSENBLATT, M. (1958): A Markovian function of a Markov chain. Ann. math. stat., 29, 1112–1120.
Article MathSciNet MATH Google Scholar
EVERITT, B. S. (1993): Cluster Analysis. Edward Arnold, London.
Google Scholar
KNAB, B. (2000): Erweiterungen von Hidden-Markov-Modellen zur Analyse ökonomischer Zeitreihen. Ph.D. thesis.
Google Scholar
KNAB, B., SCHLIEP, A., STECKEMETZ, B., WICHERN, B., GÄDKE, A., THORANSDOTTIR, D. (2002): The GNU Hidden Markov Model Library. Available from http://www.zpr.uni-koeln.de/hmm.
Google Scholar
KNAB, B., SCHRADER, R., WEBER, I., WEINBRECHT, K., WICHERN, B. (1997): Mesoskopisches Simulationsmodell zur Kollektivfortschreibung. Tech. Rep. ZPR97-295, Mathematisches Institut, Universität zu Köln
Google Scholar
MACDONALD, I. L., ZUCCHINI, W. (1997): Hidden Markov and other models for discrete-valued time series. Chapman & Hall, London.
MATH Google Scholar
MCLACHLAN, G., BASFORD, K. (1988): Mixture Models: Inference and Applications to Clustering. Marcel Dekker, Inc., New York, Basel.
MATH Google Scholar
PETRIE, T. (1969): Probabilistic functions of finite state Markov chains. Ann. Math. Statist, 40, 97–115.
Article MathSciNet MATH Google Scholar
RABINER, L. R. (1989): A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE, 77(2), 257–285.
Article Google Scholar
SJOLANDER, K., KARPLUS, K., BROWN, M., HUGHEY, R., KROGH, A., MIAN, I. S., HAUSSLER, D. (1996): Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput Appl Biosci, 12(4), 327–45.
Google Scholar
SMYTH, P. (2000): A general probabilistic framework for clustering individuals. Tech. Rep. TR-00-09, University of California, Irvine.
Google Scholar
WICHERN, B. (2001): Hidden-Markov-Modelle zur Analyse und Simulation von Finanzzeitreihen. Ph.D. Thesis.
Google Scholar

Download references

Author information

Authors and Affiliations

Bayer AG, D-51368, Leverkusen, Germany
Bernhard Knab
Department Computational Molecular Biology, Max-Planck-Institut für Molekulare Genetik, D-14195, Berlin, Germany
Alexander Schliep
Science Factory GmbH, D-50667, Köln, Germany
Barthel Steckemetz
ifb AG, D-50667, Köln, Germany
Bernd Wichern

Authors

Bernhard Knab
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Schliep
View author publications
You can also search for this author in PubMed Google Scholar
Barthel Steckemetz
View author publications
You can also search for this author in PubMed Google Scholar
Bernd Wichern
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Information Systems, University of Mannheim, Schloss, 68131, Mannheim, Germany
Martin Schader
Institute of Decision Theory, University of Karlsruhe, Kaiserstr. 12, 76128, Karlsruhe, Germany
Wolfgang Gaul
Department of Statistics, University of Rome, Piazzale Aldo Moro, 00185, Rome, Italy
Maurizio Vichi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Knab, B., Schliep, A., Steckemetz, B., Wichern, B. (2003). Model-Based Clustering With Hidden Markov Models and its Application to Financial Time-Series Data. In: Schader, M., Gaul, W., Vichi, M. (eds) Between Data Science and Applied Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18991-3_64

Download citation

DOI: https://doi.org/10.1007/978-3-642-18991-3_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40354-8
Online ISBN: 978-3-642-18991-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics