Skip to main content

Model-Based Clustering With Hidden Markov Models and its Application to Financial Time-Series Data

  • Conference paper
Between Data Science and Applied Data Analysis

Abstract

We have developed a method to partition a set of data into clusters by use of Hidden Markov Models. Given a number of clusters, each of which is represented by one Hidden Markov Model, an iterative procedure finds the combination of cluster models and an assignment of data points to cluster models which maximizes the joint likelihood of the clustering. To reflect the partially non-Markovian nature of the data we also extend classical Hidden Markov Models to use a non-homogeneous Markov chain, where the non-homogeneity is dependent not on the time of the observation but rather on a quantity derived from previous observations.

We present the method and an evaluation on simulated time-series and large data sets of financial time-series from the Public Saving and Loan Banks in Germany.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • BACHEM, A. ET AL. (1997): Analyse großer Datenmengen und Clusteralgorithmen im Bausparwesen. In: C. Hipp, W. Eichhorn, W.-R., W.-R. Heilmann (Eds.), Beiträge zum 7. Symposium Geld, Finanzwirtschaft, Banken und Versicherungen, Dezember 1996, no. 257, 955–961.

    Google Scholar 

  • BAUM, L. E., PETRIE, T. (1966): Statistical inference for probabilistic functions of finite Markov chains. Ann. Math. Statist., 37, 1554–1563.

    Article  MathSciNet  MATH  Google Scholar 

  • BAUM, L. E., PETRIE, T., SOULES, G., WEISS, N. (1970): A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann. Math. Statist., 41, 164–171.

    Article  MathSciNet  MATH  Google Scholar 

  • BOCK, H. H. (1974): Automatische Klassifikation. Theoretische und praktische Methoden zur Gruppierung und Strukturierung von Daten. Vandenhoeck & Ruprecht.

    Google Scholar 

  • BURKE, C. J., ROSENBLATT, M. (1958): A Markovian function of a Markov chain. Ann. math. stat., 29, 1112–1120.

    Article  MathSciNet  MATH  Google Scholar 

  • EVERITT, B. S. (1993): Cluster Analysis. Edward Arnold, London.

    Google Scholar 

  • KNAB, B. (2000): Erweiterungen von Hidden-Markov-Modellen zur Analyse ökonomischer Zeitreihen. Ph.D. thesis.

    Google Scholar 

  • KNAB, B., SCHLIEP, A., STECKEMETZ, B., WICHERN, B., GÄDKE, A., THORANSDOTTIR, D. (2002): The GNU Hidden Markov Model Library. Available from http://www.zpr.uni-koeln.de/hmm.

    Google Scholar 

  • KNAB, B., SCHRADER, R., WEBER, I., WEINBRECHT, K., WICHERN, B. (1997): Mesoskopisches Simulationsmodell zur Kollektivfortschreibung. Tech. Rep. ZPR97-295, Mathematisches Institut, Universität zu Köln

    Google Scholar 

  • MACDONALD, I. L., ZUCCHINI, W. (1997): Hidden Markov and other models for discrete-valued time series. Chapman & Hall, London.

    MATH  Google Scholar 

  • MCLACHLAN, G., BASFORD, K. (1988): Mixture Models: Inference and Applications to Clustering. Marcel Dekker, Inc., New York, Basel.

    MATH  Google Scholar 

  • PETRIE, T. (1969): Probabilistic functions of finite state Markov chains. Ann. Math. Statist, 40, 97–115.

    Article  MathSciNet  MATH  Google Scholar 

  • RABINER, L. R. (1989): A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE, 77(2), 257–285.

    Article  Google Scholar 

  • SJOLANDER, K., KARPLUS, K., BROWN, M., HUGHEY, R., KROGH, A., MIAN, I. S., HAUSSLER, D. (1996): Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput Appl Biosci, 12(4), 327–45.

    Google Scholar 

  • SMYTH, P. (2000): A general probabilistic framework for clustering individuals. Tech. Rep. TR-00-09, University of California, Irvine.

    Google Scholar 

  • WICHERN, B. (2001): Hidden-Markov-Modelle zur Analyse und Simulation von Finanzzeitreihen. Ph.D. Thesis.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Knab, B., Schliep, A., Steckemetz, B., Wichern, B. (2003). Model-Based Clustering With Hidden Markov Models and its Application to Financial Time-Series Data. In: Schader, M., Gaul, W., Vichi, M. (eds) Between Data Science and Applied Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18991-3_64

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-18991-3_64

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40354-8

  • Online ISBN: 978-3-642-18991-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics