Analysis of stopping criteria for the EM algorithm in the context of patient grouping according to length of stay

Abbi, Revlin and El-Darzi, Elia and Vasilakis, Christos and Millard, Peter H. (2008) Analysis of stopping criteria for the EM algorithm in the context of patient grouping according to length of stay. In: Proceedings of the 4th International IEEE Conference on Intelligent Systems IS'08. Varna, Bulgaria, September 6-8, 2008. IEEE, Los Alamitos, USA, pp. 9-14. ISBN 9781424417391

Official URL: http://dx.doi.org/10.1109/IS.2008.4670413

Abstract

The expectation maximisation (EM) algorithm is an iterative maximum likelihood procedure often used for estimating the parameters of a mixture model. Theoretically, the likelihood function is guaranteed not to decrease as the algorithm iteratively improves upon previously derived parameter estimates. The algorithm is considered to have converged when all parameter estimates become stable and no further improvements can be made to the likelihood value. However, to reduce computational time, it is common practice to stop the algorithm before complete convergence using heuristic approaches. In this paper, we consider various stopping criteria and evaluate their effect on fitting Gaussian mixture models (GMMs) to patient length of stay (LOS) data. Although the GMM can be successfully fitted to positively skewed data such as LOS, the fitting procedure often requires many iterations of the EM algorithm. To our knowledge, no previous study has evaluated the effect of different stopping criteria on fitting GMMs to skewed distributions. Hence, the aim of this paper is to evaluate the effect of various stopping criteria in order to select and justify their use within a patient spell classification methodology. Results illustrate that criteria based on the difference in the likelihood value and on the GMM parameters may not always be good indicators for stopping the algorithm. In fact, we show that the difference in the variance parameters should be used instead, as these parameters are the last to stabilise. In addition, we specify threshold values for the other stopping criteria.
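
The difference between the stopping rules mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (em_step, fit_gmm), the tolerance values, the quantile-based initialisation and the synthetic log-normal data standing in for LOS are all assumptions made for illustration. The code fits a two-component univariate GMM by EM and records the iteration at which a log-likelihood rule and a variance rule would each first fire.

```python
import math
import random


def normal_pdf(x, mean, var):
    """Density of a univariate Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def em_step(data, weights, means, variances):
    """One EM iteration for a 1-D GMM.

    Returns the updated parameters plus the log-likelihood of the data
    under the parameters that were passed in.
    """
    k = len(weights)
    resp, loglik = [], 0.0
    # E-step: responsibility of each component for each observation.
    for x in data:
        joint = [weights[j] * normal_pdf(x, means[j], variances[j]) for j in range(k)]
        total = max(sum(joint), 1e-300)  # guard against log(0) on underflow
        loglik += math.log(total)
        resp.append([p / total for p in joint])
    # M-step: weighted maximum likelihood updates.
    new_w, new_m, new_v = [], [], []
    for j in range(k):
        nj = sum(r[j] for r in resp)
        mj = sum(r[j] * x for r, x in zip(resp, data)) / nj
        vj = sum(r[j] * (x - mj) ** 2 for r, x in zip(resp, data)) / nj
        new_w.append(nj / len(data))
        new_m.append(mj)
        new_v.append(max(vj, 1e-6))  # assumed floor to avoid a collapsed component
    return new_w, new_m, new_v, loglik


def fit_gmm(data, k=2, max_iter=500, tol_loglik=1e-6, tol_var=1e-6):
    """Run EM and note when each (assumed) stopping rule would first fire."""
    ordered, n = sorted(data), len(data)
    # Assumed initialisation: means at evenly spaced quantiles, shared variance.
    means = [ordered[int((j + 0.5) * n / k)] for j in range(k)]
    centre = sum(data) / n
    variances = [sum((x - centre) ** 2 for x in data) / n] * k
    weights = [1.0 / k] * k
    prev_loglik, deltas = None, []
    fired = {"loglik": None, "variance": None}
    for it in range(1, max_iter + 1):
        new_w, new_m, new_v, loglik = em_step(data, weights, means, variances)
        # Rule 1: change in log-likelihood between successive parameter sets.
        if prev_loglik is not None:
            deltas.append(loglik - prev_loglik)
            if fired["loglik"] is None and abs(deltas[-1]) < tol_loglik:
                fired["loglik"] = it
        # Rule 2: largest change in any variance parameter.
        d_var = max(abs(a - b) for a, b in zip(new_v, variances))
        if fired["variance"] is None and d_var < tol_var:
            fired["variance"] = it
        weights, means, variances, prev_loglik = new_w, new_m, new_v, loglik
        if fired["loglik"] is not None and fired["variance"] is not None:
            break
    return {"weights": weights, "means": means, "variances": variances,
            "loglik_deltas": deltas, "fired": fired}


random.seed(0)
# Synthetic positively skewed sample; NOT real length-of-stay data.
los = [random.lognormvariate(1.5, 0.8) for _ in range(500)]
result = fit_gmm(los)
print("iteration at which each rule first fired:", result["fired"])
```

Comparing the two entries of result["fired"] shows how far apart the rules can be for a given data set and pair of tolerances; the thresholds here are placeholders and are not the values reported in the paper.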

Item Type: Book Section
Subjects: University of Westminster > Science and Technology > Electronics and Computer Science, School of (No longer in use)
Depositing User: Miss Nina Watts
Date Deposited: 19 Dec 2008 15:49
Last Modified: 10 Nov 2010 12:50
URI: http://westminsterresearch.wmin.ac.uk/id/eprint/5640
