Home
Scholarly Works
Mixtures of generalized hyperbolic distributions...
Journal article

Mixtures of generalized hyperbolic distributions and mixtures of skew-t distributions for model-based clustering with incomplete data

Abstract

Robust clustering from incomplete data is an important topic because, in many practical situations, real datasets are heavy-tailed, asymmetric, and/or have arbitrary patterns of missing observations. Flexible methods and algorithms for model-based clustering are presented via mixture of the generalized hyperbolic distributions and its limiting case, the mixture of multivariate skew-t distributions. An analytically feasible EM algorithm is formulated for parameter estimation and imputation of missing values for mixture models employing missing at random mechanisms. The proposed methodologies are investigated through a simulation study with varying proportions of synthetic missing values and illustrated using a real dataset. Comparisons are made with those obtained from the traditional mixture of generalized hyperbolic distribution counterparts by filling in the missing data using the mean imputation method.

Authors

Wei Y; Tang Y; McNicholas PD

Journal

Computational Statistics & Data Analysis, Vol. 130, , pp. 18–41

Publisher

Elsevier

Publication Date

February 1, 2019

DOI

10.1016/j.csda.2018.08.016

ISSN

0167-9473

Contact the Experts team