Home
Scholarly Works
Flexible Clustering with a Sparse Mixture of...
Journal article

Flexible Clustering with a Sparse Mixture of Generalized Hyperbolic Distributions

Abstract

Robust clustering of high-dimensional data is an important topic because clusters in real datasets are often heavy-tailed and/or asymmetric. Traditional approaches to model-based clustering often fail for high dimensional data, e.g., due to the number of free covariance parameters. A parametrization of the component scale matrices for the mixture of generalized hyperbolic distributions is proposed. This parameterization includes a penalty term in the likelihood. An analytically feasible expectation-maximization algorithm is developed by placing a gamma-lasso penalty constraining the concentration matrix. The proposed methodology is investigated through simulation studies and illustrated using two real datasets.

Authors

Sochaniwsky AA; Gallaugher MPB; Tang Y; McNicholas PD

Journal

Journal of Classification, Vol. 42, No. 1, pp. 113–133

Publisher

Springer Nature

Publication Date

March 1, 2025

DOI

10.1007/s00357-024-09479-x

ISSN

0176-4268

Contact the Experts team