Journal article
Finding Outliers in Gaussian Model-based Clustering
Abstract
Clustering, or unsupervised classification, is a task often plagued by outliers. Yet there is a paucity of work on handling outliers in clustering. Outlier identification algorithms tend to fall into three broad categories: outlier inclusion, outlier trimming, and post hoc outlier identification methods, with the former two often requiring pre-specification of the number of outliers. The fact that sample squared Mahalanobis distance is …
Authors
Clark KM; McNicholas PD
Journal
Journal of Classification, Vol. 41, No. 2, pp. 313–337
Publisher
Springer Nature
Publication Date
July 2024
DOI
10.1007/s00357-024-09473-3
ISSN
0176-4268