Home
Scholarly Works
Model based clustering of high-dimensional binary...
Journal article

Model based clustering of high-dimensional binary data

Abstract

A mixture of latent trait models with common slope parameters for model-based clustering of high-dimensional binary data, a data type for which few established methods exist, is proposed. Recent work on clustering of binary data, based on a d -dimensional Gaussian latent variable, is extended by incorporating common factor analyzers. Accordingly, this approach facilitates a low-dimensional visual representation of the clusters. The model is further extended by the incorporation of random block effects. The dependencies in each block are taken into account through block-specific parameters that are considered to be random variables. A variational approximation to the likelihood is exploited to derive a fast algorithm for determining the model parameters. Real and simulated data are used to demonstrate this approach.

Authors

Tang Y; Browne RP; McNicholas PD

Journal

Computational Statistics & Data Analysis, Vol. 87, , pp. 84–101

Publisher

Elsevier

Publication Date

July 1, 2015

DOI

10.1016/j.csda.2014.12.009

ISSN

0167-9473

Contact the Experts team