Home
Scholarly Works
Nonparametric estimation of distributions with...
Journal article

Nonparametric estimation of distributions with categorical and continuous data

Abstract

In this paper we consider the problem of estimating an unknown joint distribution which is defined over mixed discrete and continuous variables. A nonparametric kernel approach is proposed with smoothing parameters obtained from the cross-validated minimization of the estimator's integrated squared error. We derive the rate of convergence of the cross-validated smoothing parameters to their ‘benchmark’ optimal values, and we also establish the asymptotic normality of the resulting nonparametric kernel density estimator. Monte Carlo simulations illustrate that the proposed estimator performs substantially better than the conventional nonparametric frequency estimator in a range of settings. The simulations also demonstrate that the proposed approach does not suffer from known limitations of the likelihood cross-validation method which breaks down with commonly used kernels when the continuous variables are drawn from fat-tailed distributions. An empirical application demonstrates that the proposed method can yield superior predictions relative to commonly used parametric models.

Authors

Li Q; Racine J

Journal

Journal of Multivariate Analysis, Vol. 86, No. 2, pp. 266–292

Publisher

Elsevier

Publication Date

January 1, 2003

DOI

10.1016/s0047-259x(02)00025-8

ISSN

0047-259X

Contact the Experts team