Home
Scholarly Works
Cross-validation and the estimation of probability...
Journal article

Cross-validation and the estimation of probability distributions with categorical data

Abstract

In this paper, we consider the problem of estimating a joint distribution that is defined over a set of discrete variables. We use a smoothing kernel estimator to estimate the joint distribution. We allow for the case in which some of the discrete variables are uniformly distributed, and explicitly address the vector-valued smoothing parameter case due to its practical relevance. We show that the cross-validated smoothing parameters differ in their asymptotic behavior depending on whether a variable is uniformly distributed or not. We also discuss the mixed discrete and continuous variable case. Simulations show that the proposed estimator performs much better than the commonly used frequency estimator.

Authors

Ouyang D; Li Q; Racine J

Journal

Journal of Nonparametric Statistics, Vol. 18, No. 1, pp. 69–100

Publisher

Taylor & Francis

Publication Date

January 1, 2006

DOI

10.1080/10485250600569002

ISSN

1048-5252

Contact the Experts team