Home
Scholarly Works
Towards a Unified Framework for Data Cleaning and...
Conference

Towards a Unified Framework for Data Cleaning and Data Privacy

Abstract

Data quality has become a pervasive challenge for organizations as they wrangle with large, heterogeneous datasets to extract value. Existing data cleaning solutions have focused on scalable techniques to resolve inconsistencies quickly. However, given the proliferation of sensitive, confidential user information, data privacy concerns have largely remained unexplored in data cleaning techniques. In this work, we present a new privacy-aware, data cleaning framework that aims to resolve data inconsistencies while minimizing the amount of information disclosed. We present a set of data disclosure operations that facilitate the data cleaning process, and propose two information-theoretic measures for privacy loss and data utility that are used to correct inconsistencies in the data.

Authors

Huang Y; Chiang F

Series

Lecture Notes in Computer Science

Volume

9419

Pagination

pp. 359-365

Publisher

Springer Nature

Publication Date

January 1, 2015

DOI

10.1007/978-3-319-26187-4_34

Conference proceedings

Lecture Notes in Computer Science

ISSN

0302-9743
View published work (Non-McMaster Users)

Contact the Experts team