Conference
Models for Distributed, Large Scale Data Cleaning
Abstract
Poor data quality is a serious and costly problem affecting organizations across all industries. Real data is often dirty, containing missing, erroneous, incomplete, and duplicate values. Declarative data cleaning techniques have been proposed to resolve some of these underlying errors by identifying the inconsistencies and proposing updates to the data. However, much of this work has focused on cleaning data in static environments. Given the …
Authors
Maccio VJ; Chiang F; Down DG
Series
Lecture Notes in Computer Science
Volume
8643
Pagination
pp. 369-380
Publisher
Springer Nature
Publication Date
2014
DOI
10.1007/978-3-319-13186-3_34
Conference proceedings
Lecture Notes in Computer Science
ISSN
0302-9743