selected scholarly activity
chapters
- Discovery of Keys for Graphs. Lecture Notes in Computer Science. 202-208. 2022
conferences
- Will my Flight be on Time? Learning from Part Failures to Predict Future Reliability. 2023 IEEE International Conference on Big Data (BigData). 1803-1813. 2023
- Lithium-ion Battery State-of-Health Estimation via Histogram Data, Principal Component Analysis, and Machine Learning. 2023 IEEE Transportation Electrification Conference & Expo (ITEC). 2023
- Inconsistency Detection with Temporal Graph Functional Dependencies. Proceedings - International Conference on Data Engineering. 464-476. 2023
- Efficient Action Recognition Using Confidence Distillation. Proceedings - International Conference on Pattern Recognition. 3362-3369. 2022
- Confidence Bounded Replica Currency Estimation. Proceedings of the ACM SIGMOD International Conference on Management of Data. 730-743. 2022
- Discovery of Temporal Graph Functional Dependencies. Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 3348-3352. 2021
- Preserving diversity in anonymized data. Advances in Database Technology - EDBT. 511-516. 2021
- Quantifying duplication to improve data quality. Proceedings of the 27th Annual International Conference on Computer Science and Software Engineering, CASCON 2017. 272-278. 2020
- CurrentClean. Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 2917-2920. 2019
- CurrentClean: Spatio-Temporal Cleaning of Stale Data. Proceedings - International Conference on Data Engineering. 172-183. 2019
- Restoring Consistency in Ontological Multidimensional Data Models via Weighted Repairs. Procedia Computer Science. 1085-1094. 2019
- PACAS: Privacy-Aware, Data Cleaning-as-a-Service. 2018 IEEE International Conference on Big Data (Big Data). 1023-1030. 2018
- Contextual Data Cleaning. 2018 IEEE 34th International Conference on Data Engineering Workshops (ICDEW). 21-24. 2018
- FastOFD: Contextual data cleaning with ontology functional dependencies. Advances in Database Technology - EDBT. 694-697. 2018
- Privacy aware web services in the cloud. 2017 IEEE Conference on Communications and Network Security (CNS). 458-466. 2017
- Efficient Discovery of Ontology Functional Dependencies. CIKM. 1847-1856. 2017
- Refining Duplicate Detection for Improved Data Quality. TDDL/MDQual/Futurity@TPDL. 2017
- PARC. Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. 2433-2436. 2016
- A Data Quality Framework for Customer Relationship Analytics. Lecture Notes in Computer Science. 366-378. 2015
- Towards a Unified Framework for Data Cleaning and Data Privacy. Lecture Notes in Computer Science. 359-365. 2015
- CONDOR. Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. 2087-2089. 2014
- Continuous data cleaning. Proceedings - International Conference on Data Engineering. 244-255. 2014
- Models for Distributed, Large Scale Data Cleaning. Lecture Notes in Computer Science. 369-380. 2014
- An Algebraic Approach Towards Data Cleaning. Procedia Computer Science. 50-59. 2013
- AutoDict: Automated Dictionary Discovery. Proceedings - International Conference on Data Engineering. 1277-1280. 2012
- Automated dictionary discovery for the online marketplace. Proceedings of the 2012 iConference. 421-422. 2012
- Active repair of data quality rules. ICIQ 2011 - Proceedings of the 16th International Conference on Information Quality. 174-188. 2011
- A unified model for data and constraint repair. Proceedings - International Conference on Data Engineering. 446-457. 2011
- An XML index advisor for DB2. Proceedings of the ACM SIGMOD International Conference on Management of Data. 1267-1270. 2008
- XML Index Recommendation with Tight Optimizer Coupling. Proceedings - International Conference on Data Engineering. 833-+. 2008
- Seeking stable clusters in the blogosphere. 33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings. 806-817. 2007
journal articles
- Mining Keys for Graphs. Data & Knowledge Engineering. 150:102274. 2024
- Data Anonymization With Diversity Constraints. IEEE Transactions on Knowledge and Data Engineering. 35:3603-3618. 2023
- Contextual Data Cleaning with Ontology Functional Dependencies. Journal of Data and Information Quality. 14:1-26. 2022
- From demand forecasting to inventory ordering decisions for red blood cells through integrating machine learning, statistical modeling, and inventory optimization. Transfusion. 62:87-99. 2022
- A decision integration strategy for short-term demand forecasting and ordering for red blood cell components. Operations Research for Health Care. 29:100290. 2021
- Privacy-aware data cleaning-as-a-service. Information Systems. 94:101608. 2020
- Ontology-based entity matching in attributed graphs. Proceedings of the VLDB Endowment. 12:1195-1207. 2019
- InfoClean. Journal of Data and Information Quality. 9:1-26. 2017
- Efficient Discovery of Ontology Functional Dependencies. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. Part F131841:1847-1856. 2017
- Unifying Data and Constraint Repairs. Journal of Data and Information Quality. 7:1-26. 2016
- Data Driven Discovery of Attribute Dictionaries. Lecture Notes in Computer Science. 9630:69-96. 2016
- Combining quantitative and logical data cleaning. Proceedings of the VLDB Endowment. 9:300-311. 2015
- Repairing integrity rules for improved data quality. International Journal of Information Quality. 3:273-273. 2014
- Framework for evaluating clustering algorithms in duplicate detection. Proceedings of the VLDB Endowment. 2:1282-1293. 2009
- Discovering data quality rules. Proceedings of the VLDB Endowment. 1:1166-1177. 2008
preprints
- Discovery of Keys for Graphs [Extended Version]. 2022
- Efficient Action Recognition Using Confidence Distillation. 2021
- Temporal Graph Functional Dependencies [Extended Version]. 2021
- Discovery and Contextual Data Cleaning with Ontology Functional Dependencies. 2021
- A decision integration strategy for short-term demand forecasting and ordering for red blood cell components. 2020
- Privacy-Aware Data Cleaning-as-a-Service (Extended Version). 2020
- Efficient Discovery of Ontology Functional Dependencies. 2016