Collaborative Data Cleaning Framework: a Pilot Case Study for Machine Learning Development

This study experiments with collaborative data cleaning, a pivotal phase in data preparation for both analysis and machine learning. We used a provenance Data Cleaning Model (DCM) for multi-user scenarios to track changes on a dataset and conduct comprehensive experiments that simulate multiple dat...

Full description

Bibliographic Details
Main Authors: Nikolaus Parulian, Bertram Ludäscher
Format: Article
Language:English
Published: University of Edinburgh 2024-12-01
Series:International Journal of Digital Curation
Online Access:https://ijdc.net/index.php/ijdc/article/view/942