Pattern functional dependencies for data cleaning

Patterns (or regex-based expressions) are widely used to constrain the format of a domain (or a column), e.g., a Year column should contain only four digits, and thus a value like “1980-” might be a typo. Moreover, integrity constraints (ICs) defined over multiple columns, such as (conditional) func...
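
As a minimal illustration of the single-column case described above, the following Python sketch checks a Year column against a four-digit pattern. The pattern, the function name `find_format_violations`, and the sample values are assumptions made for illustration; they are not taken from the article.

```python
import re

# Assumed pattern for a Year column: exactly four ASCII digits.
YEAR_PATTERN = re.compile(r"[0-9]{4}")


def find_format_violations(values):
    """Return the values that do not fully match the four-digit pattern."""
    return [v for v in values if YEAR_PATTERN.fullmatch(v) is None]


# Hypothetical column contents; "1980-" mirrors the typo mentioned in the abstract.
years = ["1980", "1980-", "2021", "21"]
violations = find_format_violations(years)
print(violations)
```

A value flagged this way is only a candidate error; whether it is a typo still depends on the data and the intended domain.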

Bibliographic Details
Main Authors: Qahtan, Abdulhakim; Tang, Nan; Ouzzani, Mourad; Cao, Yang; Stonebraker, Michael
Format: Article
Language: English
Published: VLDB Endowment 2021
Online Access: https://hdl.handle.net/1721.1/133951