Analytics on Non-Normalized Data Sources: More Learning, Rather Than More Cleaning

Data analysis is increasingly performed over data assembled from uncontrolled sources, facing inconsistency in knowledge-representation conventions. The typical practice is to create “clean” data for analysis, matching entities and merging variants to overcome differences in kn...

Full description

Bibliographic Details
Main Authors: Alexis Cvetkov-Iliev, Alexandre Allauzen, Gael Varoquaux
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9758752/