Analytics on Non-Normalized Data Sources: More Learning, Rather Than More Cleaning

Data analysis is increasingly performed over data assembled from uncontrolled sources, facing inconsistency in knowledge-representation conventions. The typical practice is to create “clean” data for analysis, matching entities and merging variants to overcome differences in kn...

Bibliographic details
Main authors: Alexis Cvetkov-Iliev, Alexandre Allauzen, Gael Varoquaux
Format: Article
Language: English
Published: IEEE 2022-01-01
Series: IEEE Access
Online access: https://ieeexplore.ieee.org/document/9758752/