DIRECT: A System for Mining Data Value Conversion Rules from Disparate Data Sources

The successful integration of data from autonomous and heterogeneous systems calls for the resolution of semantic conflicts that may be present. Such conflicts are often reffected by discrepancies in attribute values of the same data ob...

Full description

Bibliographic Details
Main Authors: Fan, Weiguo, Lu, Hongjun, Madnick, Stuart, Cheung, David
Format: Working Paper
Language:en_US
Published: 2003
Subjects:
Online Access:http://hdl.handle.net/1721.1/1825
Description
Summary:The successful integration of data from autonomous and heterogeneous systems calls for the resolution of semantic conflicts that may be present. Such conflicts are often reffected by discrepancies in attribute values of the same data object. In this paper, we describe a recently developed prototype system, DIRECT (DIscovering and REconciling ConflicTs). The system mines data value conversion rules in the process of integrating business data from multiple sources. The system architecture and functional modules are described. The process of discovering conversion rules from sales data of a trading company is presented as an illustrative example