Semantic Information Integration in the Large: Adaptability, Extensibility, and Scalability of the Context Mediation Approach

There is pressing need for effectively integrating information from an ever increasing number of available sources both on the web and in other existing systems. A key difficulty of achieving this goal comes from the pervasive heterogeneities in all levels of information systems. Existing and emergi...

Full description

Bibliographic Details
Main Authors: Gannon, Thomas, Madnick, Stuart, Moulton, Allen, Siegel, Michael, Sabbouh, Marwan, Zhu, Hongwei
Format: Working Paper
Language:en_US
Published: 2005
Subjects:
Online Access:http://hdl.handle.net/1721.1/18072
_version_ 1811069826447179776
author Gannon, Thomas
Madnick, Stuart
Moulton, Allen
Siegel, Michael
Sabbouh, Marwan
Zhu, Hongwei
author_facet Gannon, Thomas
Madnick, Stuart
Moulton, Allen
Siegel, Michael
Sabbouh, Marwan
Zhu, Hongwei
author_sort Gannon, Thomas
collection MIT
description There is pressing need for effectively integrating information from an ever increasing number of available sources both on the web and in other existing systems. A key difficulty of achieving this goal comes from the pervasive heterogeneities in all levels of information systems. Existing and emerging technologies, such as the Web, ODBC, XML, and Web Services, provide essential capabilities in resolving heterogeneities in the hardware and software platforms, but they do not address the semantic heterogeneity of the data itself. A robust solution to this problem needs to be adaptable, extensible, and scalable. In this paper, we identify the deficiencies of traditional approaches that address this problem using hand-coded programs or require complete data standardization. The COntext INterchange (COIN) approach overcomes these deficiencies by declaratively representing data semantics and using a mediator to create the necessary conversion programs using a small number of conversion rules. The capabilities of COIN is demonstrated using an intelligence information integration example consisting of 150 data sources, where COIN can automatically generate the over 22,000 conversion programs needed to enable semantic integration using only six parametizable conversion rules. This paper makes a unique contribution by providing a systematic evaluation of COIN and other commonly practiced approaches.
first_indexed 2024-09-23T08:17:02Z
format Working Paper
id mit-1721.1/18072
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T08:17:02Z
publishDate 2005
record_format dspace
spelling mit-1721.1/180722019-04-09T17:45:08Z Semantic Information Integration in the Large: Adaptability, Extensibility, and Scalability of the Context Mediation Approach Gannon, Thomas Madnick, Stuart Moulton, Allen Siegel, Michael Sabbouh, Marwan Zhu, Hongwei semantic integration adaptability extensibility scalability context There is pressing need for effectively integrating information from an ever increasing number of available sources both on the web and in other existing systems. A key difficulty of achieving this goal comes from the pervasive heterogeneities in all levels of information systems. Existing and emerging technologies, such as the Web, ODBC, XML, and Web Services, provide essential capabilities in resolving heterogeneities in the hardware and software platforms, but they do not address the semantic heterogeneity of the data itself. A robust solution to this problem needs to be adaptable, extensible, and scalable. In this paper, we identify the deficiencies of traditional approaches that address this problem using hand-coded programs or require complete data standardization. The COntext INterchange (COIN) approach overcomes these deficiencies by declaratively representing data semantics and using a mediator to create the necessary conversion programs using a small number of conversion rules. The capabilities of COIN is demonstrated using an intelligence information integration example consisting of 150 data sources, where COIN can automatically generate the over 22,000 conversion programs needed to enable semantic integration using only six parametizable conversion rules. This paper makes a unique contribution by providing a systematic evaluation of COIN and other commonly practiced approaches. 2005-06-03T16:26:47Z 2005-06-03T16:26:47Z 2005-06-03T16:26:47Z Working Paper http://hdl.handle.net/1721.1/18072 en_US MIT Sloan School of Management Working Paper;4541-05 CISL Working Paper;2005-04 380374 bytes application/pdf application/pdf
spellingShingle semantic integration
adaptability
extensibility
scalability
context
Gannon, Thomas
Madnick, Stuart
Moulton, Allen
Siegel, Michael
Sabbouh, Marwan
Zhu, Hongwei
Semantic Information Integration in the Large: Adaptability, Extensibility, and Scalability of the Context Mediation Approach
title Semantic Information Integration in the Large: Adaptability, Extensibility, and Scalability of the Context Mediation Approach
title_full Semantic Information Integration in the Large: Adaptability, Extensibility, and Scalability of the Context Mediation Approach
title_fullStr Semantic Information Integration in the Large: Adaptability, Extensibility, and Scalability of the Context Mediation Approach
title_full_unstemmed Semantic Information Integration in the Large: Adaptability, Extensibility, and Scalability of the Context Mediation Approach
title_short Semantic Information Integration in the Large: Adaptability, Extensibility, and Scalability of the Context Mediation Approach
title_sort semantic information integration in the large adaptability extensibility and scalability of the context mediation approach
topic semantic integration
adaptability
extensibility
scalability
context
url http://hdl.handle.net/1721.1/18072
work_keys_str_mv AT gannonthomas semanticinformationintegrationinthelargeadaptabilityextensibilityandscalabilityofthecontextmediationapproach
AT madnickstuart semanticinformationintegrationinthelargeadaptabilityextensibilityandscalabilityofthecontextmediationapproach
AT moultonallen semanticinformationintegrationinthelargeadaptabilityextensibilityandscalabilityofthecontextmediationapproach
AT siegelmichael semanticinformationintegrationinthelargeadaptabilityextensibilityandscalabilityofthecontextmediationapproach
AT sabbouhmarwan semanticinformationintegrationinthelargeadaptabilityextensibilityandscalabilityofthecontextmediationapproach
AT zhuhongwei semanticinformationintegrationinthelargeadaptabilityextensibilityandscalabilityofthecontextmediationapproach