The fast and the numerous - Combining machine and community intelligence for semantic annotation

Starting from the observation that certain communities have incentive mechanisms in place to create large amounts of unstructured content, we propose in this paper an original model which we expect to lead to the large number of annotations required to semantically enrich Web content at a large scal...

Full description

Bibliographic Details
Main Authors: Blohm, S, Krötzsch, M, Cimiano, P
Format: Journal article
Language:English
Published: 2008
_version_ 1797061382717308928
author Blohm, S
Krötzsch, M
Cimiano, P
author_facet Blohm, S
Krötzsch, M
Cimiano, P
author_sort Blohm, S
collection OXFORD
description Starting from the observation that certain communities have incentive mechanisms in place to create large amounts of unstructured content, we propose in this paper an original model which we expect to lead to the large number of annotations required to semantically enrich Web content at a large scale. The novelty of our model lies in the combination of two key ingredients: the effort that online communities are making to create content and the capability of machines to detect regular patterns in user annotation to suggest new annotations. Provided that the creation of semantic content is made easy enough and incentives are in place, we can assume that these communities will be willing to provide annotations. However, as human resources are clearly limited, we aim at integrating algorithmic support into our model to bootstrap on existing annotations and learn patterns to be used for suggesting new annotations. As the automatically extracted information needs to be validated, our model presents the extracted knowledge to the user in the form of questions, thus allowing for the validation of the information. In this paper, we describe the requirements on our model, its concrete implementation based on Semantic MediaWiki and an information extraction system and discuss lessons learned from practical experience with real users. These experiences allow us to conclude that our model is a promising approach towards leveraging semantic annotation. Copyright © 2008.
first_indexed 2024-03-06T20:30:19Z
format Journal article
id oxford-uuid:30d20422-21a1-4cda-88a6-9915597384e4
institution University of Oxford
language English
last_indexed 2024-03-06T20:30:19Z
publishDate 2008
record_format dspace
spelling oxford-uuid:30d20422-21a1-4cda-88a6-9915597384e42022-03-26T13:03:57ZThe fast and the numerous - Combining machine and community intelligence for semantic annotationJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:30d20422-21a1-4cda-88a6-9915597384e4EnglishSymplectic Elements at Oxford2008Blohm, SKrötzsch, MCimiano, PStarting from the observation that certain communities have incentive mechanisms in place to create large amounts of unstructured content, we propose in this paper an original model which we expect to lead to the large number of annotations required to semantically enrich Web content at a large scale. The novelty of our model lies in the combination of two key ingredients: the effort that online communities are making to create content and the capability of machines to detect regular patterns in user annotation to suggest new annotations. Provided that the creation of semantic content is made easy enough and incentives are in place, we can assume that these communities will be willing to provide annotations. However, as human resources are clearly limited, we aim at integrating algorithmic support into our model to bootstrap on existing annotations and learn patterns to be used for suggesting new annotations. As the automatically extracted information needs to be validated, our model presents the extracted knowledge to the user in the form of questions, thus allowing for the validation of the information. In this paper, we describe the requirements on our model, its concrete implementation based on Semantic MediaWiki and an information extraction system and discuss lessons learned from practical experience with real users. These experiences allow us to conclude that our model is a promising approach towards leveraging semantic annotation. Copyright © 2008.
spellingShingle Blohm, S
Krötzsch, M
Cimiano, P
The fast and the numerous - Combining machine and community intelligence for semantic annotation
title The fast and the numerous - Combining machine and community intelligence for semantic annotation
title_full The fast and the numerous - Combining machine and community intelligence for semantic annotation
title_fullStr The fast and the numerous - Combining machine and community intelligence for semantic annotation
title_full_unstemmed The fast and the numerous - Combining machine and community intelligence for semantic annotation
title_short The fast and the numerous - Combining machine and community intelligence for semantic annotation
title_sort fast and the numerous combining machine and community intelligence for semantic annotation
work_keys_str_mv AT blohms thefastandthenumerouscombiningmachineandcommunityintelligenceforsemanticannotation
AT krotzschm thefastandthenumerouscombiningmachineandcommunityintelligenceforsemanticannotation
AT cimianop thefastandthenumerouscombiningmachineandcommunityintelligenceforsemanticannotation
AT blohms fastandthenumerouscombiningmachineandcommunityintelligenceforsemanticannotation
AT krotzschm fastandthenumerouscombiningmachineandcommunityintelligenceforsemanticannotation
AT cimianop fastandthenumerouscombiningmachineandcommunityintelligenceforsemanticannotation