Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval
Abstract The proposal presented in this study seeks to properly represent natural language to ontologies and vice-versa. Therefore, the semi-automatic creation of a lexical database in Brazilian Portuguese containing morphological, syntactic, and semantic information that can be read by machines was...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Pontificia Universidade Católica de Campinas
|
Series: | Transinformação |
Subjects: | |
Online Access: | http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0103-37862017000100057&lng=en&tlng=en |
_version_ | 1818450590539186176 |
---|---|
author | Marcelo SCHIESSL Marisa BRÄSCHER |
author_facet | Marcelo SCHIESSL Marisa BRÄSCHER |
author_sort | Marcelo SCHIESSL |
collection | DOAJ |
description | Abstract The proposal presented in this study seeks to properly represent natural language to ontologies and vice-versa. Therefore, the semi-automatic creation of a lexical database in Brazilian Portuguese containing morphological, syntactic, and semantic information that can be read by machines was proposed, allowing the link between structured and unstructured data and its integration into an information retrieval model to improve precision. The results obtained demonstrated that the methodology can be used in the risco financeiro (financial risk) domain in Portuguese for the construction of an ontology and the lexical-semantic database and the proposal of a semantic information retrieval model. In order to evaluate the performance of the proposed model, documents containing the main definitions of the financial risk domain were selected and indexed with and without semantic annotation. To enable the comparison between the approaches, two databases were created based on the texts with the semantic annotations to represent the semantic search. The first one represents the traditional search and the second contained the index built based on the texts with the semantic annotations to represent the semantic search. The evaluation of the proposal was based on recall and precision. The queries submitted to the model showed that the semantic search outperforms the traditional search and validates the methodology used. Although more complex, the procedure proposed can be used in all kinds of domains. |
first_indexed | 2024-12-14T20:53:43Z |
format | Article |
id | doaj.art-1b11e27e36ae4f3eb4928b847f8268cf |
institution | Directory Open Access Journal |
issn | 2318-0889 |
language | English |
last_indexed | 2024-12-14T20:53:43Z |
publisher | Pontificia Universidade Católica de Campinas |
record_format | Article |
series | Transinformação |
spelling | doaj.art-1b11e27e36ae4f3eb4928b847f8268cf2022-12-21T22:47:44ZengPontificia Universidade Católica de CampinasTransinformação2318-0889291577210.1590/2318-08892017000100006S0103-37862017000100057Ontology lexicalization: Relationship between content and meaning in the context of Information RetrievalMarcelo SCHIESSLMarisa BRÄSCHERAbstract The proposal presented in this study seeks to properly represent natural language to ontologies and vice-versa. Therefore, the semi-automatic creation of a lexical database in Brazilian Portuguese containing morphological, syntactic, and semantic information that can be read by machines was proposed, allowing the link between structured and unstructured data and its integration into an information retrieval model to improve precision. The results obtained demonstrated that the methodology can be used in the risco financeiro (financial risk) domain in Portuguese for the construction of an ontology and the lexical-semantic database and the proposal of a semantic information retrieval model. In order to evaluate the performance of the proposed model, documents containing the main definitions of the financial risk domain were selected and indexed with and without semantic annotation. To enable the comparison between the approaches, two databases were created based on the texts with the semantic annotations to represent the semantic search. The first one represents the traditional search and the second contained the index built based on the texts with the semantic annotations to represent the semantic search. The evaluation of the proposal was based on recall and precision. The queries submitted to the model showed that the semantic search outperforms the traditional search and validates the methodology used. Although more complex, the procedure proposed can be used in all kinds of domains.http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0103-37862017000100057&lng=en&tlng=enInformation ScienceOntologyInformation retrievalRepresentation of informationSemantic Web. |
spellingShingle | Marcelo SCHIESSL Marisa BRÄSCHER Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval Transinformação Information Science Ontology Information retrieval Representation of information Semantic Web. |
title | Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval |
title_full | Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval |
title_fullStr | Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval |
title_full_unstemmed | Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval |
title_short | Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval |
title_sort | ontology lexicalization relationship between content and meaning in the context of information retrieval |
topic | Information Science Ontology Information retrieval Representation of information Semantic Web. |
url | http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0103-37862017000100057&lng=en&tlng=en |
work_keys_str_mv | AT marceloschiessl ontologylexicalizationrelationshipbetweencontentandmeaninginthecontextofinformationretrieval AT marisabrascher ontologylexicalizationrelationshipbetweencontentandmeaninginthecontextofinformationretrieval |