Entity comparison in knowledge graphs

<p>Knowledge graphs (KGs) represent information in a simple, interlinked format and are used in a variety of applications. Besides the standard reasoning tasks such as query answering, there has been an increasing need for new types of analysis. One such fundamental analysis task is entity com...

Full description

Bibliographic Details
Main Author:	Petrova, A
Other Authors:	Horrocks, I
Format:	Thesis
Language:	English
Published:	2019
Subjects:	Computational complexity Query languages (Computer science) SPARQL (Computer program language) Semantic Web Graph databases

_version_	1826316379754594304
author	Petrova, A
author2	Horrocks, I
author_facet	Horrocks, I Petrova, A
author_sort	Petrova, A
collection	OXFORD
description	<p>Knowledge graphs (KGs) represent information in a simple, interlinked format and are used in a variety of applications. Besides the standard reasoning tasks such as query answering, there has been an increasing need for new types of analysis. One such fundamental analysis task is entity comparison, i.e., determining what two entities have in common and how they are different. The importance of such a task can be seen in various applications such as product comparisons, and explainable recommender systems. This thesis studies the problem of entity comparison over knowledge graphs and presents a novel framework that models comparisons via similarity and difference queries. It is the first declarative comparison framework for linked data that treats both similarities and differences as first-class citizens.</p> <p>We first define the syntax and semantics of various types of comparison queries, motivating each type with examples and establishing important relations between different query types. Next, we study the complexity of the two fundamental decision problems over queries, namely the existence and verification problems, for basic comparison queries and for most specific similarity queries (MSSQs). We consider two underlying fragments of SPARQL, with and without arithmetic comparisons, and we show how they affect the complexity of the above problems, and which types of data they suit most. We then study practical aspects of computing the MSSQ. In particular, we establish an important uniqueness result for MSSQs and propose an efficient approximation algorithm. We implement an algorithm for computing acyclic similarity queries and evaluate its performance on large-scale KGs; our empirical results demonstrate the practical feasibility of our algorithm.</p>
first_indexed	2024-03-06T21:05:52Z
format	Thesis
id	oxford-uuid:3c6dce6e-04a1-4aee-85d9-ad45ee374fc2
institution	University of Oxford
language	English
last_indexed	2024-12-09T03:44:07Z
publishDate	2019
record_format	dspace
spelling	oxford-uuid:3c6dce6e-04a1-4aee-85d9-ad45ee374fc22024-12-07T15:27:22ZEntity comparison in knowledge graphsThesishttp://purl.org/coar/resource_type/c_db06uuid:3c6dce6e-04a1-4aee-85d9-ad45ee374fc2Computational complexityQuery languages (Computer science)SPARQL (Computer program language)Semantic WebGraph databasesEnglishHyrax Deposit2019Petrova, AHorrocks, ICuenca Grau, BKostylev, E<p>Knowledge graphs (KGs) represent information in a simple, interlinked format and are used in a variety of applications. Besides the standard reasoning tasks such as query answering, there has been an increasing need for new types of analysis. One such fundamental analysis task is entity comparison, i.e., determining what two entities have in common and how they are different. The importance of such a task can be seen in various applications such as product comparisons, and explainable recommender systems. This thesis studies the problem of entity comparison over knowledge graphs and presents a novel framework that models comparisons via similarity and difference queries. It is the first declarative comparison framework for linked data that treats both similarities and differences as first-class citizens.</p> <p>We first define the syntax and semantics of various types of comparison queries, motivating each type with examples and establishing important relations between different query types. Next, we study the complexity of the two fundamental decision problems over queries, namely the existence and verification problems, for basic comparison queries and for most specific similarity queries (MSSQs). We consider two underlying fragments of SPARQL, with and without arithmetic comparisons, and we show how they affect the complexity of the above problems, and which types of data they suit most. We then study practical aspects of computing the MSSQ. In particular, we establish an important uniqueness result for MSSQs and propose an efficient approximation algorithm. We implement an algorithm for computing acyclic similarity queries and evaluate its performance on large-scale KGs; our empirical results demonstrate the practical feasibility of our algorithm.</p>
spellingShingle	Computational complexity Query languages (Computer science) SPARQL (Computer program language) Semantic Web Graph databases Petrova, A Entity comparison in knowledge graphs
title	Entity comparison in knowledge graphs
title_full	Entity comparison in knowledge graphs
title_fullStr	Entity comparison in knowledge graphs
title_full_unstemmed	Entity comparison in knowledge graphs
title_short	Entity comparison in knowledge graphs
title_sort	entity comparison in knowledge graphs
topic	Computational complexity Query languages (Computer science) SPARQL (Computer program language) Semantic Web Graph databases
work_keys_str_mv	AT petrovaa entitycomparisoninknowledgegraphs

Entity comparison in knowledge graphs

Similar Items