Benchmarking spatial-vector queries

The growth of data has increased exponentially, spurred by technological advancements such as smartphones becoming readily available, providing an increase in global connectivity as well as access to digital applications. This increased connectivity has led to increased creation of spatial data, dat...

Full description

Bibliographic Details
Main Author: Wong, Scott Wen Jie
Other Authors: Gao Cong
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181533
_version_ 1826122514482331648
author Wong, Scott Wen Jie
author2 Gao Cong
author_facet Gao Cong
Wong, Scott Wen Jie
author_sort Wong, Scott Wen Jie
collection NTU
description The growth of data has increased exponentially, spurred by technological advancements such as smartphones becoming readily available, providing an increase in global connectivity as well as access to digital applications. This increased connectivity has led to increased creation of spatial data, data that provide us geospatial information that can be used to further improve our lives. New methods transforming unstructured data such as text, images and audio to structured data in the form of vectors. These vector embeddings have semantic meanings that capture the relationship and context of the data. As such, there must be a database that is able to store such high-dimensional vectors, something that traditional relational databases are not well suited for. Thus, we will need to analyse how vector databases work, to understand and see how we can improve such traditional databases to be on par with vector databases in terms of storing and managing such data. In this report, we provide an overview of how vector databases work, focusing on their indexing and querying techniques. Additionally, we will design and execute various queries that use different data modalities, evaluating the performance of traditional relational database systems that have been enhanced for vector processing and vector databases. By evaluating the different database systems, we can compare their performance and understand why some systems are better than others in specific queries, identifying their strengths and limitations. Finally, we conclude on the effectiveness of each database system against the challenges faced by modern data requirements.
first_indexed 2025-03-09T13:15:40Z
format Final Year Project (FYP)
id ntu-10356/181533
institution Nanyang Technological University
language English
last_indexed 2025-03-09T13:15:40Z
publishDate 2024
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1815332024-12-09T01:07:42Z Benchmarking spatial-vector queries Wong, Scott Wen Jie Gao Cong College of Computing and Data Science gaocong@ntu.edu.sg Computer and Information Science The growth of data has increased exponentially, spurred by technological advancements such as smartphones becoming readily available, providing an increase in global connectivity as well as access to digital applications. This increased connectivity has led to increased creation of spatial data, data that provide us geospatial information that can be used to further improve our lives. New methods transforming unstructured data such as text, images and audio to structured data in the form of vectors. These vector embeddings have semantic meanings that capture the relationship and context of the data. As such, there must be a database that is able to store such high-dimensional vectors, something that traditional relational databases are not well suited for. Thus, we will need to analyse how vector databases work, to understand and see how we can improve such traditional databases to be on par with vector databases in terms of storing and managing such data. In this report, we provide an overview of how vector databases work, focusing on their indexing and querying techniques. Additionally, we will design and execute various queries that use different data modalities, evaluating the performance of traditional relational database systems that have been enhanced for vector processing and vector databases. By evaluating the different database systems, we can compare their performance and understand why some systems are better than others in specific queries, identifying their strengths and limitations. Finally, we conclude on the effectiveness of each database system against the challenges faced by modern data requirements. Bachelor's degree 2024-12-09T01:07:42Z 2024-12-09T01:07:42Z 2024 Final Year Project (FYP) Wong, S. W. J. (2024). Benchmarking spatial-vector queries. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181533 https://hdl.handle.net/10356/181533 en SCSE23-1120 application/pdf Nanyang Technological University
spellingShingle Computer and Information Science
Wong, Scott Wen Jie
Benchmarking spatial-vector queries
title Benchmarking spatial-vector queries
title_full Benchmarking spatial-vector queries
title_fullStr Benchmarking spatial-vector queries
title_full_unstemmed Benchmarking spatial-vector queries
title_short Benchmarking spatial-vector queries
title_sort benchmarking spatial vector queries
topic Computer and Information Science
url https://hdl.handle.net/10356/181533
work_keys_str_mv AT wongscottwenjie benchmarkingspatialvectorqueries