Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data

NoSQL database can provide massive, high concurrency, and scalable services for storing different types of data. HBase, a type of NoSQL database, in which columns are grouped into column families, is very suitable for storing semi-structured or unstructured spatial vector data. However, since there...

Full description

Bibliographic Details
Main Authors: Dongfang Zhang, Yong Wang, Zhenling Liu, Shijie Dai
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8736364/
_version_ 1818407582099832832
author Dongfang Zhang
Yong Wang
Zhenling Liu
Shijie Dai
author_facet Dongfang Zhang
Yong Wang
Zhenling Liu
Shijie Dai
author_sort Dongfang Zhang
collection DOAJ
description NoSQL database can provide massive, high concurrency, and scalable services for storing different types of data. HBase, a type of NoSQL database, in which columns are grouped into column families, is very suitable for storing semi-structured or unstructured spatial vector data. However, since there are few rules and constraints to be followed for the NoSQL database, the design of storage schema for spatial data based on NoSQL is difficult. In this paper, based on our early work, an improved Z-curve storage schema is proposed for spatial vector data. According to our new schema, row key of a geometric object is the Z-curve code of the spatial grids intersected with the geometric object. Moreover, geometric objects with the same row key are stored in a column family. Our proposed method has two features. First, geometric objects adjacent in the location are adjacent in physical storage. Second, redundant exists in storage for improving query accuracy. In our experiments, we compare the improved Z-curve storage schema with a Quadtree storage schema, an R-tree storage schema, and the previous Z-curve storage schema. Query response time, memory usage, and the query accuracy of spatial query on point and range are used to verify the validity of our proposed method. The experimental results show that the two storage schemas based on Z-curve achieve higher query efficiency than the two storage schemas based on tree-the Quadtree storage schema and the R-tree storage schema. More importantly, the query results of the improved Z-curve schema are completely correct, while the query results of the previous Z-curve schema are not.
first_indexed 2024-12-14T09:30:07Z
format Article
id doaj.art-bcbc71e53f8f42bf94d8bca8380e7363
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-14T09:30:07Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-bcbc71e53f8f42bf94d8bca8380e73632022-12-21T23:08:06ZengIEEEIEEE Access2169-35362019-01-017788177882910.1109/ACCESS.2019.29226938736364Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector DataDongfang Zhang0https://orcid.org/0000-0001-9219-6829Yong Wang1https://orcid.org/0000-0001-6261-8912Zhenling Liu2Shijie Dai3School of Computer Science, China University of Geosciences, Wuhan, ChinaSchool of Computer Science, China University of Geosciences, Wuhan, ChinaSchool of Computer Science, China University of Geosciences, Wuhan, ChinaSchool of Computer Science, China University of Geosciences, Wuhan, ChinaNoSQL database can provide massive, high concurrency, and scalable services for storing different types of data. HBase, a type of NoSQL database, in which columns are grouped into column families, is very suitable for storing semi-structured or unstructured spatial vector data. However, since there are few rules and constraints to be followed for the NoSQL database, the design of storage schema for spatial data based on NoSQL is difficult. In this paper, based on our early work, an improved Z-curve storage schema is proposed for spatial vector data. According to our new schema, row key of a geometric object is the Z-curve code of the spatial grids intersected with the geometric object. Moreover, geometric objects with the same row key are stored in a column family. Our proposed method has two features. First, geometric objects adjacent in the location are adjacent in physical storage. Second, redundant exists in storage for improving query accuracy. In our experiments, we compare the improved Z-curve storage schema with a Quadtree storage schema, an R-tree storage schema, and the previous Z-curve storage schema. Query response time, memory usage, and the query accuracy of spatial query on point and range are used to verify the validity of our proposed method. The experimental results show that the two storage schemas based on Z-curve achieve higher query efficiency than the two storage schemas based on tree-the Quadtree storage schema and the R-tree storage schema. More importantly, the query results of the improved Z-curve schema are completely correct, while the query results of the previous Z-curve schema are not.https://ieeexplore.ieee.org/document/8736364/Cloud computinggeographic information system (GIS)HBaseNoSQLspatial index
spellingShingle Dongfang Zhang
Yong Wang
Zhenling Liu
Shijie Dai
Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data
IEEE Access
Cloud computing
geographic information system (GIS)
HBase
NoSQL
spatial index
title Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data
title_full Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data
title_fullStr Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data
title_full_unstemmed Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data
title_short Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data
title_sort improving nosql storage schema based on z curve for spatial vector data
topic Cloud computing
geographic information system (GIS)
HBase
NoSQL
spatial index
url https://ieeexplore.ieee.org/document/8736364/
work_keys_str_mv AT dongfangzhang improvingnosqlstorageschemabasedonzcurveforspatialvectordata
AT yongwang improvingnosqlstorageschemabasedonzcurveforspatialvectordata
AT zhenlingliu improvingnosqlstorageschemabasedonzcurveforspatialvectordata
AT shijiedai improvingnosqlstorageschemabasedonzcurveforspatialvectordata