Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data
NoSQL database can provide massive, high concurrency, and scalable services for storing different types of data. HBase, a type of NoSQL database, in which columns are grouped into column families, is very suitable for storing semi-structured or unstructured spatial vector data. However, since there...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8736364/ |
_version_ | 1818407582099832832 |
---|---|
author | Dongfang Zhang Yong Wang Zhenling Liu Shijie Dai |
author_facet | Dongfang Zhang Yong Wang Zhenling Liu Shijie Dai |
author_sort | Dongfang Zhang |
collection | DOAJ |
description | NoSQL database can provide massive, high concurrency, and scalable services for storing different types of data. HBase, a type of NoSQL database, in which columns are grouped into column families, is very suitable for storing semi-structured or unstructured spatial vector data. However, since there are few rules and constraints to be followed for the NoSQL database, the design of storage schema for spatial data based on NoSQL is difficult. In this paper, based on our early work, an improved Z-curve storage schema is proposed for spatial vector data. According to our new schema, row key of a geometric object is the Z-curve code of the spatial grids intersected with the geometric object. Moreover, geometric objects with the same row key are stored in a column family. Our proposed method has two features. First, geometric objects adjacent in the location are adjacent in physical storage. Second, redundant exists in storage for improving query accuracy. In our experiments, we compare the improved Z-curve storage schema with a Quadtree storage schema, an R-tree storage schema, and the previous Z-curve storage schema. Query response time, memory usage, and the query accuracy of spatial query on point and range are used to verify the validity of our proposed method. The experimental results show that the two storage schemas based on Z-curve achieve higher query efficiency than the two storage schemas based on tree-the Quadtree storage schema and the R-tree storage schema. More importantly, the query results of the improved Z-curve schema are completely correct, while the query results of the previous Z-curve schema are not. |
first_indexed | 2024-12-14T09:30:07Z |
format | Article |
id | doaj.art-bcbc71e53f8f42bf94d8bca8380e7363 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-14T09:30:07Z |
publishDate | 2019-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-bcbc71e53f8f42bf94d8bca8380e73632022-12-21T23:08:06ZengIEEEIEEE Access2169-35362019-01-017788177882910.1109/ACCESS.2019.29226938736364Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector DataDongfang Zhang0https://orcid.org/0000-0001-9219-6829Yong Wang1https://orcid.org/0000-0001-6261-8912Zhenling Liu2Shijie Dai3School of Computer Science, China University of Geosciences, Wuhan, ChinaSchool of Computer Science, China University of Geosciences, Wuhan, ChinaSchool of Computer Science, China University of Geosciences, Wuhan, ChinaSchool of Computer Science, China University of Geosciences, Wuhan, ChinaNoSQL database can provide massive, high concurrency, and scalable services for storing different types of data. HBase, a type of NoSQL database, in which columns are grouped into column families, is very suitable for storing semi-structured or unstructured spatial vector data. However, since there are few rules and constraints to be followed for the NoSQL database, the design of storage schema for spatial data based on NoSQL is difficult. In this paper, based on our early work, an improved Z-curve storage schema is proposed for spatial vector data. According to our new schema, row key of a geometric object is the Z-curve code of the spatial grids intersected with the geometric object. Moreover, geometric objects with the same row key are stored in a column family. Our proposed method has two features. First, geometric objects adjacent in the location are adjacent in physical storage. Second, redundant exists in storage for improving query accuracy. In our experiments, we compare the improved Z-curve storage schema with a Quadtree storage schema, an R-tree storage schema, and the previous Z-curve storage schema. Query response time, memory usage, and the query accuracy of spatial query on point and range are used to verify the validity of our proposed method. The experimental results show that the two storage schemas based on Z-curve achieve higher query efficiency than the two storage schemas based on tree-the Quadtree storage schema and the R-tree storage schema. More importantly, the query results of the improved Z-curve schema are completely correct, while the query results of the previous Z-curve schema are not.https://ieeexplore.ieee.org/document/8736364/Cloud computinggeographic information system (GIS)HBaseNoSQLspatial index |
spellingShingle | Dongfang Zhang Yong Wang Zhenling Liu Shijie Dai Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data IEEE Access Cloud computing geographic information system (GIS) HBase NoSQL spatial index |
title | Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data |
title_full | Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data |
title_fullStr | Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data |
title_full_unstemmed | Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data |
title_short | Improving NoSQL Storage Schema Based on Z-Curve for Spatial Vector Data |
title_sort | improving nosql storage schema based on z curve for spatial vector data |
topic | Cloud computing geographic information system (GIS) HBase NoSQL spatial index |
url | https://ieeexplore.ieee.org/document/8736364/ |
work_keys_str_mv | AT dongfangzhang improvingnosqlstorageschemabasedonzcurveforspatialvectordata AT yongwang improvingnosqlstorageschemabasedonzcurveforspatialvectordata AT zhenlingliu improvingnosqlstorageschemabasedonzcurveforspatialvectordata AT shijiedai improvingnosqlstorageschemabasedonzcurveforspatialvectordata |