A general-purpose framework for parallel processing of large-scale LiDAR data

Light detection and ranging (LiDAR) data are essential for scientific discoveries such as Earth and ecological sciences, environmental applications, and responding to natural disasters. While collecting LiDAR data over large areas is quite possible the subsequent processing steps typically involve l...

Full description

Bibliographic Details
Main Authors: Zhenlong Li, Michael E. Hodgson, Wenwen Li
Format: Article
Language:English
Published: Taylor & Francis Group 2018-01-01
Series:International Journal of Digital Earth
Subjects:
Online Access:http://dx.doi.org/10.1080/17538947.2016.1269842
_version_ 1797678652651995136
author Zhenlong Li
Michael E. Hodgson
Wenwen Li
author_facet Zhenlong Li
Michael E. Hodgson
Wenwen Li
author_sort Zhenlong Li
collection DOAJ
description Light detection and ranging (LiDAR) data are essential for scientific discoveries such as Earth and ecological sciences, environmental applications, and responding to natural disasters. While collecting LiDAR data over large areas is quite possible the subsequent processing steps typically involve large computational demands. Efficiently storing, managing, and processing LiDAR data are the prerequisite steps for enabling these LiDAR-based applications. However, handling LiDAR data poses grand geoprocessing challenges due to data and computational intensity. To tackle such challenges, we developed a general-purpose scalable framework coupled with a sophisticated data decomposition and parallelization strategy to efficiently handle ‘big’ LiDAR data collections. The contributions of this research were (1) a tile-based spatial index to manage big LiDAR data in the scalable and fault-tolerable Hadoop distributed file system, (2) two spatial decomposition techniques to enable efficient parallelization of different types of LiDAR processing tasks, and (3) by coupling existing LiDAR processing tools with Hadoop, a variety of LiDAR data processing tasks can be conducted in parallel in a highly scalable distributed computing environment using an online geoprocessing application. A proof-of-concept prototype is presented here to demonstrate the feasibility, performance, and scalability of the proposed framework.
first_indexed 2024-03-11T23:03:03Z
format Article
id doaj.art-d97f4b48bf81455bbd2ec0646ce31c0d
institution Directory Open Access Journal
issn 1753-8947
1753-8955
language English
last_indexed 2024-03-11T23:03:03Z
publishDate 2018-01-01
publisher Taylor & Francis Group
record_format Article
series International Journal of Digital Earth
spelling doaj.art-d97f4b48bf81455bbd2ec0646ce31c0d2023-09-21T14:38:05ZengTaylor & Francis GroupInternational Journal of Digital Earth1753-89471753-89552018-01-01111264710.1080/17538947.2016.12698421269842A general-purpose framework for parallel processing of large-scale LiDAR dataZhenlong Li0Michael E. Hodgson1Wenwen Li2University of South CarolinaUniversity of South CarolinaArizona State UniversityLight detection and ranging (LiDAR) data are essential for scientific discoveries such as Earth and ecological sciences, environmental applications, and responding to natural disasters. While collecting LiDAR data over large areas is quite possible the subsequent processing steps typically involve large computational demands. Efficiently storing, managing, and processing LiDAR data are the prerequisite steps for enabling these LiDAR-based applications. However, handling LiDAR data poses grand geoprocessing challenges due to data and computational intensity. To tackle such challenges, we developed a general-purpose scalable framework coupled with a sophisticated data decomposition and parallelization strategy to efficiently handle ‘big’ LiDAR data collections. The contributions of this research were (1) a tile-based spatial index to manage big LiDAR data in the scalable and fault-tolerable Hadoop distributed file system, (2) two spatial decomposition techniques to enable efficient parallelization of different types of LiDAR processing tasks, and (3) by coupling existing LiDAR processing tools with Hadoop, a variety of LiDAR data processing tasks can be conducted in parallel in a highly scalable distributed computing environment using an online geoprocessing application. A proof-of-concept prototype is presented here to demonstrate the feasibility, performance, and scalability of the proposed framework.http://dx.doi.org/10.1080/17538947.2016.1269842big dataonline geoprocessinghadoop mapreducespatial decompositionlastoolsparallel
spellingShingle Zhenlong Li
Michael E. Hodgson
Wenwen Li
A general-purpose framework for parallel processing of large-scale LiDAR data
International Journal of Digital Earth
big data
online geoprocessing
hadoop mapreduce
spatial decomposition
lastools
parallel
title A general-purpose framework for parallel processing of large-scale LiDAR data
title_full A general-purpose framework for parallel processing of large-scale LiDAR data
title_fullStr A general-purpose framework for parallel processing of large-scale LiDAR data
title_full_unstemmed A general-purpose framework for parallel processing of large-scale LiDAR data
title_short A general-purpose framework for parallel processing of large-scale LiDAR data
title_sort general purpose framework for parallel processing of large scale lidar data
topic big data
online geoprocessing
hadoop mapreduce
spatial decomposition
lastools
parallel
url http://dx.doi.org/10.1080/17538947.2016.1269842
work_keys_str_mv AT zhenlongli ageneralpurposeframeworkforparallelprocessingoflargescalelidardata
AT michaelehodgson ageneralpurposeframeworkforparallelprocessingoflargescalelidardata
AT wenwenli ageneralpurposeframeworkforparallelprocessingoflargescalelidardata
AT zhenlongli generalpurposeframeworkforparallelprocessingoflargescalelidardata
AT michaelehodgson generalpurposeframeworkforparallelprocessingoflargescalelidardata
AT wenwenli generalpurposeframeworkforparallelprocessingoflargescalelidardata