Unsupervised spatial urban data representation learning

Urban environments are complex ecosystems comprising diverse spatial entities such as regions, road networks, and points-of-interest (POIs), which fundamentally shape our urban infrastructure and societal interactions. Understanding these spatial entities is crucial for various urban applications, f...

Full description

Bibliographic Details
Main Author: Zhang, Liang
Other Authors: Long Cheng
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/179599
_version_ 1811692557750501376
author Zhang, Liang
author2 Long Cheng
author_facet Long Cheng
Zhang, Liang
author_sort Zhang, Liang
collection NTU
description Urban environments are complex ecosystems comprising diverse spatial entities such as regions, road networks, and points-of-interest (POIs), which fundamentally shape our urban infrastructure and societal interactions. Understanding these spatial entities is crucial for various urban applications, from optimizing transportation networks to informing urban planning and management decisions. In this regard, traditional methods relying on labor-intensive field surveys and questionnaires are gradually transformed by urban computing methods through integrating advanced machine learning techniques. However, these methods often focus on single urban application and require extensive labeled data as supervision, limiting their real-world applicability. Leveraging the wealth of unlabeled urban data available from sources like OpenStreetMap (OSM), this thesis proposes to develop pretraining methods specifically tailored for unlabeled urban data and investigate spatial entity representation learning problem within this context, aiming to address the scarcity of labeled urban data and facilitate the application of learned representations across diverse urban challenges. Specifically, in this thesis, we focus on three essential spatial entities in urban environments, namely region, road network, and points-of-interest (POI). Effective representation learning of urban regions from the widespread multi-view urban data is crucial for various applications. However, existing methods often struggle with extracting representations within each single view and integrating them across multiple views effectively. In response to these challenges, we propose ReMVC (Region Embedding with Multi-View Contrastive Learning), a novel framework for region representation learning. ReMVC employs a hierarchical contrastive learning framework to extract representations within each view and fuse representations across views. Specifically, it emphasizes the comparison of different regions within each view to capture distinctive properties and promotes cross-view correlations by comparing the same region across different views. Experimental results demonstrate the effectiveness of ReMVC in two urban applications. Road network serves as a foundational infrastructure supporting various real-life applications, yet raw road network data cannot be directly utilized by machine learning models designed for these tasks. Accordingly, road network representation learning (RNRL) is studied by learning vector representations of roads. Despite recent successes in adopting graph neural networks (GNNs) for RNRL, these models still face critical challenges in capturing high-order and long-range relationships between roads. To tackle these challenges, we introduce HyperRoad (Hyper}graph-Oriented Road network representation), a novel framework that leverages the power of hypergraphs for road network modeling. HyperRoad addresses the limitations of existing models by performing hypergraph-based dual-channel aggregation to capture high-order relationships among roads, as well as designing a novel hyperedge classification task to capture long-range relationships. Experimental results demonstrate that HyperRoad achieves impressive improvements compared to existing baselines across five applications. The proliferation of location-based services has revolutionized user experiences, with recommendation systems playing a pivotal role in suggesting users Points-of-Interest (POIs). Despite their success, contemporary POI recommendation systems often necessitate extensive user-POI interaction data to learn high-quality POI representations and encounter critical cold start dilemmas. Consequently, learning to generate warm-up representations for these cold start POIs becomes imperative. Existing methods relying on crowdsourcing or adaptation from cold start product recommendation often suffer from low-fidelity sample issues and overlook rich spatial context information associated with POIs. To address these challenges, we propose a novel framework named DiffPOI (Diffusion-Oriented POI representation). DiffPOI treats POI representations as images and employs diffusion models to generate representations conditioned on rich POI semantic and spatial contexts. Experimental results show the superiority of our approach over existing methods, particularly in cold start POI recommendation applications. In summary, this thesis presents several novel methodologies to address urban spatial entity representation learning problems. The proposed solutions have demonstrated improved performance across various scenarios and urban applications, offering valuable tools for analyzing large-scale unlabeled urban data.
first_indexed 2024-10-01T06:37:41Z
format Thesis-Doctor of Philosophy
id ntu-10356/179599
institution Nanyang Technological University
language English
last_indexed 2024-10-01T06:37:41Z
publishDate 2024
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1795992024-09-04T07:56:36Z Unsupervised spatial urban data representation learning Zhang, Liang Long Cheng School of Computer Science and Engineering Data Management and Analytics Lab c.long@ntu.edu.sg Computer and Information Science Data mining Representation learning Urban environments are complex ecosystems comprising diverse spatial entities such as regions, road networks, and points-of-interest (POIs), which fundamentally shape our urban infrastructure and societal interactions. Understanding these spatial entities is crucial for various urban applications, from optimizing transportation networks to informing urban planning and management decisions. In this regard, traditional methods relying on labor-intensive field surveys and questionnaires are gradually transformed by urban computing methods through integrating advanced machine learning techniques. However, these methods often focus on single urban application and require extensive labeled data as supervision, limiting their real-world applicability. Leveraging the wealth of unlabeled urban data available from sources like OpenStreetMap (OSM), this thesis proposes to develop pretraining methods specifically tailored for unlabeled urban data and investigate spatial entity representation learning problem within this context, aiming to address the scarcity of labeled urban data and facilitate the application of learned representations across diverse urban challenges. Specifically, in this thesis, we focus on three essential spatial entities in urban environments, namely region, road network, and points-of-interest (POI). Effective representation learning of urban regions from the widespread multi-view urban data is crucial for various applications. However, existing methods often struggle with extracting representations within each single view and integrating them across multiple views effectively. In response to these challenges, we propose ReMVC (Region Embedding with Multi-View Contrastive Learning), a novel framework for region representation learning. ReMVC employs a hierarchical contrastive learning framework to extract representations within each view and fuse representations across views. Specifically, it emphasizes the comparison of different regions within each view to capture distinctive properties and promotes cross-view correlations by comparing the same region across different views. Experimental results demonstrate the effectiveness of ReMVC in two urban applications. Road network serves as a foundational infrastructure supporting various real-life applications, yet raw road network data cannot be directly utilized by machine learning models designed for these tasks. Accordingly, road network representation learning (RNRL) is studied by learning vector representations of roads. Despite recent successes in adopting graph neural networks (GNNs) for RNRL, these models still face critical challenges in capturing high-order and long-range relationships between roads. To tackle these challenges, we introduce HyperRoad (Hyper}graph-Oriented Road network representation), a novel framework that leverages the power of hypergraphs for road network modeling. HyperRoad addresses the limitations of existing models by performing hypergraph-based dual-channel aggregation to capture high-order relationships among roads, as well as designing a novel hyperedge classification task to capture long-range relationships. Experimental results demonstrate that HyperRoad achieves impressive improvements compared to existing baselines across five applications. The proliferation of location-based services has revolutionized user experiences, with recommendation systems playing a pivotal role in suggesting users Points-of-Interest (POIs). Despite their success, contemporary POI recommendation systems often necessitate extensive user-POI interaction data to learn high-quality POI representations and encounter critical cold start dilemmas. Consequently, learning to generate warm-up representations for these cold start POIs becomes imperative. Existing methods relying on crowdsourcing or adaptation from cold start product recommendation often suffer from low-fidelity sample issues and overlook rich spatial context information associated with POIs. To address these challenges, we propose a novel framework named DiffPOI (Diffusion-Oriented POI representation). DiffPOI treats POI representations as images and employs diffusion models to generate representations conditioned on rich POI semantic and spatial contexts. Experimental results show the superiority of our approach over existing methods, particularly in cold start POI recommendation applications. In summary, this thesis presents several novel methodologies to address urban spatial entity representation learning problems. The proposed solutions have demonstrated improved performance across various scenarios and urban applications, offering valuable tools for analyzing large-scale unlabeled urban data. Doctor of Philosophy 2024-08-13T01:13:40Z 2024-08-13T01:13:40Z 2024 Thesis-Doctor of Philosophy Zhang, L. (2024). Unsupervised spatial urban data representation learning. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/179599 https://hdl.handle.net/10356/179599 10.32657/10356/179599 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
spellingShingle Computer and Information Science
Data mining
Representation learning
Zhang, Liang
Unsupervised spatial urban data representation learning
title Unsupervised spatial urban data representation learning
title_full Unsupervised spatial urban data representation learning
title_fullStr Unsupervised spatial urban data representation learning
title_full_unstemmed Unsupervised spatial urban data representation learning
title_short Unsupervised spatial urban data representation learning
title_sort unsupervised spatial urban data representation learning
topic Computer and Information Science
Data mining
Representation learning
url https://hdl.handle.net/10356/179599
work_keys_str_mv AT zhangliang unsupervisedspatialurbandatarepresentationlearning