Fraction-score : a new support measure for co-location pattern mining

Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure...

Full description

Bibliographic Details
Main Author: Koh, Wei Hao
Other Authors: Cheng Long
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/137975
_version_ 1811696706391113728
author Koh, Wei Hao
author2 Cheng Long
author_facet Cheng Long
Koh, Wei Hao
author_sort Koh, Wei Hao
collection NTU
description Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure that appraise the popularity or prevalence of a pattern candidate or a label set. Some support measures exist to define the co location patterns and they share an idea of counting the number of instances of a given set C as i ts support, where an instance of C is an object whose objects carry all the labels in C and are located close to one another. However, these support measures suffer from various weaknesses, some will fail to notice all possible instances while the other support measures will ignore some cases when multiple instances overlap . In this FYP , a new support measure will be proposed which is called the Fraction Score. Its main idea is to count instances fractionally if they overlap. Compared to the existing support measures, Fraction Score not only capture all possible instances, but also handles the cases where instances overlap appropriately so that the supports defined are more meaningful and consistent with the desirable anti-monotonicity property . Efficient algorithms have been developed which are significantly faster than a baseline that adapts the state of art to solve the co location pattern mining problem based on Fraction Score. Extensive experiments have been conducted as well using both real and synthetic datasets, which verified the outstanding performance of the Fraction Score and the efficiency of the developed algorithms.
first_indexed 2024-10-01T07:43:37Z
format Final Year Project (FYP)
id ntu-10356/137975
institution Nanyang Technological University
language English
last_indexed 2024-10-01T07:43:37Z
publishDate 2020
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1379752020-04-21T00:21:41Z Fraction-score : a new support measure for co-location pattern mining Koh, Wei Hao Cheng Long School of Computer Science and Engineering c.long@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure that appraise the popularity or prevalence of a pattern candidate or a label set. Some support measures exist to define the co location patterns and they share an idea of counting the number of instances of a given set C as i ts support, where an instance of C is an object whose objects carry all the labels in C and are located close to one another. However, these support measures suffer from various weaknesses, some will fail to notice all possible instances while the other support measures will ignore some cases when multiple instances overlap . In this FYP , a new support measure will be proposed which is called the Fraction Score. Its main idea is to count instances fractionally if they overlap. Compared to the existing support measures, Fraction Score not only capture all possible instances, but also handles the cases where instances overlap appropriately so that the supports defined are more meaningful and consistent with the desirable anti-monotonicity property . Efficient algorithms have been developed which are significantly faster than a baseline that adapts the state of art to solve the co location pattern mining problem based on Fraction Score. Extensive experiments have been conducted as well using both real and synthetic datasets, which verified the outstanding performance of the Fraction Score and the efficiency of the developed algorithms. Bachelor of Engineering (Computer Science) 2020-04-21T00:16:47Z 2020-04-21T00:16:47Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/137975 en SCSE19-0030 application/pdf Nanyang Technological University
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Koh, Wei Hao
Fraction-score : a new support measure for co-location pattern mining
title Fraction-score : a new support measure for co-location pattern mining
title_full Fraction-score : a new support measure for co-location pattern mining
title_fullStr Fraction-score : a new support measure for co-location pattern mining
title_full_unstemmed Fraction-score : a new support measure for co-location pattern mining
title_short Fraction-score : a new support measure for co-location pattern mining
title_sort fraction score a new support measure for co location pattern mining
topic Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
url https://hdl.handle.net/10356/137975
work_keys_str_mv AT kohweihao fractionscoreanewsupportmeasureforcolocationpatternmining