Event detection based on on-line news clustering

In this dissertation, we develop and implementation a news event detection system by using an improved Single-pass incremental clustering algorithm. The objective of our work is to judge whether a current document is talking about the same event as the previous documents. Based on the traditional al...

Full description

Bibliographic Details
Main Author: Zhang, Tiannuo
Other Authors: Mao Kezhi
Format: Thesis
Language:English
Published: 2019
Subjects:
Online Access:http://hdl.handle.net/10356/78629
_version_ 1826124600157667328
author Zhang, Tiannuo
author2 Mao Kezhi
author_facet Mao Kezhi
Zhang, Tiannuo
author_sort Zhang, Tiannuo
collection NTU
description In this dissertation, we develop and implementation a news event detection system by using an improved Single-pass incremental clustering algorithm. The objective of our work is to judge whether a current document is talking about the same event as the previous documents. Based on the traditional algorithm, its real-time and dynamic natures are guaranteed, and the improved algorithm solves the problem that the original algorithm is greatly affected by the input sequence. In addition, the new algorithm also improves the accuracy of topic detection. The improved Single-pass algorithm processes the text data by groups and calculates the similarity by average-link instead of maximum value. The experiment part verified that the improved Single-pass algorithm has great performance on Event Detection, with high accuracy and efficiency.
first_indexed 2024-10-01T06:23:20Z
format Thesis
id ntu-10356/78629
institution Nanyang Technological University
language English
last_indexed 2024-10-01T06:23:20Z
publishDate 2019
record_format dspace
spelling ntu-10356/786292023-07-04T16:22:53Z Event detection based on on-line news clustering Zhang, Tiannuo Mao Kezhi School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering In this dissertation, we develop and implementation a news event detection system by using an improved Single-pass incremental clustering algorithm. The objective of our work is to judge whether a current document is talking about the same event as the previous documents. Based on the traditional algorithm, its real-time and dynamic natures are guaranteed, and the improved algorithm solves the problem that the original algorithm is greatly affected by the input sequence. In addition, the new algorithm also improves the accuracy of topic detection. The improved Single-pass algorithm processes the text data by groups and calculates the similarity by average-link instead of maximum value. The experiment part verified that the improved Single-pass algorithm has great performance on Event Detection, with high accuracy and efficiency. Master of Science (Computer Control and Automation) 2019-06-25T00:53:13Z 2019-06-25T00:53:13Z 2019 Thesis http://hdl.handle.net/10356/78629 en Nanyang Technological University 63 p. application/pdf
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Zhang, Tiannuo
Event detection based on on-line news clustering
title Event detection based on on-line news clustering
title_full Event detection based on on-line news clustering
title_fullStr Event detection based on on-line news clustering
title_full_unstemmed Event detection based on on-line news clustering
title_short Event detection based on on-line news clustering
title_sort event detection based on on line news clustering
topic DRNTU::Engineering::Electrical and electronic engineering
url http://hdl.handle.net/10356/78629
work_keys_str_mv AT zhangtiannuo eventdetectionbasedononlinenewsclustering