Event detection based on on-line news clustering
In this dissertation, we develop and implementation a news event detection system by using an improved Single-pass incremental clustering algorithm. The objective of our work is to judge whether a current document is talking about the same event as the previous documents. Based on the traditional al...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/78629 |
_version_ | 1826124600157667328 |
---|---|
author | Zhang, Tiannuo |
author2 | Mao Kezhi |
author_facet | Mao Kezhi Zhang, Tiannuo |
author_sort | Zhang, Tiannuo |
collection | NTU |
description | In this dissertation, we develop and implementation a news event detection system by using an improved Single-pass incremental clustering algorithm. The objective of our work is to judge whether a current document is talking about the same event as the previous documents. Based on the traditional algorithm, its real-time and dynamic natures are guaranteed, and the improved algorithm solves the problem that the original algorithm is greatly affected by the input sequence. In addition, the new algorithm also improves the accuracy of topic detection. The improved Single-pass algorithm processes the text data by groups and calculates the similarity by average-link instead of maximum value. The experiment part verified that the improved Single-pass algorithm has great performance on Event Detection, with high accuracy and efficiency. |
first_indexed | 2024-10-01T06:23:20Z |
format | Thesis |
id | ntu-10356/78629 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2024-10-01T06:23:20Z |
publishDate | 2019 |
record_format | dspace |
spelling | ntu-10356/786292023-07-04T16:22:53Z Event detection based on on-line news clustering Zhang, Tiannuo Mao Kezhi School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering In this dissertation, we develop and implementation a news event detection system by using an improved Single-pass incremental clustering algorithm. The objective of our work is to judge whether a current document is talking about the same event as the previous documents. Based on the traditional algorithm, its real-time and dynamic natures are guaranteed, and the improved algorithm solves the problem that the original algorithm is greatly affected by the input sequence. In addition, the new algorithm also improves the accuracy of topic detection. The improved Single-pass algorithm processes the text data by groups and calculates the similarity by average-link instead of maximum value. The experiment part verified that the improved Single-pass algorithm has great performance on Event Detection, with high accuracy and efficiency. Master of Science (Computer Control and Automation) 2019-06-25T00:53:13Z 2019-06-25T00:53:13Z 2019 Thesis http://hdl.handle.net/10356/78629 en Nanyang Technological University 63 p. application/pdf |
spellingShingle | DRNTU::Engineering::Electrical and electronic engineering Zhang, Tiannuo Event detection based on on-line news clustering |
title | Event detection based on on-line news clustering |
title_full | Event detection based on on-line news clustering |
title_fullStr | Event detection based on on-line news clustering |
title_full_unstemmed | Event detection based on on-line news clustering |
title_short | Event detection based on on-line news clustering |
title_sort | event detection based on on line news clustering |
topic | DRNTU::Engineering::Electrical and electronic engineering |
url | http://hdl.handle.net/10356/78629 |
work_keys_str_mv | AT zhangtiannuo eventdetectionbasedononlinenewsclustering |