Link blog post to news articles

Detecting events from one or more temporally-ordered stream(s) of documents (e.g. news articles, blog posts) and group these documents based on the events that they describe is one of the goals in Topic Detection and Tracking (TDT). However, most of the existing event detection solutions do not co...

Full description

Bibliographic Details
Main Author: Tan, Wee Beng.
Other Authors: Sun Aixin
Format: Final Year Project (FYP)
Language:English
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/10356/19306
_version_ 1826118193237721088
author Tan, Wee Beng.
author2 Sun Aixin
author_facet Sun Aixin
Tan, Wee Beng.
author_sort Tan, Wee Beng.
collection NTU
description Detecting events from one or more temporally-ordered stream(s) of documents (e.g. news articles, blog posts) and group these documents based on the events that they describe is one of the goals in Topic Detection and Tracking (TDT). However, most of the existing event detection solutions do not consider users’ input (e.g. search engines, blog posts) and group news articles into events which may not be useful to users. In this project, the author studied the approach of query-guided event detection and tracking from two parallel documents streams (news and blog) based on an ongoing research work. This approach takes users’ input into consideration through popular keyword queries and group queries, news articles and blog posts into events. The main focus in the project is to build an annotated dataset using real-world data collected from Google News and Technorati for evaluating the event detection algorithms. A web application was developed to facilitate the tasks of annotating, searching, analyzing and manipulating the dataset. Various software, tools and APIs were explored to aid in the development of a user friendly and interactive web interface.
first_indexed 2024-10-01T04:39:42Z
format Final Year Project (FYP)
id ntu-10356/19306
institution Nanyang Technological University
language English
last_indexed 2024-10-01T04:39:42Z
publishDate 2009
record_format dspace
spelling ntu-10356/193062023-03-03T20:38:35Z Link blog post to news articles Tan, Wee Beng. Sun Aixin School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing Detecting events from one or more temporally-ordered stream(s) of documents (e.g. news articles, blog posts) and group these documents based on the events that they describe is one of the goals in Topic Detection and Tracking (TDT). However, most of the existing event detection solutions do not consider users’ input (e.g. search engines, blog posts) and group news articles into events which may not be useful to users. In this project, the author studied the approach of query-guided event detection and tracking from two parallel documents streams (news and blog) based on an ongoing research work. This approach takes users’ input into consideration through popular keyword queries and group queries, news articles and blog posts into events. The main focus in the project is to build an annotated dataset using real-world data collected from Google News and Technorati for evaluating the event detection algorithms. A web application was developed to facilitate the tasks of annotating, searching, analyzing and manipulating the dataset. Various software, tools and APIs were explored to aid in the development of a user friendly and interactive web interface. Bachelor of Engineering (Computer Science) 2009-12-03T08:12:13Z 2009-12-03T08:12:13Z 2009 2009 Final Year Project (FYP) http://hdl.handle.net/10356/19306 en Nanyang Technological University 89 p. application/pdf
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Tan, Wee Beng.
Link blog post to news articles
title Link blog post to news articles
title_full Link blog post to news articles
title_fullStr Link blog post to news articles
title_full_unstemmed Link blog post to news articles
title_short Link blog post to news articles
title_sort link blog post to news articles
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
url http://hdl.handle.net/10356/19306
work_keys_str_mv AT tanweebeng linkblogposttonewsarticles