Hashtag and highest scored terms for expanding query

Communicating in short messages, such as using micro blogs, was becoming more popular currently. Twitter https://twitter.com supports micro blogs and retrieval of the blogs by users.To retrieve Twitter documents, we need specific strategies due to its specific characteristics.One new strategy for im...

Full description

Bibliographic Details
Main Authors: Wibowo, Wahyu Catur, Widodo, Widodo
Format: Article
Language:English
Published: Universiti Utara Malaysia 2017
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/24057/1/JICT%2016%20%201%202017%20121-135.pdf
Description
Summary:Communicating in short messages, such as using micro blogs, was becoming more popular currently. Twitter https://twitter.com supports micro blogs and retrieval of the blogs by users.To retrieve Twitter documents, we need specific strategies due to its specific characteristics.One new strategy for improving the effectiveness of twitter document retrieval is using the query expansion technique.This paper elaborates query expansion in twitter document retrieval by using the hashtag. We compared the effectiveness of query expansion in four different scenarios: the baseline result using no query expansion, highest scored term in terms of frequency-inverse document frequency (tfidf), maximum hashtag occurance, and combination of the highest scored-term and the maximum hashtag.The results show that the combination of the maximum term in tfidf and the maximum hashtag performs better in retrieving relevant documents than the baseline.