Hashtag and highest scored terms for expanding query
Communicating in short messages, such as through microblogs, is becoming increasingly popular. Twitter (https://twitter.com) supports microblogs and lets users retrieve them. Retrieving Twitter documents requires specific strategies because of their specific characteristics. One new strategy for im...
Main Authors: | Wibowo, Wahyu Catur; Widodo, Widodo |
---|---|
Format: | Article |
Language: | English |
Published: | Universiti Utara Malaysia 2017 |
Subjects: | QA75 Electronic computers. Computer science |
Online Access: | https://repo.uum.edu.my/id/eprint/24057/1/JICT%2016%20%201%202017%20121-135.pdf |
_version_ | 1825805038783561728 |
---|---|
author | Wibowo, Wahyu Catur; Widodo, Widodo |
author_facet | Wibowo, Wahyu Catur; Widodo, Widodo |
author_sort | Wibowo, Wahyu Catur |
collection | UUM |
description | Communicating in short messages, such as through microblogs, is becoming increasingly popular. Twitter (https://twitter.com) supports microblogs and lets users retrieve them. Retrieving Twitter documents requires specific strategies because of their specific characteristics. One new strategy for improving the effectiveness of Twitter document retrieval is query expansion. This paper examines query expansion in Twitter document retrieval using hashtags. We compared the effectiveness of query expansion in four scenarios: a baseline with no query expansion, expansion with the highest-scored term by term frequency-inverse document frequency (tf-idf), expansion with the most frequently occurring hashtag, and a combination of the highest-scored term and the most frequent hashtag. The results show that combining the highest-scored tf-idf term with the most frequent hashtag retrieves relevant documents better than the baseline. |
first_indexed | 2024-07-04T06:25:22Z |
format | Article |
id | uum-24057 |
institution | Universiti Utara Malaysia |
language | English |
last_indexed | 2024-07-04T06:25:22Z |
publishDate | 2017 |
publisher | Universiti Utara Malaysia |
record_format | eprints |
spelling | uum-24057 2018-04-29T01:43:01Z https://repo.uum.edu.my/id/eprint/24057/ Hashtag and highest scored terms for expanding query Wibowo, Wahyu Catur; Widodo, Widodo QA75 Electronic computers. Computer science Communicating in short messages, such as through microblogs, is becoming increasingly popular. Twitter (https://twitter.com) supports microblogs and lets users retrieve them. Retrieving Twitter documents requires specific strategies because of their specific characteristics. One new strategy for improving the effectiveness of Twitter document retrieval is query expansion. This paper examines query expansion in Twitter document retrieval using hashtags. We compared the effectiveness of query expansion in four scenarios: a baseline with no query expansion, expansion with the highest-scored term by term frequency-inverse document frequency (tf-idf), expansion with the most frequently occurring hashtag, and a combination of the highest-scored term and the most frequent hashtag. The results show that combining the highest-scored tf-idf term with the most frequent hashtag retrieves relevant documents better than the baseline. Universiti Utara Malaysia 2017 Article PeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/24057/1/JICT%2016%20%201%202017%20121-135.pdf Wibowo, Wahyu Catur and Widodo, Widodo (2017) Hashtag and highest scored terms for expanding query. Journal of Information and Communication Technology (JICT), 16 (1). pp. 121-135. ISSN 1675-414X http://jict.uum.edu.my/index.php/previous-issues/150-journal-of-information-and-communication-technology-jict-vol-16-no-1-june-2017#j3 |
spellingShingle | QA75 Electronic computers. Computer science Wibowo, Wahyu Catur Widodo, Widodo Hashtag and highest scored terms for expanding query |
title | Hashtag and highest scored terms for expanding query |
title_full | Hashtag and highest scored terms for expanding query |
title_fullStr | Hashtag and highest scored terms for expanding query |
title_full_unstemmed | Hashtag and highest scored terms for expanding query |
title_short | Hashtag and highest scored terms for expanding query |
title_sort | hashtag and highest scored terms for expanding query |
topic | QA75 Electronic computers. Computer science |
url | https://repo.uum.edu.my/id/eprint/24057/1/JICT%2016%20%201%202017%20121-135.pdf |
work_keys_str_mv | AT wibowowahyucatur hashtagandhighestscoredtermsforexpandingquery AT widodowidodo hashtagandhighestscoredtermsforexpandingquery |
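
The four scenarios named in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation. Since the record does not give these details, it assumes that expansion terms are drawn from a small set of initially retrieved tweets (pseudo-relevance feedback), that tf-idf uses a smoothed logarithmic idf, and that tokens are split with a simple regular expression. All function names are hypothetical.

```python
import math
import re
from collections import Counter

# Assumption: a token is a run of word characters, optionally prefixed by '#'.
TOKEN_RE = re.compile(r"#?\w+")


def tokenize(text):
    """Lowercase a tweet and split it into word and hashtag tokens."""
    return [t.lower() for t in TOKEN_RE.findall(text)]


def top_tfidf_term(feedback_docs, collection, exclude):
    """Return the non-hashtag feedback term with the highest tf-idf score."""
    n_docs = len(collection)
    doc_freq = Counter()
    for doc in collection:
        doc_freq.update(set(tokenize(doc)))  # count each term once per tweet
    term_freq = Counter()
    for doc in feedback_docs:
        term_freq.update(t for t in tokenize(doc) if not t.startswith("#"))
    # Assumption: smoothed idf, log((N + 1) / (df + 1)), to avoid division by zero.
    scores = {
        term: tf * math.log((n_docs + 1) / (doc_freq[term] + 1))
        for term, tf in term_freq.items()
        if term not in exclude
    }
    return max(scores, key=scores.get) if scores else None


def most_frequent_hashtag(feedback_docs):
    """Return the hashtag that occurs most often in the feedback tweets."""
    counts = Counter(
        t for doc in feedback_docs for t in tokenize(doc) if t.startswith("#")
    )
    return counts.most_common(1)[0][0] if counts else None


def expand_query(query, feedback_docs, collection, mode="combined"):
    """Expand a query under one of: baseline, tfidf, hashtag, combined."""
    if mode not in {"baseline", "tfidf", "hashtag", "combined"}:
        raise ValueError(f"unknown mode: {mode}")
    extra = []
    if mode in {"tfidf", "combined"}:
        extra.append(top_tfidf_term(feedback_docs, collection, set(tokenize(query))))
    if mode in {"hashtag", "combined"}:
        extra.append(most_frequent_hashtag(feedback_docs))
    # Drop missing expansion terms (for example, no hashtag in the feedback set).
    return " ".join([query] + [t for t in extra if t])
```

Calling `expand_query(query, feedback_docs, collection, mode="baseline")` returns the query unchanged, which corresponds to the no-expansion baseline; the other three modes append the tf-idf term, the hashtag, or both.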