Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm

Online media has become an essential part of everyday life in modern society. Everyone or organization is free to share their opinions and feelings about any topic on it, including information or news about commodity price fluctuations. Commodity price data from the National Strategic Price Informat...

Full description

Bibliographic Details
Main Authors: Nurcahyawati, Vivine, Mustaffa, Zuriani
Format: Conference or Workshop Item
Language:English
English
Published: IEEE 2020
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/36781/1/Online%20media%20as%20a%20price%20monitor%20-%20Text%20analysis%20.pdf
http://umpir.ump.edu.my/id/eprint/36781/2/Online%20Media%20as%20a%20Price%20Monitor%20-%20Text%20Analysis_FULL.pdf
_version_ 1825814811327332352
author Nurcahyawati, Vivine
Mustaffa, Zuriani
author_facet Nurcahyawati, Vivine
Mustaffa, Zuriani
author_sort Nurcahyawati, Vivine
collection UMP
description Online media has become an essential part of everyday life in modern society. Everyone or organization is free to share their opinions and feelings about any topic on it, including information or news about commodity price fluctuations. Commodity price data from the National Strategic Price Information Center (NSPIC) website is not real-time, so it is not sufficient as a basis for monitoring commodity price fluctuations. Meanwhile, the government needs to collect data and information quickly about these price fluctuations, hence immediately strategic decisions and policies can be made to stabilize the prices. This study explores the potential function of online media by extracting the text in it and analyzing text so that it can display the commodity price data sought. The commodities used as search keywords were commodities that had the highest consumption level in 2016 in Indonesia. The texts analyzed were taken from three online media, namely Twitter, Liputan6.com, and Detik.com. It was analyzed using text extraction techniques and the application of the Jaro-Winkler algorithm to find commodity prices in the text collection. Then compare the results of text analysis with commodity prices from the NSPIC website. The experimental data were 99,007 with a data collection time of three months. From only 122 data that match the keywords, it consists of 100 training data and 22 testing data. The results of the text analysis show that the text from the Detik.com website shows the commodity prices closest to the price data from the NSPIC, while Twitter shows the farthest results. The accuracy test with the confusion matrix is 75%. Based on this research, online media texts are a viable source for monitoring commodity price fluctuations.
first_indexed 2024-03-06T13:04:06Z
format Conference or Workshop Item
id UMPir36781
institution Universiti Malaysia Pahang
language English
English
last_indexed 2024-03-06T13:04:06Z
publishDate 2020
publisher IEEE
record_format dspace
spelling UMPir367812023-01-25T02:15:41Z http://umpir.ump.edu.my/id/eprint/36781/ Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm Nurcahyawati, Vivine Mustaffa, Zuriani QA76 Computer software T Technology (General) Online media has become an essential part of everyday life in modern society. Everyone or organization is free to share their opinions and feelings about any topic on it, including information or news about commodity price fluctuations. Commodity price data from the National Strategic Price Information Center (NSPIC) website is not real-time, so it is not sufficient as a basis for monitoring commodity price fluctuations. Meanwhile, the government needs to collect data and information quickly about these price fluctuations, hence immediately strategic decisions and policies can be made to stabilize the prices. This study explores the potential function of online media by extracting the text in it and analyzing text so that it can display the commodity price data sought. The commodities used as search keywords were commodities that had the highest consumption level in 2016 in Indonesia. The texts analyzed were taken from three online media, namely Twitter, Liputan6.com, and Detik.com. It was analyzed using text extraction techniques and the application of the Jaro-Winkler algorithm to find commodity prices in the text collection. Then compare the results of text analysis with commodity prices from the NSPIC website. The experimental data were 99,007 with a data collection time of three months. From only 122 data that match the keywords, it consists of 100 training data and 22 testing data. The results of the text analysis show that the text from the Detik.com website shows the commodity prices closest to the price data from the NSPIC, while Twitter shows the farthest results. The accuracy test with the confusion matrix is 75%. Based on this research, online media texts are a viable source for monitoring commodity price fluctuations. IEEE 2020 Conference or Workshop Item PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/36781/1/Online%20media%20as%20a%20price%20monitor%20-%20Text%20analysis%20.pdf pdf en http://umpir.ump.edu.my/id/eprint/36781/2/Online%20Media%20as%20a%20Price%20Monitor%20-%20Text%20Analysis_FULL.pdf Nurcahyawati, Vivine and Mustaffa, Zuriani (2020) Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm. In: 2020 International Conference on Emerging Technology in Computing, Communication and Electronics, ETCCE 2020 , 21 - 22 December 2020 , Virtual, Dhaka. pp. 1-6. (167272). ISBN 978-166541962-8 (Published) https://doi.org/ 10.1109/ETCCE51779.2020.9350898
spellingShingle QA76 Computer software
T Technology (General)
Nurcahyawati, Vivine
Mustaffa, Zuriani
Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm
title Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm
title_full Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm
title_fullStr Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm
title_full_unstemmed Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm
title_short Online media as a price monitor: Text analysis using text extraction technique and jaro-winkler similarity algorithm
title_sort online media as a price monitor text analysis using text extraction technique and jaro winkler similarity algorithm
topic QA76 Computer software
T Technology (General)
url http://umpir.ump.edu.my/id/eprint/36781/1/Online%20media%20as%20a%20price%20monitor%20-%20Text%20analysis%20.pdf
http://umpir.ump.edu.my/id/eprint/36781/2/Online%20Media%20as%20a%20Price%20Monitor%20-%20Text%20Analysis_FULL.pdf
work_keys_str_mv AT nurcahyawativivine onlinemediaasapricemonitortextanalysisusingtextextractiontechniqueandjarowinklersimilarityalgorithm
AT mustaffazuriani onlinemediaasapricemonitortextanalysisusingtextextractiontechniqueandjarowinklersimilarityalgorithm