Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors Method

Social media platforms generate many opinions, emotions, and views on all public services. Sentiment analysis is used in various institutions, such as universities, the business industry, and politicians. The evaluation process requires some data, both quantitative and qualitative. Researchers only...

Full description

Bibliographic Details
Main Authors:	Jazuli Ahmad, Widowati, Kusumaningrum Retno
Format:	Article
Language:	English
Published:	EDP Sciences 2022-01-01
Series:	E3S Web of Conferences
Subjects:	auto labeling sentiment analysis learning process k-nearest neighbors
Online Access:	https://www.e3s-conferences.org/articles/e3sconf/pdf/2022/26/e3sconf_icenis2022_05001.pdf

_version_	1811179685747359744
author	Jazuli Ahmad Widowati Kusumaningrum Retno
author_facet	Jazuli Ahmad Widowati Kusumaningrum Retno
author_sort	Jazuli Ahmad
collection	DOAJ
description	Social media platforms generate many opinions, emotions, and views on all public services. Sentiment analysis is used in various institutions, such as universities, the business industry, and politicians. The evaluation process requires some data, both quantitative and qualitative. Researchers only focus on quantitative data but ignore qualitative data. The evaluation process given by students in the form of a review is qualitative data that is not structured, so it cannot use conventional methods. Unstructured data requires analysis as well as labeling. The labeling process of large amounts of data is a waste of time and money. Data labeling requires very high accuracy to avoid errors. Accuracy in data labeling is used for the process of classifying, training, and testing data. This study aims to automate data labeling using the K-Nearest Neighbors algorithm method. This labeling process can improve the accuracy of sentiment analysis. The results of the classification method can classify responses from Twitter users and can be used by universities as material for evaluating and assessing higher education services. The results of using a confusion matrix with 1.409 data obtained an accuracy rate of 79.43% with a value of k=15
first_indexed	2024-04-11T06:38:52Z
format	Article
id	doaj.art-0f0679822a9b4dddaaee21f4ed9d0d9c
institution	Directory Open Access Journal
issn	2267-1242
language	English
last_indexed	2024-04-11T06:38:52Z
publishDate	2022-01-01
publisher	EDP Sciences
record_format	Article
series	E3S Web of Conferences
spelling	doaj.art-0f0679822a9b4dddaaee21f4ed9d0d9c2022-12-22T04:39:36ZengEDP SciencesE3S Web of Conferences2267-12422022-01-013590500110.1051/e3sconf/202235905001e3sconf_icenis2022_05001Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors MethodJazuli Ahmad0Widowati1Kusumaningrum Retno2Information System, School of Postgraduate Studies, Diponegoro UniversityDepartment of Mathematics, Faculty of Sains and Mathematics, Diponegoro UniversityDepartment of Informatics, Faculty of Sains and Mathematics, Diponegoro UniversitySocial media platforms generate many opinions, emotions, and views on all public services. Sentiment analysis is used in various institutions, such as universities, the business industry, and politicians. The evaluation process requires some data, both quantitative and qualitative. Researchers only focus on quantitative data but ignore qualitative data. The evaluation process given by students in the form of a review is qualitative data that is not structured, so it cannot use conventional methods. Unstructured data requires analysis as well as labeling. The labeling process of large amounts of data is a waste of time and money. Data labeling requires very high accuracy to avoid errors. Accuracy in data labeling is used for the process of classifying, training, and testing data. This study aims to automate data labeling using the K-Nearest Neighbors algorithm method. This labeling process can improve the accuracy of sentiment analysis. The results of the classification method can classify responses from Twitter users and can be used by universities as material for evaluating and assessing higher education services. The results of using a confusion matrix with 1.409 data obtained an accuracy rate of 79.43% with a value of k=15https://www.e3s-conferences.org/articles/e3sconf/pdf/2022/26/e3sconf_icenis2022_05001.pdfauto labelingsentiment analysislearning processk-nearest neighbors
spellingShingle	Jazuli Ahmad Widowati Kusumaningrum Retno Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors Method E3S Web of Conferences auto labeling sentiment analysis learning process k-nearest neighbors
title	Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors Method
title_full	Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors Method
title_fullStr	Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors Method
title_full_unstemmed	Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors Method
title_short	Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors Method
title_sort	auto labeling to increase aspect based sentiment analysis using k nearest neighbors method
topic	auto labeling sentiment analysis learning process k-nearest neighbors
url	https://www.e3s-conferences.org/articles/e3sconf/pdf/2022/26/e3sconf_icenis2022_05001.pdf
work_keys_str_mv	AT jazuliahmad autolabelingtoincreaseaspectbasedsentimentanalysisusingknearestneighborsmethod AT widowati autolabelingtoincreaseaspectbasedsentimentanalysisusingknearestneighborsmethod AT kusumaningrumretno autolabelingtoincreaseaspectbasedsentimentanalysisusingknearestneighborsmethod

Auto Labeling to Increase Aspect-Based Sentiment Analysis Using K-Nearest Neighbors Method

Similar Items