Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model

The disposability of clothing has emerged as a critical concern, precipitating waste accumulation due to product quality degradation. Such consequences exert significant pressure on resources and challenge sustainability efforts. In response, this research focuses on empowering clothing companies to...

Full description

Bibliographic Details
Main Authors: Andry Alamsyah, Nadhif Ditertian Girawan
Format: Article
Language:English
Published: MDPI AG 2023-10-01
Series:Big Data and Cognitive Computing
Subjects:
Online Access:https://www.mdpi.com/2504-2289/7/4/168
_version_ 1797382065146036224
author Andry Alamsyah
Nadhif Ditertian Girawan
author_facet Andry Alamsyah
Nadhif Ditertian Girawan
author_sort Andry Alamsyah
collection DOAJ
description The disposability of clothing has emerged as a critical concern, precipitating waste accumulation due to product quality degradation. Such consequences exert significant pressure on resources and challenge sustainability efforts. In response, this research focuses on empowering clothing companies to elevate product excellence by harnessing consumer feedback. Beyond insights, this research extends to sustainability by providing suggestions on refining product quality by improving material handling, gradually mitigating waste production, and cultivating longevity, therefore decreasing discarded clothes. Managing a vast influx of diverse reviews necessitates sophisticated natural language processing (NLP) techniques. Our study introduces a Robustly optimized BERT Pretraining Approach (RoBERTa) model calibrated for multilabel classification and BERTopic for topic modeling. The model adeptly distills vital themes from consumer reviews, exhibiting astounding accuracy in projecting concerns across various dimensions of clothing quality. NLP’s potential lies in endowing companies with insights into consumer review, augmented by the BERTopic to facilitate immersive exploration of harvested review topics. This research presents a thorough case for integrating machine learning to foster sustainability and waste reduction. The contribution of this research is notable for its integration of RoBERTa and BERTopic in multilabel classification tasks and topic modeling in the fashion industry. The results indicate that the RoBERTa model exhibits remarkable performance, as demonstrated by its macro-averaged F1 score of 0.87 and micro-averaged F1 score of 0.87. Likewise, BERTopic achieves a coherence score of 0.67, meaning the model can form an insightful topic.
first_indexed 2024-03-08T21:00:05Z
format Article
id doaj.art-d117b0f62fd64283a3fb5b055f296e88
institution Directory Open Access Journal
issn 2504-2289
language English
last_indexed 2024-03-08T21:00:05Z
publishDate 2023-10-01
publisher MDPI AG
record_format Article
series Big Data and Cognitive Computing
spelling doaj.art-d117b0f62fd64283a3fb5b055f296e882023-12-22T13:53:32ZengMDPI AGBig Data and Cognitive Computing2504-22892023-10-017416810.3390/bdcc7040168Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language ModelAndry Alamsyah0Nadhif Ditertian Girawan1School of Economics and Business, Telkom University, Bandung 40257, IndonesiaSchool of Economics and Business, Telkom University, Bandung 40257, IndonesiaThe disposability of clothing has emerged as a critical concern, precipitating waste accumulation due to product quality degradation. Such consequences exert significant pressure on resources and challenge sustainability efforts. In response, this research focuses on empowering clothing companies to elevate product excellence by harnessing consumer feedback. Beyond insights, this research extends to sustainability by providing suggestions on refining product quality by improving material handling, gradually mitigating waste production, and cultivating longevity, therefore decreasing discarded clothes. Managing a vast influx of diverse reviews necessitates sophisticated natural language processing (NLP) techniques. Our study introduces a Robustly optimized BERT Pretraining Approach (RoBERTa) model calibrated for multilabel classification and BERTopic for topic modeling. The model adeptly distills vital themes from consumer reviews, exhibiting astounding accuracy in projecting concerns across various dimensions of clothing quality. NLP’s potential lies in endowing companies with insights into consumer review, augmented by the BERTopic to facilitate immersive exploration of harvested review topics. This research presents a thorough case for integrating machine learning to foster sustainability and waste reduction. The contribution of this research is notable for its integration of RoBERTa and BERTopic in multilabel classification tasks and topic modeling in the fashion industry. The results indicate that the RoBERTa model exhibits remarkable performance, as demonstrated by its macro-averaged F1 score of 0.87 and micro-averaged F1 score of 0.87. Likewise, BERTopic achieves a coherence score of 0.67, meaning the model can form an insightful topic.https://www.mdpi.com/2504-2289/7/4/168big datamultilabel classificationnatural language processingsustainability
spellingShingle Andry Alamsyah
Nadhif Ditertian Girawan
Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model
Big Data and Cognitive Computing
big data
multilabel classification
natural language processing
sustainability
title Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model
title_full Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model
title_fullStr Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model
title_full_unstemmed Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model
title_short Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model
title_sort improving clothing product quality and reducing waste based on consumer review using roberta and bertopic language model
topic big data
multilabel classification
natural language processing
sustainability
url https://www.mdpi.com/2504-2289/7/4/168
work_keys_str_mv AT andryalamsyah improvingclothingproductqualityandreducingwastebasedonconsumerreviewusingrobertaandbertopiclanguagemodel
AT nadhifditertiangirawan improvingclothingproductqualityandreducingwastebasedonconsumerreviewusingrobertaandbertopiclanguagemodel