Using Machine Learning to Improve Lead Times in the Identification of Emerging Customer Needs

In recent years, computational approaches for automatically extracting the voice of the customer from user generated content have been proposed. These studies have tackled the task of obtaining current customer needs, however, there is a lack of methods that predict future needs (i.e. needs that may...

Full description

Bibliographic Details
Main Authors: David Kilroy, Graham Healy, Simon Caton
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9749255/
_version_ 1818272642540503040
author David Kilroy
Graham Healy
Simon Caton
author_facet David Kilroy
Graham Healy
Simon Caton
author_sort David Kilroy
collection DOAJ
description In recent years, computational approaches for automatically extracting the voice of the customer from user generated content have been proposed. These studies have tackled the task of obtaining current customer needs, however, there is a lack of methods that predict future needs (i.e. needs that may become popular in the marketplace). Therefore, this study presents a multi-document keyphrase extraction algorithm which predicts future customer needs from users’ social media posts on Reddit. Key to our approach is a novel document filtering method (discovering potentially relevant social media content) and a keyphrase ranking method, which promotes terms with rising frequency likely to be future product needs. In order to evaluate the approach, a case study of “toothpaste” needs is reviewed and a novel evaluation approach using ground truth automatically extracted from a collection of future specifications of new-to-market products is proposed. In our evaluation, we show that the approach is significantly better than simple baselines at identifying customer needs on social media before they trend in the marketplace. We also show that our approach can capture important customer needs identified by a large multinational company with lead times of up to 25 months ahead of them trending in the marketplace. The findings of this research could provide many benefits to businesses such as gaining early access into markets ahead of their competitors and giving early notice to manufacturers/engineers/developers before a need for a product is in demand.
first_indexed 2024-12-12T21:45:19Z
format Article
id doaj.art-7babf4ba302c46568a101115066517ec
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-12T21:45:19Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-7babf4ba302c46568a101115066517ec2022-12-22T00:10:55ZengIEEEIEEE Access2169-35362022-01-0110377743779510.1109/ACCESS.2022.31650439749255Using Machine Learning to Improve Lead Times in the Identification of Emerging Customer NeedsDavid Kilroy0https://orcid.org/0000-0003-2571-8038Graham Healy1https://orcid.org/0000-0001-6429-6339Simon Caton2https://orcid.org/0000-0001-9379-3879School of Computer Science, University College Dublin, Dublin 4, IrelandSchool of Computing, Dublin City University, Dublin 9, IrelandSchool of Computer Science, University College Dublin, Dublin 4, IrelandIn recent years, computational approaches for automatically extracting the voice of the customer from user generated content have been proposed. These studies have tackled the task of obtaining current customer needs, however, there is a lack of methods that predict future needs (i.e. needs that may become popular in the marketplace). Therefore, this study presents a multi-document keyphrase extraction algorithm which predicts future customer needs from users’ social media posts on Reddit. Key to our approach is a novel document filtering method (discovering potentially relevant social media content) and a keyphrase ranking method, which promotes terms with rising frequency likely to be future product needs. In order to evaluate the approach, a case study of “toothpaste” needs is reviewed and a novel evaluation approach using ground truth automatically extracted from a collection of future specifications of new-to-market products is proposed. In our evaluation, we show that the approach is significantly better than simple baselines at identifying customer needs on social media before they trend in the marketplace. We also show that our approach can capture important customer needs identified by a large multinational company with lead times of up to 25 months ahead of them trending in the marketplace. The findings of this research could provide many benefits to businesses such as gaining early access into markets ahead of their competitors and giving early notice to manufacturers/engineers/developers before a need for a product is in demand.https://ieeexplore.ieee.org/document/9749255/Machine learningproduct developmentReddittext mining
spellingShingle David Kilroy
Graham Healy
Simon Caton
Using Machine Learning to Improve Lead Times in the Identification of Emerging Customer Needs
IEEE Access
Machine learning
product development
Reddit
text mining
title Using Machine Learning to Improve Lead Times in the Identification of Emerging Customer Needs
title_full Using Machine Learning to Improve Lead Times in the Identification of Emerging Customer Needs
title_fullStr Using Machine Learning to Improve Lead Times in the Identification of Emerging Customer Needs
title_full_unstemmed Using Machine Learning to Improve Lead Times in the Identification of Emerging Customer Needs
title_short Using Machine Learning to Improve Lead Times in the Identification of Emerging Customer Needs
title_sort using machine learning to improve lead times in the identification of emerging customer needs
topic Machine learning
product development
Reddit
text mining
url https://ieeexplore.ieee.org/document/9749255/
work_keys_str_mv AT davidkilroy usingmachinelearningtoimproveleadtimesintheidentificationofemergingcustomerneeds
AT grahamhealy usingmachinelearningtoimproveleadtimesintheidentificationofemergingcustomerneeds
AT simoncaton usingmachinelearningtoimproveleadtimesintheidentificationofemergingcustomerneeds