Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools

With the increase in usage of machine learning models within many different aspects of customer interactions, it has become very clear that bias detection within associated customer interaction datasets has led to a critical focus on issues such as the identification of bias prior to model building,...

Full description

Bibliographic Details
Main Authors: Andy Donald, Apostolos Galanopoulos, Edward Curry, Emir Munoz, Ihsan Ullah, M. A. Waskow, Maciej Dabrowski, Manan Kalra
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10126086/
_version_ 1797808454709018624
author Andy Donald
Apostolos Galanopoulos
Edward Curry
Emir Munoz
Ihsan Ullah
M. A. Waskow
Maciej Dabrowski
Manan Kalra
author_facet Andy Donald
Apostolos Galanopoulos
Edward Curry
Emir Munoz
Ihsan Ullah
M. A. Waskow
Maciej Dabrowski
Manan Kalra
author_sort Andy Donald
collection DOAJ
description With the increase in usage of machine learning models within many different aspects of customer interactions, it has become very clear that bias detection within associated customer interaction datasets has led to a critical focus on issues such as the identification of bias prior to model building, lack of understanding and transparency within models, and ultimately the prevention of biased predictions or classifications. This has never been more important since the introduction of the EU General Data Protection Regulation (GDPR) and the associated rule of “right of explanation”. In this paper, we survey the state of the art for bias detection, avoidance and mitigation within datasets, and the associated methods and tools available. Our purpose is to establish an understanding of how established customer interaction-based use cases can utilise these techniques. The focus is primarily on tackling the bias in unstructured text data as a pre-process prior to the machine learning model training phase. We hope that this research encourages the further establishment of responsible usage of customer interaction datasets to allow the prevention of bias being introduced into machine learning pipelines and to also allow greater awareness of the potential for further research in this area.
first_indexed 2024-03-13T06:37:45Z
format Article
id doaj.art-140b5c56bca04e7e9e3604fa879262bb
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-13T06:37:45Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-140b5c56bca04e7e9e3604fa879262bb2023-06-08T23:01:24ZengIEEEIEEE Access2169-35362023-01-0111537035371510.1109/ACCESS.2023.327675710126086Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and ToolsAndy Donald0https://orcid.org/0009-0007-3307-2800Apostolos Galanopoulos1Edward Curry2https://orcid.org/0000-0001-8236-6433Emir Munoz3https://orcid.org/0000-0002-0089-8135Ihsan Ullah4https://orcid.org/0000-0002-7964-5199M. A. Waskow5https://orcid.org/0000-0001-7443-2920Maciej Dabrowski6Manan Kalra7Insight SFI Research Center for Data Analytics, University of Galway, Lower Dangan, Galway, IrelandGenesys Cloud Services Inc., Galway, IrelandInsight SFI Research Center for Data Analytics, University of Galway, Lower Dangan, Galway, IrelandGenesys Cloud Services Inc., Galway, IrelandInsight SFI Research Center for Data Analytics, University of Galway, Lower Dangan, Galway, IrelandInsight SFI Research Center for Data Analytics, University of Galway, Lower Dangan, Galway, IrelandGenesys Cloud Services Inc., Galway, IrelandGenesys Cloud Services Inc., Galway, IrelandWith the increase in usage of machine learning models within many different aspects of customer interactions, it has become very clear that bias detection within associated customer interaction datasets has led to a critical focus on issues such as the identification of bias prior to model building, lack of understanding and transparency within models, and ultimately the prevention of biased predictions or classifications. This has never been more important since the introduction of the EU General Data Protection Regulation (GDPR) and the associated rule of “right of explanation”. In this paper, we survey the state of the art for bias detection, avoidance and mitigation within datasets, and the associated methods and tools available. Our purpose is to establish an understanding of how established customer interaction-based use cases can utilise these techniques. The focus is primarily on tackling the bias in unstructured text data as a pre-process prior to the machine learning model training phase. We hope that this research encourages the further establishment of responsible usage of customer interaction datasets to allow the prevention of bias being introduced into machine learning pipelines and to also allow greater awareness of the potential for further research in this area.https://ieeexplore.ieee.org/document/10126086/Bias detectionmachine learningbias evaluationexplainable AI
spellingShingle Andy Donald
Apostolos Galanopoulos
Edward Curry
Emir Munoz
Ihsan Ullah
M. A. Waskow
Maciej Dabrowski
Manan Kalra
Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools
IEEE Access
Bias detection
machine learning
bias evaluation
explainable AI
title Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools
title_full Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools
title_fullStr Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools
title_full_unstemmed Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools
title_short Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools
title_sort bias detection for customer interaction data a survey on datasets methods and tools
topic Bias detection
machine learning
bias evaluation
explainable AI
url https://ieeexplore.ieee.org/document/10126086/
work_keys_str_mv AT andydonald biasdetectionforcustomerinteractiondataasurveyondatasetsmethodsandtools
AT apostolosgalanopoulos biasdetectionforcustomerinteractiondataasurveyondatasetsmethodsandtools
AT edwardcurry biasdetectionforcustomerinteractiondataasurveyondatasetsmethodsandtools
AT emirmunoz biasdetectionforcustomerinteractiondataasurveyondatasetsmethodsandtools
AT ihsanullah biasdetectionforcustomerinteractiondataasurveyondatasetsmethodsandtools
AT mawaskow biasdetectionforcustomerinteractiondataasurveyondatasetsmethodsandtools
AT maciejdabrowski biasdetectionforcustomerinteractiondataasurveyondatasetsmethodsandtools
AT manankalra biasdetectionforcustomerinteractiondataasurveyondatasetsmethodsandtools