BERT named entity recognition on emergency response system

Named Entity Recognition (NER) is a natural language processing task to identify pre-defined categories called entities in a given sequence. An existing Emergency Response System is a NER-based application developed to aid call operators by extracting key information from the caller and replacing th...

Full description

Bibliographic Details
Main Author: Chua, Clarita Wyn Kay
Other Authors: Chng Eng Siong
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156612
_version_ 1811688291753263104
author Chua, Clarita Wyn Kay
author2 Chng Eng Siong
author_facet Chng Eng Siong
Chua, Clarita Wyn Kay
author_sort Chua, Clarita Wyn Kay
collection NTU
description Named Entity Recognition (NER) is a natural language processing task to identify pre-defined categories called entities in a given sequence. An existing Emergency Response System is a NER-based application developed to aid call operators by extracting key information from the caller and replacing the need for manual insertion by call operators into the command control system. This paper proposes the improvement of the NER model in the Emergency Response System by including medical and covid-related entities through finetuning and training the different BERT variant models onto a COVID dataset and General Emergency Response dataset. To aid in the development and deployment of a variety of BERT-based models, the paper also introduces an automated NER pipeline with modules to prepare data and run the NER model. This paper also explores the benefits of data augmentation from PEGASUS with experiments conducted on augmented and non-augmented datasets to obtain the best baseline NER model for each respective dataset. From the experiments, we have found that roBERTa is the best baseline model for original datasets, but performed around 10% lower on augmented datasets. Additionally, DistilBERT and BERT NER models have shown significant improvement within a range of 3- 7% in their performance after data augmentation.
first_indexed 2024-10-01T05:29:53Z
format Final Year Project (FYP)
id ntu-10356/156612
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:29:53Z
publishDate 2022
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1566122022-04-21T05:02:37Z BERT named entity recognition on emergency response system Chua, Clarita Wyn Kay Chng Eng Siong School of Computer Science and Engineering ASESChng@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Document and text processing Named Entity Recognition (NER) is a natural language processing task to identify pre-defined categories called entities in a given sequence. An existing Emergency Response System is a NER-based application developed to aid call operators by extracting key information from the caller and replacing the need for manual insertion by call operators into the command control system. This paper proposes the improvement of the NER model in the Emergency Response System by including medical and covid-related entities through finetuning and training the different BERT variant models onto a COVID dataset and General Emergency Response dataset. To aid in the development and deployment of a variety of BERT-based models, the paper also introduces an automated NER pipeline with modules to prepare data and run the NER model. This paper also explores the benefits of data augmentation from PEGASUS with experiments conducted on augmented and non-augmented datasets to obtain the best baseline NER model for each respective dataset. From the experiments, we have found that roBERTa is the best baseline model for original datasets, but performed around 10% lower on augmented datasets. Additionally, DistilBERT and BERT NER models have shown significant improvement within a range of 3- 7% in their performance after data augmentation. Bachelor of Engineering (Computer Science) 2022-04-21T05:02:37Z 2022-04-21T05:02:37Z 2022 Final Year Project (FYP) Chua, C. W. K. (2022). BERT named entity recognition on emergency response system. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156612 https://hdl.handle.net/10356/156612 en SCSE21-0068 application/pdf Nanyang Technological University
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Chua, Clarita Wyn Kay
BERT named entity recognition on emergency response system
title BERT named entity recognition on emergency response system
title_full BERT named entity recognition on emergency response system
title_fullStr BERT named entity recognition on emergency response system
title_full_unstemmed BERT named entity recognition on emergency response system
title_short BERT named entity recognition on emergency response system
title_sort bert named entity recognition on emergency response system
topic Engineering::Computer science and engineering::Computing methodologies::Document and text processing
url https://hdl.handle.net/10356/156612
work_keys_str_mv AT chuaclaritawynkay bertnamedentityrecognitiononemergencyresponsesystem