Food safety news events classification via a hierarchical transformer model

In light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety a...

Full description

Bibliographic Details
Main Authors: Shufeng Xiong, Wenjie Tian, Vishwash Batra, Xiaobo Fan, Lei Xi, Hebing Liu, Liangliang Liu
Format: Article
Language:English
Published: Elsevier 2023-07-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405844023050144
_version_ 1797771471934717952
author Shufeng Xiong
Wenjie Tian
Vishwash Batra
Xiaobo Fan
Lei Xi
Hebing Liu
Liangliang Liu
author_facet Shufeng Xiong
Wenjie Tian
Vishwash Batra
Xiaobo Fan
Lei Xi
Hebing Liu
Liangliang Liu
author_sort Shufeng Xiong
collection DOAJ
description In light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety alerts based on their respective categories has emerged as a challenging problem for academic research. Given that most food safety-related events in news reports comprise lengthy text, the pre-trained language models currently employed for text analysis are generally limited in their capability to handle long documents. This paper proposes a long-text classification model utilising hierarchical Transformers. We categorise information in long documents into two distinct types: (1) multiple text chunks meeting the length constraint and (2) essential sentences within long documents, such as headings, paragraph start and end sentences, etc. Initially, our proposed model utilises the text chunks as input to the BERT model. Then, it concatenates the output of the BERT model with the important sentences from the document and use them as input to the Transformer model for feature transformation. Finally, we utilise a classifier for food safety news classification. We conducted several comparative experiments with various commonly used text classification models on a dataset constructed from publicly available information on food regulatory websites. Our proposed method outperforms existing methods, establishing itself as the leading approach in terms of performance.
first_indexed 2024-03-12T21:38:05Z
format Article
id doaj.art-056bb4a75194456880668bb4d36788bf
institution Directory Open Access Journal
issn 2405-8440
language English
last_indexed 2024-03-12T21:38:05Z
publishDate 2023-07-01
publisher Elsevier
record_format Article
series Heliyon
spelling doaj.art-056bb4a75194456880668bb4d36788bf2023-07-27T05:57:31ZengElsevierHeliyon2405-84402023-07-0197e17806Food safety news events classification via a hierarchical transformer modelShufeng Xiong0Wenjie Tian1Vishwash Batra2Xiaobo Fan3Lei Xi4Hebing Liu5Liangliang Liu6Henan Agricultural University, Zhengzhou, 450002, China; Corresponding author.Henan Agricultural University, Zhengzhou, 450002, ChinaSchool of Computer Science and Mathematics, Keele University, ST5 5AA, Keele, UKHenan Agricultural University, Zhengzhou, 450002, ChinaHenan Agricultural University, Zhengzhou, 450002, ChinaHenan Agricultural University, Zhengzhou, 450002, ChinaHenan Agricultural University, Zhengzhou, 450002, ChinaIn light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety alerts based on their respective categories has emerged as a challenging problem for academic research. Given that most food safety-related events in news reports comprise lengthy text, the pre-trained language models currently employed for text analysis are generally limited in their capability to handle long documents. This paper proposes a long-text classification model utilising hierarchical Transformers. We categorise information in long documents into two distinct types: (1) multiple text chunks meeting the length constraint and (2) essential sentences within long documents, such as headings, paragraph start and end sentences, etc. Initially, our proposed model utilises the text chunks as input to the BERT model. Then, it concatenates the output of the BERT model with the important sentences from the document and use them as input to the Transformer model for feature transformation. Finally, we utilise a classifier for food safety news classification. We conducted several comparative experiments with various commonly used text classification models on a dataset constructed from publicly available information on food regulatory websites. Our proposed method outperforms existing methods, establishing itself as the leading approach in terms of performance.http://www.sciencedirect.com/science/article/pii/S2405844023050144Food safetyBERTTransformerDeep learningMulti-classificationNatural language processing
spellingShingle Shufeng Xiong
Wenjie Tian
Vishwash Batra
Xiaobo Fan
Lei Xi
Hebing Liu
Liangliang Liu
Food safety news events classification via a hierarchical transformer model
Heliyon
Food safety
BERT
Transformer
Deep learning
Multi-classification
Natural language processing
title Food safety news events classification via a hierarchical transformer model
title_full Food safety news events classification via a hierarchical transformer model
title_fullStr Food safety news events classification via a hierarchical transformer model
title_full_unstemmed Food safety news events classification via a hierarchical transformer model
title_short Food safety news events classification via a hierarchical transformer model
title_sort food safety news events classification via a hierarchical transformer model
topic Food safety
BERT
Transformer
Deep learning
Multi-classification
Natural language processing
url http://www.sciencedirect.com/science/article/pii/S2405844023050144
work_keys_str_mv AT shufengxiong foodsafetynewseventsclassificationviaahierarchicaltransformermodel
AT wenjietian foodsafetynewseventsclassificationviaahierarchicaltransformermodel
AT vishwashbatra foodsafetynewseventsclassificationviaahierarchicaltransformermodel
AT xiaobofan foodsafetynewseventsclassificationviaahierarchicaltransformermodel
AT leixi foodsafetynewseventsclassificationviaahierarchicaltransformermodel
AT hebingliu foodsafetynewseventsclassificationviaahierarchicaltransformermodel
AT liangliangliu foodsafetynewseventsclassificationviaahierarchicaltransformermodel