Food safety news events classification via a hierarchical transformer model
In light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety a...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2023-07-01
|
Series: | Heliyon |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2405844023050144 |
_version_ | 1797771471934717952 |
---|---|
author | Shufeng Xiong Wenjie Tian Vishwash Batra Xiaobo Fan Lei Xi Hebing Liu Liangliang Liu |
author_facet | Shufeng Xiong Wenjie Tian Vishwash Batra Xiaobo Fan Lei Xi Hebing Liu Liangliang Liu |
author_sort | Shufeng Xiong |
collection | DOAJ |
description | In light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety alerts based on their respective categories has emerged as a challenging problem for academic research. Given that most food safety-related events in news reports comprise lengthy text, the pre-trained language models currently employed for text analysis are generally limited in their capability to handle long documents. This paper proposes a long-text classification model utilising hierarchical Transformers. We categorise information in long documents into two distinct types: (1) multiple text chunks meeting the length constraint and (2) essential sentences within long documents, such as headings, paragraph start and end sentences, etc. Initially, our proposed model utilises the text chunks as input to the BERT model. Then, it concatenates the output of the BERT model with the important sentences from the document and use them as input to the Transformer model for feature transformation. Finally, we utilise a classifier for food safety news classification. We conducted several comparative experiments with various commonly used text classification models on a dataset constructed from publicly available information on food regulatory websites. Our proposed method outperforms existing methods, establishing itself as the leading approach in terms of performance. |
first_indexed | 2024-03-12T21:38:05Z |
format | Article |
id | doaj.art-056bb4a75194456880668bb4d36788bf |
institution | Directory Open Access Journal |
issn | 2405-8440 |
language | English |
last_indexed | 2024-03-12T21:38:05Z |
publishDate | 2023-07-01 |
publisher | Elsevier |
record_format | Article |
series | Heliyon |
spelling | doaj.art-056bb4a75194456880668bb4d36788bf2023-07-27T05:57:31ZengElsevierHeliyon2405-84402023-07-0197e17806Food safety news events classification via a hierarchical transformer modelShufeng Xiong0Wenjie Tian1Vishwash Batra2Xiaobo Fan3Lei Xi4Hebing Liu5Liangliang Liu6Henan Agricultural University, Zhengzhou, 450002, China; Corresponding author.Henan Agricultural University, Zhengzhou, 450002, ChinaSchool of Computer Science and Mathematics, Keele University, ST5 5AA, Keele, UKHenan Agricultural University, Zhengzhou, 450002, ChinaHenan Agricultural University, Zhengzhou, 450002, ChinaHenan Agricultural University, Zhengzhou, 450002, ChinaHenan Agricultural University, Zhengzhou, 450002, ChinaIn light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety alerts based on their respective categories has emerged as a challenging problem for academic research. Given that most food safety-related events in news reports comprise lengthy text, the pre-trained language models currently employed for text analysis are generally limited in their capability to handle long documents. This paper proposes a long-text classification model utilising hierarchical Transformers. We categorise information in long documents into two distinct types: (1) multiple text chunks meeting the length constraint and (2) essential sentences within long documents, such as headings, paragraph start and end sentences, etc. Initially, our proposed model utilises the text chunks as input to the BERT model. Then, it concatenates the output of the BERT model with the important sentences from the document and use them as input to the Transformer model for feature transformation. Finally, we utilise a classifier for food safety news classification. We conducted several comparative experiments with various commonly used text classification models on a dataset constructed from publicly available information on food regulatory websites. Our proposed method outperforms existing methods, establishing itself as the leading approach in terms of performance.http://www.sciencedirect.com/science/article/pii/S2405844023050144Food safetyBERTTransformerDeep learningMulti-classificationNatural language processing |
spellingShingle | Shufeng Xiong Wenjie Tian Vishwash Batra Xiaobo Fan Lei Xi Hebing Liu Liangliang Liu Food safety news events classification via a hierarchical transformer model Heliyon Food safety BERT Transformer Deep learning Multi-classification Natural language processing |
title | Food safety news events classification via a hierarchical transformer model |
title_full | Food safety news events classification via a hierarchical transformer model |
title_fullStr | Food safety news events classification via a hierarchical transformer model |
title_full_unstemmed | Food safety news events classification via a hierarchical transformer model |
title_short | Food safety news events classification via a hierarchical transformer model |
title_sort | food safety news events classification via a hierarchical transformer model |
topic | Food safety BERT Transformer Deep learning Multi-classification Natural language processing |
url | http://www.sciencedirect.com/science/article/pii/S2405844023050144 |
work_keys_str_mv | AT shufengxiong foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT wenjietian foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT vishwashbatra foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT xiaobofan foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT leixi foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT hebingliu foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT liangliangliu foodsafetynewseventsclassificationviaahierarchicaltransformermodel |