Information extraction of hazard events

Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency resp...

Full description

Bibliographic Details
Main Author: Lim, Joanna Jia Yi
Other Authors: Mao Kezhi
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/149108
_version_ 1826117950496571392
author Lim, Joanna Jia Yi
author2 Mao Kezhi
author_facet Mao Kezhi
Lim, Joanna Jia Yi
author_sort Lim, Joanna Jia Yi
collection NTU
description Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency response teams to better allocate resources and support to affected areas. In this project, several deep learning models were explored to identify occurrences of information like Deaths, Injury, Location, Date and Time in hazard events related news sentences. News articles of hazard events like attacks, earthquakes, typhoon, hurricanes, road accidents were first identified and filtered using keywords. Next, sentences of interest from these news articles were isolated and labelled to form the hazard events database. The labelled training data is then used to train deep neural network models. Two schemes were explored in this project. In the first scheme, one single model was trained to handle multi-class samples. While in the second scheme, multiple binary classifiers were trained. Discussion and comparison of results between the two schemes were carried out. Finally, information like location, date and time was extracted using spaCy’s named entity recognition.
first_indexed 2024-10-01T04:35:45Z
format Final Year Project (FYP)
id ntu-10356/149108
institution Nanyang Technological University
language English
last_indexed 2024-10-01T04:35:45Z
publishDate 2021
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1491082023-07-07T17:33:04Z Information extraction of hazard events Lim, Joanna Jia Yi Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency response teams to better allocate resources and support to affected areas. In this project, several deep learning models were explored to identify occurrences of information like Deaths, Injury, Location, Date and Time in hazard events related news sentences. News articles of hazard events like attacks, earthquakes, typhoon, hurricanes, road accidents were first identified and filtered using keywords. Next, sentences of interest from these news articles were isolated and labelled to form the hazard events database. The labelled training data is then used to train deep neural network models. Two schemes were explored in this project. In the first scheme, one single model was trained to handle multi-class samples. While in the second scheme, multiple binary classifiers were trained. Discussion and comparison of results between the two schemes were carried out. Finally, information like location, date and time was extracted using spaCy’s named entity recognition. Bachelor of Engineering (Information Engineering and Media) 2021-05-26T13:24:54Z 2021-05-26T13:24:54Z 2021 Final Year Project (FYP) Lim, J. J. Y. (2021). Information extraction of hazard events. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149108 https://hdl.handle.net/10356/149108 en A1104-201 application/pdf Nanyang Technological University
spellingShingle Engineering::Electrical and electronic engineering
Lim, Joanna Jia Yi
Information extraction of hazard events
title Information extraction of hazard events
title_full Information extraction of hazard events
title_fullStr Information extraction of hazard events
title_full_unstemmed Information extraction of hazard events
title_short Information extraction of hazard events
title_sort information extraction of hazard events
topic Engineering::Electrical and electronic engineering
url https://hdl.handle.net/10356/149108
work_keys_str_mv AT limjoannajiayi informationextractionofhazardevents