Global Flood Disaster Research Graph Analysis Based on Literature Mining

Floods are the most frequent and highest-impact among the natural disasters caused by global climate change. A large number of flood disaster knowledge were buried in the scientific literature. This study mines research trends and hotspots on flood disasters and identifies their quantitative and spa...

Full description

Bibliographic Details
Main Authors: Min Zhang, Juanle Wang
Format: Article
Language:English
Published: MDPI AG 2022-03-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/6/3066
_version_ 1797472972888342528
author Min Zhang
Juanle Wang
author_facet Min Zhang
Juanle Wang
author_sort Min Zhang
collection DOAJ
description Floods are the most frequent and highest-impact among the natural disasters caused by global climate change. A large number of flood disaster knowledge were buried in the scientific literature. This study mines research trends and hotspots on flood disasters and identifies their quantitative and spatial distribution features using natural language process technology. The abstracts of 14,076 studies related to flood disasters from 1990 to 2020 were used for text mining. The study used logistic regression to classify themes, adopted the dictionary matching method to analyze flood disaster subcategories, analyzed the spatial distribution characteristics of research institutions, and used Stanford named entity recognition to identify hot research areas. Finally, the disaster information was integrated and visualized as a knowledge graph. The main findings are as follows. (1) The research hotspots are concentrated on flood disaster risks and prediction. Rainfall, coastal floods, and flash floods are the most-studied flood disaster sub-categories. (2) There are some connections and differences between the physical occurrence and research frequency of flood disasters. Occurrence frequency and research frequency of flood disasters are correlated. However, the spatial distribution at the global and intercontinental scales is geographically imbalanced. (3) The study’s flood disaster knowledge graph contains 39,679 nodes and 64,908 edges, reflecting the literature distribution and field information on the research themes. Future research will extract more disaster information from the full texts of the studies to enrich the flood disaster knowledge graph and obtain more knowledge on flood disaster risk and reduction.
first_indexed 2024-03-09T20:08:37Z
format Article
id doaj.art-5343be6d7df74238ad533a025304b7ba
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-09T20:08:37Z
publishDate 2022-03-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-5343be6d7df74238ad533a025304b7ba2023-11-24T00:23:13ZengMDPI AGApplied Sciences2076-34172022-03-01126306610.3390/app12063066Global Flood Disaster Research Graph Analysis Based on Literature MiningMin Zhang0Juanle Wang1State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, ChinaState Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, ChinaFloods are the most frequent and highest-impact among the natural disasters caused by global climate change. A large number of flood disaster knowledge were buried in the scientific literature. This study mines research trends and hotspots on flood disasters and identifies their quantitative and spatial distribution features using natural language process technology. The abstracts of 14,076 studies related to flood disasters from 1990 to 2020 were used for text mining. The study used logistic regression to classify themes, adopted the dictionary matching method to analyze flood disaster subcategories, analyzed the spatial distribution characteristics of research institutions, and used Stanford named entity recognition to identify hot research areas. Finally, the disaster information was integrated and visualized as a knowledge graph. The main findings are as follows. (1) The research hotspots are concentrated on flood disaster risks and prediction. Rainfall, coastal floods, and flash floods are the most-studied flood disaster sub-categories. (2) There are some connections and differences between the physical occurrence and research frequency of flood disasters. Occurrence frequency and research frequency of flood disasters are correlated. However, the spatial distribution at the global and intercontinental scales is geographically imbalanced. (3) The study’s flood disaster knowledge graph contains 39,679 nodes and 64,908 edges, reflecting the literature distribution and field information on the research themes. Future research will extract more disaster information from the full texts of the studies to enrich the flood disaster knowledge graph and obtain more knowledge on flood disaster risk and reduction.https://www.mdpi.com/2076-3417/12/6/3066flood disasterresearch hotspotliterature miningnatural language processingknowledge graph
spellingShingle Min Zhang
Juanle Wang
Global Flood Disaster Research Graph Analysis Based on Literature Mining
Applied Sciences
flood disaster
research hotspot
literature mining
natural language processing
knowledge graph
title Global Flood Disaster Research Graph Analysis Based on Literature Mining
title_full Global Flood Disaster Research Graph Analysis Based on Literature Mining
title_fullStr Global Flood Disaster Research Graph Analysis Based on Literature Mining
title_full_unstemmed Global Flood Disaster Research Graph Analysis Based on Literature Mining
title_short Global Flood Disaster Research Graph Analysis Based on Literature Mining
title_sort global flood disaster research graph analysis based on literature mining
topic flood disaster
research hotspot
literature mining
natural language processing
knowledge graph
url https://www.mdpi.com/2076-3417/12/6/3066
work_keys_str_mv AT minzhang globalflooddisasterresearchgraphanalysisbasedonliteraturemining
AT juanlewang globalflooddisasterresearchgraphanalysisbasedonliteraturemining