Asking Questions about Scientific Articles—Identifying Large N Studies with LLMs

The exponential growth of scientific publications increases the effort required to identify relevant articles. Moreover, the scale of studies is a frequent barrier to research as the majority of studies are low or medium-scaled and do not generalize well while lacking statistical power. As such, we...

Full description

Bibliographic Details
Main Authors: Razvan Paroiu, Stefan Ruseti, Mihai Dascalu, Stefan Trausan-Matu, Danielle S. McNamara
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/12/19/3996
_version_ 1797576017452204032
author Razvan Paroiu
Stefan Ruseti
Mihai Dascalu
Stefan Trausan-Matu
Danielle S. McNamara
author_facet Razvan Paroiu
Stefan Ruseti
Mihai Dascalu
Stefan Trausan-Matu
Danielle S. McNamara
author_sort Razvan Paroiu
collection DOAJ
description The exponential growth of scientific publications increases the effort required to identify relevant articles. Moreover, the scale of studies is a frequent barrier to research as the majority of studies are low or medium-scaled and do not generalize well while lacking statistical power. As such, we introduce an automated method that supports the identification of large-scale studies in terms of population. First, we introduce a training corpus of 1229 manually annotated paragraphs extracted from 20 articles with different structures and considered populations. Our method considers prompting a FLAN-T5 language model with targeted questions and paragraphs from the previous corpus so that the model returns the number of participants from the study. We adopt a dialogic extensible approach in which the model is asked a sequence of questions that are gradual in terms of focus. Second, we use a validation corpus with 200 articles labeled for having <i>N</i> larger than 1000 to assess the performance of our language model. Our model, without any preliminary filtering with heuristics, achieves an F1 score of 0.52, surpassing previous analyses performed that obtained an F1 score of 0.51. Moreover, we achieved an F1 score of 0.69 when combined with previous extraction heuristics, thus arguing for the robustness and extensibility of our approach. Finally, we apply our model to a newly introduced dataset of ERIC publications to observe trends across the years in the Education domain. A spike was observed in 2019, followed by a decrease in 2020 and, afterward, a positive trend; nevertheless, the overall percentage is lower than 3%, suggesting a major problem in terms of scale and the need for a change in perspective.
first_indexed 2024-03-10T21:46:32Z
format Article
id doaj.art-5359ec55e22846fd8c30ca8c92d6d7ab
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-10T21:46:32Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-5359ec55e22846fd8c30ca8c92d6d7ab2023-11-19T14:15:45ZengMDPI AGElectronics2079-92922023-09-011219399610.3390/electronics12193996Asking Questions about Scientific Articles—Identifying Large N Studies with LLMsRazvan Paroiu0Stefan Ruseti1Mihai Dascalu2Stefan Trausan-Matu3Danielle S. McNamara4Computer Science and Engineering Department, National University of Science and Technology Politehnica of Bucharest, 313 Splaiul Independentei, 060042 Bucharest, RomaniaComputer Science and Engineering Department, National University of Science and Technology Politehnica of Bucharest, 313 Splaiul Independentei, 060042 Bucharest, RomaniaComputer Science and Engineering Department, National University of Science and Technology Politehnica of Bucharest, 313 Splaiul Independentei, 060042 Bucharest, RomaniaComputer Science and Engineering Department, National University of Science and Technology Politehnica of Bucharest, 313 Splaiul Independentei, 060042 Bucharest, RomaniaDepartment of Psychology, Arizona State University, Tempe, AZ 85287, USAThe exponential growth of scientific publications increases the effort required to identify relevant articles. Moreover, the scale of studies is a frequent barrier to research as the majority of studies are low or medium-scaled and do not generalize well while lacking statistical power. As such, we introduce an automated method that supports the identification of large-scale studies in terms of population. First, we introduce a training corpus of 1229 manually annotated paragraphs extracted from 20 articles with different structures and considered populations. Our method considers prompting a FLAN-T5 language model with targeted questions and paragraphs from the previous corpus so that the model returns the number of participants from the study. We adopt a dialogic extensible approach in which the model is asked a sequence of questions that are gradual in terms of focus. Second, we use a validation corpus with 200 articles labeled for having <i>N</i> larger than 1000 to assess the performance of our language model. Our model, without any preliminary filtering with heuristics, achieves an F1 score of 0.52, surpassing previous analyses performed that obtained an F1 score of 0.51. Moreover, we achieved an F1 score of 0.69 when combined with previous extraction heuristics, thus arguing for the robustness and extensibility of our approach. Finally, we apply our model to a newly introduced dataset of ERIC publications to observe trends across the years in the Education domain. A spike was observed in 2019, followed by a decrease in 2020 and, afterward, a positive trend; nevertheless, the overall percentage is lower than 3%, suggesting a major problem in terms of scale and the need for a change in perspective.https://www.mdpi.com/2079-9292/12/19/3996large language modelsquestion answeringscientific article processing
spellingShingle Razvan Paroiu
Stefan Ruseti
Mihai Dascalu
Stefan Trausan-Matu
Danielle S. McNamara
Asking Questions about Scientific Articles—Identifying Large N Studies with LLMs
Electronics
large language models
question answering
scientific article processing
title Asking Questions about Scientific Articles—Identifying Large N Studies with LLMs
title_full Asking Questions about Scientific Articles—Identifying Large N Studies with LLMs
title_fullStr Asking Questions about Scientific Articles—Identifying Large N Studies with LLMs
title_full_unstemmed Asking Questions about Scientific Articles—Identifying Large N Studies with LLMs
title_short Asking Questions about Scientific Articles—Identifying Large N Studies with LLMs
title_sort asking questions about scientific articles identifying large n studies with llms
topic large language models
question answering
scientific article processing
url https://www.mdpi.com/2079-9292/12/19/3996
work_keys_str_mv AT razvanparoiu askingquestionsaboutscientificarticlesidentifyinglargenstudieswithllms
AT stefanruseti askingquestionsaboutscientificarticlesidentifyinglargenstudieswithllms
AT mihaidascalu askingquestionsaboutscientificarticlesidentifyinglargenstudieswithllms
AT stefantrausanmatu askingquestionsaboutscientificarticlesidentifyinglargenstudieswithllms
AT daniellesmcnamara askingquestionsaboutscientificarticlesidentifyinglargenstudieswithllms