LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining
Generally, seafarers face a higher risk of illnesses and accidents than land workers. In most cases, there are no medical professionals on board seagoing vessels, which makes disease diagnosis even more difficult. When this occurs, onshore doctors may be able to provide medical advice through teleme...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-03-01
|
Series: | Bioengineering |
Subjects: | |
Online Access: | https://www.mdpi.com/2306-5354/9/3/124 |
_version_ | 1797472779478499328 |
---|---|
author | Nalini Chintalapudi Ulrico Angeloni Gopi Battineni Marzio di Canio Claudia Marotta Giovanni Rezza Getu Gamo Sagaro Andrea Silenzi Francesco Amenta |
author_facet | Nalini Chintalapudi Ulrico Angeloni Gopi Battineni Marzio di Canio Claudia Marotta Giovanni Rezza Getu Gamo Sagaro Andrea Silenzi Francesco Amenta |
author_sort | Nalini Chintalapudi |
collection | DOAJ |
description | Generally, seafarers face a higher risk of illnesses and accidents than land workers. In most cases, there are no medical professionals on board seagoing vessels, which makes disease diagnosis even more difficult. When this occurs, onshore doctors may be able to provide medical advice through telemedicine by receiving better symptomatic and clinical details in the health abstracts of seafarers. The adoption of text mining techniques can assist in extracting diagnostic information from clinical texts. We applied lexicon sentimental analysis to explore the automatic labeling of positive and negative healthcare terms to seafarers’ text healthcare documents. This was due to the lack of experimental evaluations using computational techniques. In order to classify diseases and their associated symptoms, the LASSO regression algorithm is applied to analyze these text documents. A visualization of symptomatic data frequency for each disease can be achieved by analyzing TF-IDF values. The proposed approach allows for the classification of text documents with 93.8% accuracy by using a machine learning model called LASSO regression. It is possible to classify text documents effectively with tidy text mining libraries. In addition to delivering health assistance, this method can be used to classify diseases and establish health observatories. Knowledge developed in the present work will be applied to establish an Epidemiological Observatory of Seafarers’ Pathologies and Injuries. This Observatory will be a collaborative initiative of the Italian Ministry of Health, University of Camerino, and International Radio Medical Centre (C.I.R.M.), the Italian TMAS. |
first_indexed | 2024-03-09T20:07:00Z |
format | Article |
id | doaj.art-ac7da23d89d348c6b1739c17d8c2e32b |
institution | Directory Open Access Journal |
issn | 2306-5354 |
language | English |
last_indexed | 2024-03-09T20:07:00Z |
publishDate | 2022-03-01 |
publisher | MDPI AG |
record_format | Article |
series | Bioengineering |
spelling | doaj.art-ac7da23d89d348c6b1739c17d8c2e32b2023-11-24T00:30:27ZengMDPI AGBioengineering2306-53542022-03-019312410.3390/bioengineering9030124LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text MiningNalini Chintalapudi0Ulrico Angeloni1Gopi Battineni2Marzio di Canio3Claudia Marotta4Giovanni Rezza5Getu Gamo Sagaro6Andrea Silenzi7Francesco Amenta8Clinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, ItalyGeneral Directorate of Health Prevention, Ministry of Health, 00144 Rome, ItalyClinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, ItalyClinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, ItalyGeneral Directorate of Health Prevention, Ministry of Health, 00144 Rome, ItalyGeneral Directorate of Health Prevention, Ministry of Health, 00144 Rome, ItalyClinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, ItalyGeneral Directorate of Health Prevention, Ministry of Health, 00144 Rome, ItalyClinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, ItalyGenerally, seafarers face a higher risk of illnesses and accidents than land workers. In most cases, there are no medical professionals on board seagoing vessels, which makes disease diagnosis even more difficult. When this occurs, onshore doctors may be able to provide medical advice through telemedicine by receiving better symptomatic and clinical details in the health abstracts of seafarers. The adoption of text mining techniques can assist in extracting diagnostic information from clinical texts. We applied lexicon sentimental analysis to explore the automatic labeling of positive and negative healthcare terms to seafarers’ text healthcare documents. This was due to the lack of experimental evaluations using computational techniques. In order to classify diseases and their associated symptoms, the LASSO regression algorithm is applied to analyze these text documents. A visualization of symptomatic data frequency for each disease can be achieved by analyzing TF-IDF values. The proposed approach allows for the classification of text documents with 93.8% accuracy by using a machine learning model called LASSO regression. It is possible to classify text documents effectively with tidy text mining libraries. In addition to delivering health assistance, this method can be used to classify diseases and establish health observatories. Knowledge developed in the present work will be applied to establish an Epidemiological Observatory of Seafarers’ Pathologies and Injuries. This Observatory will be a collaborative initiative of the Italian Ministry of Health, University of Camerino, and International Radio Medical Centre (C.I.R.M.), the Italian TMAS.https://www.mdpi.com/2306-5354/9/3/124seafarerstext mininglasso regressiondisease mappingcorrelations |
spellingShingle | Nalini Chintalapudi Ulrico Angeloni Gopi Battineni Marzio di Canio Claudia Marotta Giovanni Rezza Getu Gamo Sagaro Andrea Silenzi Francesco Amenta LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining Bioengineering seafarers text mining lasso regression disease mapping correlations |
title | LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining |
title_full | LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining |
title_fullStr | LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining |
title_full_unstemmed | LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining |
title_short | LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining |
title_sort | lasso regression modeling on prediction of medical terms among seafarers health documents using tidy text mining |
topic | seafarers text mining lasso regression disease mapping correlations |
url | https://www.mdpi.com/2306-5354/9/3/124 |
work_keys_str_mv | AT nalinichintalapudi lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining AT ulricoangeloni lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining AT gopibattineni lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining AT marziodicanio lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining AT claudiamarotta lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining AT giovannirezza lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining AT getugamosagaro lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining AT andreasilenzi lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining AT francescoamenta lassoregressionmodelingonpredictionofmedicaltermsamongseafarershealthdocumentsusingtidytextmining |