Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare

Health is an individual’s most precious asset and healthcare is one of the vehicles for preserving it. The Indian government’s spend on healthcare system is relatively low (1.2% of GDP). Consequently, Secondary and Tertiary government healthcare centers in India (that are presumed to be of above ave...

Full description

Bibliographic Details
Main Authors: Veena Bansal, Abhishek Poddar, R. Ghosh-Roy
Format: Article
Language:English
Published: MDPI AG 2019-01-01
Series:Information
Subjects:
Online Access:http://www.mdpi.com/2078-2489/10/1/25
_version_ 1811259560555446272
author Veena Bansal
Abhishek Poddar
R. Ghosh-Roy
author_facet Veena Bansal
Abhishek Poddar
R. Ghosh-Roy
author_sort Veena Bansal
collection DOAJ
description Health is an individual’s most precious asset and healthcare is one of the vehicles for preserving it. The Indian government’s spend on healthcare system is relatively low (1.2% of GDP). Consequently, Secondary and Tertiary government healthcare centers in India (that are presumed to be of above average ratings) are always crowded. In Tertiary healthcare centers, like the All India Institute of Medical Science (AIIMS), patients are often unable to articulate their problems correctly to the healthcare center’s reception staff, so that these patients to be directed to the correct healthcare department. In this paper, we propose a system that will scan prescriptions, referral letters and medical diagnostic reports of a patient, process the input using OCR (Optical Character Recognition) engines, coupled with image processing tools, to direct the patient to the most relevant department. We have implemented and tested parts of this system wherein a patient enters his symptoms and/or provisional diagnosis; the system suggests a department based on this user input. Our system suggests the correct department 70.19% of the time. On further investigation, we found that one particular department of the hospital was over-represented. We eliminated the department from the data and performance of the system improved to 92.7%. Our system presently makes its suggestions using random forest algorithm that has been trained using two information repositories-symptoms and disease data, functional description of each medical department. It is our informed assumption that, once we have incorporated medicine information and diagnostics imaging data to train the system; and the complete medical history of the patient, performance of the system will improve further.
first_indexed 2024-04-12T18:33:27Z
format Article
id doaj.art-e0c5f898d2ed478085c4c17b12772af0
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-04-12T18:33:27Z
publishDate 2019-01-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-e0c5f898d2ed478085c4c17b12772af02022-12-22T03:21:01ZengMDPI AGInformation2078-24892019-01-011012510.3390/info10010025info10010025Identifying a Medical Department Based on Unstructured Data: A Big Data Application in HealthcareVeena Bansal0Abhishek Poddar1R. Ghosh-Roy2Indian Institute of Technology Bhilai, Raipur- 492015, IndiaIndian Institute of Technology Kanpur, Kanpur-208016, IndiaIBM United Kingdom Limited, London SE1 9PZ, UKHealth is an individual’s most precious asset and healthcare is one of the vehicles for preserving it. The Indian government’s spend on healthcare system is relatively low (1.2% of GDP). Consequently, Secondary and Tertiary government healthcare centers in India (that are presumed to be of above average ratings) are always crowded. In Tertiary healthcare centers, like the All India Institute of Medical Science (AIIMS), patients are often unable to articulate their problems correctly to the healthcare center’s reception staff, so that these patients to be directed to the correct healthcare department. In this paper, we propose a system that will scan prescriptions, referral letters and medical diagnostic reports of a patient, process the input using OCR (Optical Character Recognition) engines, coupled with image processing tools, to direct the patient to the most relevant department. We have implemented and tested parts of this system wherein a patient enters his symptoms and/or provisional diagnosis; the system suggests a department based on this user input. Our system suggests the correct department 70.19% of the time. On further investigation, we found that one particular department of the hospital was over-represented. We eliminated the department from the data and performance of the system improved to 92.7%. Our system presently makes its suggestions using random forest algorithm that has been trained using two information repositories-symptoms and disease data, functional description of each medical department. It is our informed assumption that, once we have incorporated medicine information and diagnostics imaging data to train the system; and the complete medical history of the patient, performance of the system will improve further.http://www.mdpi.com/2078-2489/10/1/25healthcarebig dataunstructured datatertiary healthcare
spellingShingle Veena Bansal
Abhishek Poddar
R. Ghosh-Roy
Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare
Information
healthcare
big data
unstructured data
tertiary healthcare
title Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare
title_full Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare
title_fullStr Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare
title_full_unstemmed Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare
title_short Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare
title_sort identifying a medical department based on unstructured data a big data application in healthcare
topic healthcare
big data
unstructured data
tertiary healthcare
url http://www.mdpi.com/2078-2489/10/1/25
work_keys_str_mv AT veenabansal identifyingamedicaldepartmentbasedonunstructureddataabigdataapplicationinhealthcare
AT abhishekpoddar identifyingamedicaldepartmentbasedonunstructureddataabigdataapplicationinhealthcare
AT rghoshroy identifyingamedicaldepartmentbasedonunstructureddataabigdataapplicationinhealthcare