Dual Model Medical Invoices Recognition

Hospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work us...

Full description

Bibliographic Details
Main Authors: Fei Yi, Yi-Fei Zhao, Guan-Qun Sheng, Kai Xie, Chang Wen, Xin-Gong Tang, Xuan Qi
Format: Article
Language:English
Published: MDPI AG 2019-10-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/19/20/4370
_version_ 1818008185348292608
author Fei Yi
Yi-Fei Zhao
Guan-Qun Sheng
Kai Xie
Chang Wen
Xin-Gong Tang
Xuan Qi
author_facet Fei Yi
Yi-Fei Zhao
Guan-Qun Sheng
Kai Xie
Chang Wen
Xin-Gong Tang
Xuan Qi
author_sort Fei Yi
collection DOAJ
description Hospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work using a Gaussian blur and smoothing–convolutional neural network combined with a recurrent neural network (GBS-CR) method. Gaussian blur and smoothing (GBS) is a novel preprocessing method that can fix the breakpoint font in medical invoices. The combination of convolutional neural network (CNN) and recurrent neural network (RNN) was used to raise the recognition rate of the breakpoint font in medical invoices. RNN was designed to be the semantic revision module. In the aspect of image preprocessing, Gaussian blur and smoothing were used to fix the breakpoint font. In the period of making the self-built dataset, a certain proportion of the breakpoint font (the font of breakpoint is 3, the original font is 7) was added, in this paper, so as to optimize the Alexnet–Adam–CNN (AA-CNN) model, which is more suitable for the recognition of the breakpoint font than the traditional CNN model. In terms of the identification methods, we not only adopted the optimized AA-CNN for identification, but also combined RNN to carry out the semantic revisions of the identified results of CNN, meanwhile further improving the recognition rate of the medical invoices. The experimental results show that compared with the state-of-art invoice recognition method, the method presented in this paper has an average increase of 10 to 15 percentage points in recognition rate.
first_indexed 2024-04-14T05:25:56Z
format Article
id doaj.art-0a711fa7e7ca4403b033ba393b7e4051
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-04-14T05:25:56Z
publishDate 2019-10-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-0a711fa7e7ca4403b033ba393b7e40512022-12-22T02:09:59ZengMDPI AGSensors1424-82202019-10-011920437010.3390/s19204370s19204370Dual Model Medical Invoices RecognitionFei Yi0Yi-Fei Zhao1Guan-Qun Sheng2Kai Xie3Chang Wen4Xin-Gong Tang5Xuan Qi6Key Laboratory of Exploration Technologies for Oil and Gas Resources, Yangtze University, Ministry of Education, Wuhan 430100, ChinaSchool of Electronic Information, Yangtze University, Jingzhou 434023, ChinaKey Laboratory of Exploration Technologies for Oil and Gas Resources, Yangtze University, Ministry of Education, Wuhan 430100, ChinaSchool of Electronic Information, Yangtze University, Jingzhou 434023, ChinaSchool of Computer Science, Yangtze University, Jingzhou 434023, ChinaKey Laboratory of Exploration Technologies for Oil and Gas Resources, Yangtze University, Ministry of Education, Wuhan 430100, ChinaSchool of Petroleum Engineering, China University of Petroleum, Beijing 102249, ChinaHospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work using a Gaussian blur and smoothing–convolutional neural network combined with a recurrent neural network (GBS-CR) method. Gaussian blur and smoothing (GBS) is a novel preprocessing method that can fix the breakpoint font in medical invoices. The combination of convolutional neural network (CNN) and recurrent neural network (RNN) was used to raise the recognition rate of the breakpoint font in medical invoices. RNN was designed to be the semantic revision module. In the aspect of image preprocessing, Gaussian blur and smoothing were used to fix the breakpoint font. In the period of making the self-built dataset, a certain proportion of the breakpoint font (the font of breakpoint is 3, the original font is 7) was added, in this paper, so as to optimize the Alexnet–Adam–CNN (AA-CNN) model, which is more suitable for the recognition of the breakpoint font than the traditional CNN model. In terms of the identification methods, we not only adopted the optimized AA-CNN for identification, but also combined RNN to carry out the semantic revisions of the identified results of CNN, meanwhile further improving the recognition rate of the medical invoices. The experimental results show that compared with the state-of-art invoice recognition method, the method presented in this paper has an average increase of 10 to 15 percentage points in recognition rate.https://www.mdpi.com/1424-8220/19/20/4370medical invoicesbreakpoint fontcnnrnnsemantic revisions
spellingShingle Fei Yi
Yi-Fei Zhao
Guan-Qun Sheng
Kai Xie
Chang Wen
Xin-Gong Tang
Xuan Qi
Dual Model Medical Invoices Recognition
Sensors
medical invoices
breakpoint font
cnn
rnn
semantic revisions
title Dual Model Medical Invoices Recognition
title_full Dual Model Medical Invoices Recognition
title_fullStr Dual Model Medical Invoices Recognition
title_full_unstemmed Dual Model Medical Invoices Recognition
title_short Dual Model Medical Invoices Recognition
title_sort dual model medical invoices recognition
topic medical invoices
breakpoint font
cnn
rnn
semantic revisions
url https://www.mdpi.com/1424-8220/19/20/4370
work_keys_str_mv AT feiyi dualmodelmedicalinvoicesrecognition
AT yifeizhao dualmodelmedicalinvoicesrecognition
AT guanqunsheng dualmodelmedicalinvoicesrecognition
AT kaixie dualmodelmedicalinvoicesrecognition
AT changwen dualmodelmedicalinvoicesrecognition
AT xingongtang dualmodelmedicalinvoicesrecognition
AT xuanqi dualmodelmedicalinvoicesrecognition