Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory

Abstract Purpose Estimating the surgery length has the potential to be utilized as skill assessment, surgical training, or efficient surgical facility utilization especially if it is done in real-time as a remaining surgery duration (RSD). Surgical length reflects a certain level of efficiency and m...

Full description

Bibliographic Details
Main Authors:	Bowen Wang, Liangzhi Li, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara
Format:	Article
Language:	English
Published:	BMC 2023-05-01
Series:	BMC Medical Informatics and Decision Making
Subjects:	Surgery time Cataract surgery Long short-term memory
Online Access:	https://doi.org/10.1186/s12911-023-02160-0

_version_	1827952735375327232
author	Bowen Wang Liangzhi Li Yuta Nakashima Ryo Kawasaki Hajime Nagahara
author_facet	Bowen Wang Liangzhi Li Yuta Nakashima Ryo Kawasaki Hajime Nagahara
author_sort	Bowen Wang
collection	DOAJ
description	Abstract Purpose Estimating the surgery length has the potential to be utilized as skill assessment, surgical training, or efficient surgical facility utilization especially if it is done in real-time as a remaining surgery duration (RSD). Surgical length reflects a certain level of efficiency and mastery of the surgeon in a well-standardized surgery such as cataract surgery. In this paper, we design and develop a real-time RSD estimation method for cataract surgery that does not require manual labeling and is transferable with minimum fine-tuning. Methods A regression method consisting of convolutional neural networks (CNNs) and long short-term memory (LSTM) is designed for RSD estimation. The model is firstly trained and evaluated for the single main surgeon with a large number of surgeries. Then, the fine-tuning strategy is used to transfer the model to the data of the other two surgeons. Mean Absolute Error (MAE in seconds) was used to evaluate the performance of the RSD estimation. The proposed method is compared with the naïve method which is based on the statistic of the historical data. A transferability experiment is also set to demonstrate the generalizability of the method. Result The mean surgical time for the sample videos was 318.7 s (s) (standard deviation 83.4 s) for the main surgeon for the initial training. In our experiments, the lowest MAE of 19.4 s (equal to about 6.4% of the mean surgical time) is achieved by our best-trained model for the independent test data of the main target surgeon. It reduces the MAE by 35.5 s (-10.2%) compared to the naïve method. The fine-tuning strategy transfers the model trained for the main target to the data of other surgeons with only a small number of training data (20% of the pre-training). The MAEs for the other two surgeons are 28.3 s and 30.6 s with the fine-tuning model, which decreased by -8.1 s and -7.5 s than the Per-surgeon model (average declining of -7.8 s and 1.3% of video duration). External validation study with Cataract-101 outperformed 3 reported methods of TimeLSTM, RSDNet, and CataNet. Conclusion An approach to build a pre-trained model for estimating RSD estimation based on a single surgeon and then transfer to other surgeons demonstrated both low prediction error and good transferability with minimum fine-tuning videos.
first_indexed	2024-04-09T14:02:18Z
format	Article
id	doaj.art-7e22c4c87bbc496ba7ed306ec94139c6
institution	Directory Open Access Journal
issn	1472-6947
language	English
last_indexed	2024-04-09T14:02:18Z
publishDate	2023-05-01
publisher	BMC
record_format	Article
series	BMC Medical Informatics and Decision Making
spelling	doaj.art-7e22c4c87bbc496ba7ed306ec94139c62023-05-07T11:15:08ZengBMCBMC Medical Informatics and Decision Making1472-69472023-05-0123111110.1186/s12911-023-02160-0Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memoryBowen Wang0Liangzhi Li1Yuta Nakashima2Ryo Kawasaki3Hajime Nagahara4Institute for Datability Science (IDS), Osaka UniversityInstitute for Datability Science (IDS), Osaka UniversityInstitute for Datability Science (IDS), Osaka UniversityArtificial Intelligence Center for Medical Research and Application, Osaka University HospitalInstitute for Datability Science (IDS), Osaka UniversityAbstract Purpose Estimating the surgery length has the potential to be utilized as skill assessment, surgical training, or efficient surgical facility utilization especially if it is done in real-time as a remaining surgery duration (RSD). Surgical length reflects a certain level of efficiency and mastery of the surgeon in a well-standardized surgery such as cataract surgery. In this paper, we design and develop a real-time RSD estimation method for cataract surgery that does not require manual labeling and is transferable with minimum fine-tuning. Methods A regression method consisting of convolutional neural networks (CNNs) and long short-term memory (LSTM) is designed for RSD estimation. The model is firstly trained and evaluated for the single main surgeon with a large number of surgeries. Then, the fine-tuning strategy is used to transfer the model to the data of the other two surgeons. Mean Absolute Error (MAE in seconds) was used to evaluate the performance of the RSD estimation. The proposed method is compared with the naïve method which is based on the statistic of the historical data. A transferability experiment is also set to demonstrate the generalizability of the method. Result The mean surgical time for the sample videos was 318.7 s (s) (standard deviation 83.4 s) for the main surgeon for the initial training. In our experiments, the lowest MAE of 19.4 s (equal to about 6.4% of the mean surgical time) is achieved by our best-trained model for the independent test data of the main target surgeon. It reduces the MAE by 35.5 s (-10.2%) compared to the naïve method. The fine-tuning strategy transfers the model trained for the main target to the data of other surgeons with only a small number of training data (20% of the pre-training). The MAEs for the other two surgeons are 28.3 s and 30.6 s with the fine-tuning model, which decreased by -8.1 s and -7.5 s than the Per-surgeon model (average declining of -7.8 s and 1.3% of video duration). External validation study with Cataract-101 outperformed 3 reported methods of TimeLSTM, RSDNet, and CataNet. Conclusion An approach to build a pre-trained model for estimating RSD estimation based on a single surgeon and then transfer to other surgeons demonstrated both low prediction error and good transferability with minimum fine-tuning videos.https://doi.org/10.1186/s12911-023-02160-0Surgery timeCataract surgeryLong short-term memory
spellingShingle	Bowen Wang Liangzhi Li Yuta Nakashima Ryo Kawasaki Hajime Nagahara Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory BMC Medical Informatics and Decision Making Surgery time Cataract surgery Long short-term memory
title	Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory
title_full	Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory
title_fullStr	Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory
title_full_unstemmed	Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory
title_short	Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory
title_sort	real time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short term memory
topic	Surgery time Cataract surgery Long short-term memory
url	https://doi.org/10.1186/s12911-023-02160-0
work_keys_str_mv	AT bowenwang realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory AT liangzhili realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory AT yutanakashima realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory AT ryokawasaki realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory AT hajimenagahara realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory

Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory

Similar Items