Machine learning applied to big data from marine cabled observatories: A case study of sablefish monitoring in the NE Pacific

Ocean observatories collect large volumes of video data, with some data archives now spanning well over a few decades, and bringing the challenges of analytical capacity beyond conventional processing tools. The analysis of such vast and complex datasets can only be achieved with appropriate machine...

Full description

Bibliographic Details
Main Authors: Federico Bonofiglio, Fabio C. De Leo, Connor Yee, Damianos Chatzievangelou, Jacopo Aguzzi, Simone Marini
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-08-01
Series:Frontiers in Marine Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmars.2022.842946/full
_version_ 1828432480466632704
author Federico Bonofiglio
Fabio C. De Leo
Fabio C. De Leo
Connor Yee
Damianos Chatzievangelou
Jacopo Aguzzi
Jacopo Aguzzi
Simone Marini
Simone Marini
author_facet Federico Bonofiglio
Fabio C. De Leo
Fabio C. De Leo
Connor Yee
Damianos Chatzievangelou
Jacopo Aguzzi
Jacopo Aguzzi
Simone Marini
Simone Marini
author_sort Federico Bonofiglio
collection DOAJ
description Ocean observatories collect large volumes of video data, with some data archives now spanning well over a few decades, and bringing the challenges of analytical capacity beyond conventional processing tools. The analysis of such vast and complex datasets can only be achieved with appropriate machine learning and Artificial Intelligence (AI) tools. The implementation of AI monitoring programs for animal tracking and classification becomes necessary in the particular case of deep-sea cabled observatories, as those operated by Ocean Networks Canada (ONC), where Petabytes of data are now collected each and every year since their installation. Here, we present a machine-learning and computer vision automated pipeline to detect and count sablefish (Anoplopoma fimbria), a key commercially exploited species in the N-NE Pacific. We used 651 hours of video footage obtained from three long-term monitoring sites in the NEPTUNE cabled observatory, in Barkley Canyon, on the nearby slope, and at depths ranging from 420 to 985 m. Our proposed AI sablefish detection and classification pipeline was tested and validated for an initial 4.5 month period (Sep 18 2019-Jan 2 2020), and was a first step towards validation for future processing of the now decade-long video archives from Barkley Canyon. For the validation period, we trained a YOLO neural network on 2917 manually annotated frames containing sablefish images to obtain an automatic detector with a 92% Average Precision (AP) on 730 test images, and a 5-fold cross-validation AP of 93% (± 3.7%). We then ran the detector on all video material (i.e., 651 hours from a 4.5 month period), to automatically detect and annotate sablefish. We finally applied a tracking algorithm on detection results, to approximate counts of individual fishes moving on scene and obtain a time series of proxy sablefish abundance. Those proxy abundance estimates are among the first to be made using such a large volume of video data from deep-sea settings. We discuss our AI results for application on a decade-long video monitoring program, and particularly with potential for complementing fisheries management practices of a commercially important species.
first_indexed 2024-12-10T18:20:04Z
format Article
id doaj.art-de163e8b7a2a4b9f8159dbfe85b1f7ec
institution Directory Open Access Journal
issn 2296-7745
language English
last_indexed 2024-12-10T18:20:04Z
publishDate 2022-08-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Marine Science
spelling doaj.art-de163e8b7a2a4b9f8159dbfe85b1f7ec2022-12-22T01:38:14ZengFrontiers Media S.A.Frontiers in Marine Science2296-77452022-08-01910.3389/fmars.2022.842946842946Machine learning applied to big data from marine cabled observatories: A case study of sablefish monitoring in the NE PacificFederico Bonofiglio0Fabio C. De Leo1Fabio C. De Leo2Connor Yee3Damianos Chatzievangelou4Jacopo Aguzzi5Jacopo Aguzzi6Simone Marini7Simone Marini8Institute of Marine Sciences (ISMAR), National Research Council of Italy, Lerici, ItalyOcean Networks Canada, University of Victoria, Victoria, BC, CanadaDepartment of Biology, University of Victoria, Victoria, BC, CanadaDepartment of Biology, University of Victoria, Victoria, BC, CanadaFunctioning and Vulnerability of Marine Ecosystems Group, Department of Renewable Marine Resources, Instituto de Ciencias del Mar - Consejo Superior de Investigaciones Científicas (ICM-CSIC), Barcelona, SpainFunctioning and Vulnerability of Marine Ecosystems Group, Department of Renewable Marine Resources, Instituto de Ciencias del Mar - Consejo Superior de Investigaciones Científicas (ICM-CSIC), Barcelona, SpainStazione Zoologica Anton Dohrn (SZN), Naples, ItalyInstitute of Marine Sciences (ISMAR), National Research Council of Italy, Lerici, ItalyStazione Zoologica Anton Dohrn (SZN), Naples, ItalyOcean observatories collect large volumes of video data, with some data archives now spanning well over a few decades, and bringing the challenges of analytical capacity beyond conventional processing tools. The analysis of such vast and complex datasets can only be achieved with appropriate machine learning and Artificial Intelligence (AI) tools. The implementation of AI monitoring programs for animal tracking and classification becomes necessary in the particular case of deep-sea cabled observatories, as those operated by Ocean Networks Canada (ONC), where Petabytes of data are now collected each and every year since their installation. Here, we present a machine-learning and computer vision automated pipeline to detect and count sablefish (Anoplopoma fimbria), a key commercially exploited species in the N-NE Pacific. We used 651 hours of video footage obtained from three long-term monitoring sites in the NEPTUNE cabled observatory, in Barkley Canyon, on the nearby slope, and at depths ranging from 420 to 985 m. Our proposed AI sablefish detection and classification pipeline was tested and validated for an initial 4.5 month period (Sep 18 2019-Jan 2 2020), and was a first step towards validation for future processing of the now decade-long video archives from Barkley Canyon. For the validation period, we trained a YOLO neural network on 2917 manually annotated frames containing sablefish images to obtain an automatic detector with a 92% Average Precision (AP) on 730 test images, and a 5-fold cross-validation AP of 93% (± 3.7%). We then ran the detector on all video material (i.e., 651 hours from a 4.5 month period), to automatically detect and annotate sablefish. We finally applied a tracking algorithm on detection results, to approximate counts of individual fishes moving on scene and obtain a time series of proxy sablefish abundance. Those proxy abundance estimates are among the first to be made using such a large volume of video data from deep-sea settings. We discuss our AI results for application on a decade-long video monitoring program, and particularly with potential for complementing fisheries management practices of a commercially important species.https://www.frontiersin.org/articles/10.3389/fmars.2022.842946/fullbig datamachine learningmarine observatoriesautomated video analysisfishery independent monitoringocean network Canada
spellingShingle Federico Bonofiglio
Fabio C. De Leo
Fabio C. De Leo
Connor Yee
Damianos Chatzievangelou
Jacopo Aguzzi
Jacopo Aguzzi
Simone Marini
Simone Marini
Machine learning applied to big data from marine cabled observatories: A case study of sablefish monitoring in the NE Pacific
Frontiers in Marine Science
big data
machine learning
marine observatories
automated video analysis
fishery independent monitoring
ocean network Canada
title Machine learning applied to big data from marine cabled observatories: A case study of sablefish monitoring in the NE Pacific
title_full Machine learning applied to big data from marine cabled observatories: A case study of sablefish monitoring in the NE Pacific
title_fullStr Machine learning applied to big data from marine cabled observatories: A case study of sablefish monitoring in the NE Pacific
title_full_unstemmed Machine learning applied to big data from marine cabled observatories: A case study of sablefish monitoring in the NE Pacific
title_short Machine learning applied to big data from marine cabled observatories: A case study of sablefish monitoring in the NE Pacific
title_sort machine learning applied to big data from marine cabled observatories a case study of sablefish monitoring in the ne pacific
topic big data
machine learning
marine observatories
automated video analysis
fishery independent monitoring
ocean network Canada
url https://www.frontiersin.org/articles/10.3389/fmars.2022.842946/full
work_keys_str_mv AT federicobonofiglio machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific
AT fabiocdeleo machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific
AT fabiocdeleo machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific
AT connoryee machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific
AT damianoschatzievangelou machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific
AT jacopoaguzzi machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific
AT jacopoaguzzi machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific
AT simonemarini machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific
AT simonemarini machinelearningappliedtobigdatafrommarinecabledobservatoriesacasestudyofsablefishmonitoringinthenepacific