Real-Time Cattle Pose Estimation Based on Improved RTMPose

Accurate cattle pose estimation is essential for Precision Livestock Farming (PLF). Computer vision-based, non-contact cattle pose estimation technology can be applied for behaviour recognition and lameness detection. Existing methods still face challenges in achieving fast cattle pose estimation in...

Full description

Bibliographic Details
Main Authors: Xiaowu Li, Kun Sun, Hongbo Fan, Zihan He
Format: Article
Language:English
Published: MDPI AG 2023-10-01
Series:Agriculture
Subjects:
Online Access:https://www.mdpi.com/2077-0472/13/10/1938
_version_ 1797575076797743104
author Xiaowu Li
Kun Sun
Hongbo Fan
Zihan He
author_facet Xiaowu Li
Kun Sun
Hongbo Fan
Zihan He
author_sort Xiaowu Li
collection DOAJ
description Accurate cattle pose estimation is essential for Precision Livestock Farming (PLF). Computer vision-based, non-contact cattle pose estimation technology can be applied for behaviour recognition and lameness detection. Existing methods still face challenges in achieving fast cattle pose estimation in complex scenarios. In this work, we introduce the FasterNest Block and Depth Block to enhance the performance of cattle pose estimation based on the RTMPose model. First, the accuracy of cattle pose estimation relies on the capture of high-level image features. The FasterNest Block, with its three-branch structure, effectively utilizes high-level feature map information, significantly improving accuracy without a significant decrease in inference speed. Second, large kernel convolutions can increase the computation cost of the model. Therefore, the Depth Block adopts a method based on depthwise separable convolutions to replace large kernel convolutions. This addresses the insensitivity to semantic information while reducing the model’s parameter. Additionally, the SimAM module enhances the model’s spatial learning capabilities without introducing extra parameters. We conducted tests on various datasets, including our collected complex scene dataset (cattle dataset) and the AP-10K public dataset. The results demonstrate that our model achieves the best average accuracy with the lowest model parameters and computational requirements, achieving 82.9% on the cattle test set and 72.0% on the AP-10K test set. Furthermore, in conjunction with the object detection model RTMDet-m, our model reaches a remarkable inference speed of 39FPS on an NVIDIA GTX 2080Ti GPU using the PyTorch framework, making it the fastest among all models. This work provides adequate technical support for fast and accurate cattle pose estimation in complex farm environments.
first_indexed 2024-03-10T21:31:12Z
format Article
id doaj.art-5263b80a41d64c04b0be6d0aa4ba8d9d
institution Directory Open Access Journal
issn 2077-0472
language English
last_indexed 2024-03-10T21:31:12Z
publishDate 2023-10-01
publisher MDPI AG
record_format Article
series Agriculture
spelling doaj.art-5263b80a41d64c04b0be6d0aa4ba8d9d2023-11-19T15:18:53ZengMDPI AGAgriculture2077-04722023-10-011310193810.3390/agriculture13101938Real-Time Cattle Pose Estimation Based on Improved RTMPoseXiaowu Li0Kun Sun1Hongbo Fan2Zihan He3Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, ChinaFaculty of Modern Agricultural Engineering, Kunming University of Science and Technology, Kunming 650300, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, ChinaAccurate cattle pose estimation is essential for Precision Livestock Farming (PLF). Computer vision-based, non-contact cattle pose estimation technology can be applied for behaviour recognition and lameness detection. Existing methods still face challenges in achieving fast cattle pose estimation in complex scenarios. In this work, we introduce the FasterNest Block and Depth Block to enhance the performance of cattle pose estimation based on the RTMPose model. First, the accuracy of cattle pose estimation relies on the capture of high-level image features. The FasterNest Block, with its three-branch structure, effectively utilizes high-level feature map information, significantly improving accuracy without a significant decrease in inference speed. Second, large kernel convolutions can increase the computation cost of the model. Therefore, the Depth Block adopts a method based on depthwise separable convolutions to replace large kernel convolutions. This addresses the insensitivity to semantic information while reducing the model’s parameter. Additionally, the SimAM module enhances the model’s spatial learning capabilities without introducing extra parameters. We conducted tests on various datasets, including our collected complex scene dataset (cattle dataset) and the AP-10K public dataset. The results demonstrate that our model achieves the best average accuracy with the lowest model parameters and computational requirements, achieving 82.9% on the cattle test set and 72.0% on the AP-10K test set. Furthermore, in conjunction with the object detection model RTMDet-m, our model reaches a remarkable inference speed of 39FPS on an NVIDIA GTX 2080Ti GPU using the PyTorch framework, making it the fastest among all models. This work provides adequate technical support for fast and accurate cattle pose estimation in complex farm environments.https://www.mdpi.com/2077-0472/13/10/1938cattle pose estimationRTMPoseFasterNest BlockSimAM attentionDepth Block
spellingShingle Xiaowu Li
Kun Sun
Hongbo Fan
Zihan He
Real-Time Cattle Pose Estimation Based on Improved RTMPose
Agriculture
cattle pose estimation
RTMPose
FasterNest Block
SimAM attention
Depth Block
title Real-Time Cattle Pose Estimation Based on Improved RTMPose
title_full Real-Time Cattle Pose Estimation Based on Improved RTMPose
title_fullStr Real-Time Cattle Pose Estimation Based on Improved RTMPose
title_full_unstemmed Real-Time Cattle Pose Estimation Based on Improved RTMPose
title_short Real-Time Cattle Pose Estimation Based on Improved RTMPose
title_sort real time cattle pose estimation based on improved rtmpose
topic cattle pose estimation
RTMPose
FasterNest Block
SimAM attention
Depth Block
url https://www.mdpi.com/2077-0472/13/10/1938
work_keys_str_mv AT xiaowuli realtimecattleposeestimationbasedonimprovedrtmpose
AT kunsun realtimecattleposeestimationbasedonimprovedrtmpose
AT hongbofan realtimecattleposeestimationbasedonimprovedrtmpose
AT zihanhe realtimecattleposeestimationbasedonimprovedrtmpose