SiamMixer: A Lightweight and Hardware-Friendly Visual Object-Tracking Network

Siamese networks have been extensively studied in recent years. Most of the previous research focuses on improving accuracy, while merely a few recognize the necessity of reducing parameter redundancy and computation load. Even less work has been done to optimize the runtime memory cost when designi...

Full description

Bibliographic Details
Main Authors: Li Cheng, Xuemin Zheng, Mingxin Zhao, Runjiang Dou, Shuangming Yu, Nanjian Wu, Liyuan Liu
Format: Article
Language:English
Published: MDPI AG 2022-02-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/4/1585
_version_ 1797476688406249472
author Li Cheng
Xuemin Zheng
Mingxin Zhao
Runjiang Dou
Shuangming Yu
Nanjian Wu
Liyuan Liu
author_facet Li Cheng
Xuemin Zheng
Mingxin Zhao
Runjiang Dou
Shuangming Yu
Nanjian Wu
Liyuan Liu
author_sort Li Cheng
collection DOAJ
description Siamese networks have been extensively studied in recent years. Most of the previous research focuses on improving accuracy, while merely a few recognize the necessity of reducing parameter redundancy and computation load. Even less work has been done to optimize the runtime memory cost when designing networks, making the Siamese-network-based tracker difficult to deploy on edge devices. In this paper, we present SiamMixer, a lightweight and hardware-friendly visual object-tracking network. It uses patch-by-patch inference to reduce memory use in shallow layers, where each small image region is processed individually. It merges and globally encodes feature maps in deep layers to enhance accuracy. Benefiting from these techniques, SiamMixer demonstrates a comparable accuracy to other large trackers with only 286 kB parameters and 196 kB extra memory use for feature maps. Additionally, we verify the impact of various activation functions and replace all activation functions with ReLU in SiamMixer. This reduces the cost when deploying on mobile devices.
first_indexed 2024-03-09T21:05:12Z
format Article
id doaj.art-6d49481484d04214a3ac3cd4d9998aa7
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T21:05:12Z
publishDate 2022-02-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-6d49481484d04214a3ac3cd4d9998aa72023-11-23T22:01:54ZengMDPI AGSensors1424-82202022-02-01224158510.3390/s22041585SiamMixer: A Lightweight and Hardware-Friendly Visual Object-Tracking NetworkLi Cheng0Xuemin Zheng1Mingxin Zhao2Runjiang Dou3Shuangming Yu4Nanjian Wu5Liyuan Liu6State Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, ChinaState Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, ChinaState Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, ChinaState Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, ChinaState Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, ChinaState Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, ChinaState Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, ChinaSiamese networks have been extensively studied in recent years. Most of the previous research focuses on improving accuracy, while merely a few recognize the necessity of reducing parameter redundancy and computation load. Even less work has been done to optimize the runtime memory cost when designing networks, making the Siamese-network-based tracker difficult to deploy on edge devices. In this paper, we present SiamMixer, a lightweight and hardware-friendly visual object-tracking network. It uses patch-by-patch inference to reduce memory use in shallow layers, where each small image region is processed individually. It merges and globally encodes feature maps in deep layers to enhance accuracy. Benefiting from these techniques, SiamMixer demonstrates a comparable accuracy to other large trackers with only 286 kB parameters and 196 kB extra memory use for feature maps. Additionally, we verify the impact of various activation functions and replace all activation functions with ReLU in SiamMixer. This reduces the cost when deploying on mobile devices.https://www.mdpi.com/1424-8220/22/4/1585visual object-trackingdeep featuressiamese networklightweight neural networkedge computing devices
spellingShingle Li Cheng
Xuemin Zheng
Mingxin Zhao
Runjiang Dou
Shuangming Yu
Nanjian Wu
Liyuan Liu
SiamMixer: A Lightweight and Hardware-Friendly Visual Object-Tracking Network
Sensors
visual object-tracking
deep features
siamese network
lightweight neural network
edge computing devices
title SiamMixer: A Lightweight and Hardware-Friendly Visual Object-Tracking Network
title_full SiamMixer: A Lightweight and Hardware-Friendly Visual Object-Tracking Network
title_fullStr SiamMixer: A Lightweight and Hardware-Friendly Visual Object-Tracking Network
title_full_unstemmed SiamMixer: A Lightweight and Hardware-Friendly Visual Object-Tracking Network
title_short SiamMixer: A Lightweight and Hardware-Friendly Visual Object-Tracking Network
title_sort siammixer a lightweight and hardware friendly visual object tracking network
topic visual object-tracking
deep features
siamese network
lightweight neural network
edge computing devices
url https://www.mdpi.com/1424-8220/22/4/1585
work_keys_str_mv AT licheng siammixeralightweightandhardwarefriendlyvisualobjecttrackingnetwork
AT xueminzheng siammixeralightweightandhardwarefriendlyvisualobjecttrackingnetwork
AT mingxinzhao siammixeralightweightandhardwarefriendlyvisualobjecttrackingnetwork
AT runjiangdou siammixeralightweightandhardwarefriendlyvisualobjecttrackingnetwork
AT shuangmingyu siammixeralightweightandhardwarefriendlyvisualobjecttrackingnetwork
AT nanjianwu siammixeralightweightandhardwarefriendlyvisualobjecttrackingnetwork
AT liyuanliu siammixeralightweightandhardwarefriendlyvisualobjecttrackingnetwork