Voice as a Mouse Click: Usability and Effectiveness of Simplified Hands-Free Gaze-Voice Selection

Voice- and gaze-based hands-free input are increasingly used in human-machine interaction. Attempts to combine them into a hybrid technology typically employ the voice channel as an information-rich channel. Voice seems to be “overqualified” to serve simply as a substitute for a computer mouse click,...

Bibliographic Details
Main Authors: Darisy G. Zhao, Nikita D. Karikov, Eugeny V. Melnichuk, Boris M. Velichkovsky, Sergei L. Shishkin
Format: Article
Language: English
Published: MDPI AG 2020-12-01
Series: Applied Sciences
Subjects: hands-free interaction; gaze-based interaction; voice; voice command; selection; moving objects
Online Access: https://www.mdpi.com/2076-3417/10/24/8791
author Darisy G. Zhao
Nikita D. Karikov
Eugeny V. Melnichuk
Boris M. Velichkovsky
Sergei L. Shishkin
author_sort Darisy G. Zhao
collection DOAJ
description Voice- and gaze-based hands-free input are increasingly used in human-machine interaction. Attempts to combine them into a hybrid technology typically employ the voice channel as an information-rich channel. Voice seems to be “overqualified” to serve simply as a substitute for a computer mouse click, to confirm selections made by gaze. Users might be expected to feel discomfort, or to get bored easily, if they had to make frequent “clicks” with their voice, which could also lower performance. To test this, we asked 23 healthy participants to select moving objects with smooth pursuit eye movements. Manual confirmation of selection was faster and rated as more convenient than voice-based confirmation. However, the difference was not large, especially when voice was used to pronounce objects’ numbers (speech recognition was not applied): the convenience score (M ± SD) was 9.2 ± 1.1 for manual and 8.0 ± 2.1 for voice confirmation, and time spent per object was 1269 ± 265 ms and 1626 ± 331 ms, respectively. We conclude that “voice-as-click” can be used to confirm selection in gaze-based interaction with computers as a substitute for the computer mouse click when manual confirmation cannot be used.
first_indexed 2024-03-10T14:13:46Z
format Article
id doaj.art-873c6fd30f3a4691836fa4ffc77d2bec
institution Directory of Open Access Journals
issn 2076-3417
language English
last_indexed 2024-03-10T14:13:46Z
publishDate 2020-12-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-873c6fd30f3a4691836fa4ffc77d2bec
2023-11-20T23:57:31Z
eng
MDPI AG
Applied Sciences, ISSN 2076-3417
2020-12-01, volume 10, issue 24, article 8791
DOI 10.3390/app10248791
Darisy G. Zhao, Nikita D. Karikov, Eugeny V. Melnichuk, Boris M. Velichkovsky, Sergei L. Shishkin
Laboratory for Neurocognitive Technologies, NRC “Kurchatov Institute”, 123182 Moscow, Russia (affiliation of all five authors)
title Voice as a Mouse Click: Usability and Effectiveness of Simplified Hands-Free Gaze-Voice Selection
topic hands-free interaction
gaze-based interaction
voice
voice command
selection
moving objects
url https://www.mdpi.com/2076-3417/10/24/8791