Autonomous learning of speaker identity and WiFi geofence from noisy sensor data

A fundamental building block towards intelligent environments is the ability to understand who is present in a certain area. A ubiquitous way of detecting this is to exploit unique vocal characteristics as people interact with one another in common spaces. However, manually enrolling users into a bi...

Полное описание

Библиографические подробности
Главные авторы:	Lu, C, Xiangli, Y, Zhao, P, Chen, C, Trigoni, N, Markham, A
Формат:	Journal article
Язык:	English
Опубликовано:	Institute of Electrical and Electronics Engineers 2019

_version_	1826265481963634688
author	Lu, C Xiangli, Y Zhao, P Chen, C Trigoni, N Markham, A
author_facet	Lu, C Xiangli, Y Zhao, P Chen, C Trigoni, N Markham, A
author_sort	Lu, C
collection	OXFORD
description	A fundamental building block towards intelligent environments is the ability to understand who is present in a certain area. A ubiquitous way of detecting this is to exploit unique vocal characteristics as people interact with one another in common spaces. However, manually enrolling users into a biometric database is time-consuming and not robust to vocal deviations over time. Instead, consider audio features sampled during a meeting, yielding a noisy set of possible voiceprints. With a number of meetings and knowledge of participation, e.g., sniffed wireless Media Access Control (MAC) addresses, can we learn to associate a specific identity with a particular voiceprint? To address this problem, this paper advocates an Internet of Things (IoT) solution and proposes to use co-located WiFi as supervisory weak labels to automatically bootstrap the labelling process. In particular, a novel cross-modality labelling algorithm is proposed that jointly optimises the clustering and association process, which solves the inherent mismatching issues arising from heterogeneous sensor data. At the same time, we further propose to reuse the labelled data to iteratively update wireless geofence models and curate device specific thresholds. Extensive experimental results from two different scenarios demonstrate that our proposed method is able to achieve 2-fold improvement in labelling compared with conventional methods and can achieve reliable speaker recognition in the wild.
first_indexed	2024-03-06T20:24:22Z
format	Journal article
id	oxford-uuid:2ee9c18a-3fb8-4d92-bbfe-e2a7efe91b9c
institution	University of Oxford
language	English
last_indexed	2024-03-06T20:24:22Z
publishDate	2019
publisher	Institute of Electrical and Electronics Engineers
record_format	dspace
spelling	oxford-uuid:2ee9c18a-3fb8-4d92-bbfe-e2a7efe91b9c2022-03-26T12:51:50ZAutonomous learning of speaker identity and WiFi geofence from noisy sensor dataJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:2ee9c18a-3fb8-4d92-bbfe-e2a7efe91b9cEnglishSymplectic Elements at OxfordInstitute of Electrical and Electronics Engineers2019Lu, CXiangli, YZhao, PChen, CTrigoni, NMarkham, AA fundamental building block towards intelligent environments is the ability to understand who is present in a certain area. A ubiquitous way of detecting this is to exploit unique vocal characteristics as people interact with one another in common spaces. However, manually enrolling users into a biometric database is time-consuming and not robust to vocal deviations over time. Instead, consider audio features sampled during a meeting, yielding a noisy set of possible voiceprints. With a number of meetings and knowledge of participation, e.g., sniffed wireless Media Access Control (MAC) addresses, can we learn to associate a specific identity with a particular voiceprint? To address this problem, this paper advocates an Internet of Things (IoT) solution and proposes to use co-located WiFi as supervisory weak labels to automatically bootstrap the labelling process. In particular, a novel cross-modality labelling algorithm is proposed that jointly optimises the clustering and association process, which solves the inherent mismatching issues arising from heterogeneous sensor data. At the same time, we further propose to reuse the labelled data to iteratively update wireless geofence models and curate device specific thresholds. Extensive experimental results from two different scenarios demonstrate that our proposed method is able to achieve 2-fold improvement in labelling compared with conventional methods and can achieve reliable speaker recognition in the wild.
spellingShingle	Lu, C Xiangli, Y Zhao, P Chen, C Trigoni, N Markham, A Autonomous learning of speaker identity and WiFi geofence from noisy sensor data
title	Autonomous learning of speaker identity and WiFi geofence from noisy sensor data
title_full	Autonomous learning of speaker identity and WiFi geofence from noisy sensor data
title_fullStr	Autonomous learning of speaker identity and WiFi geofence from noisy sensor data
title_full_unstemmed	Autonomous learning of speaker identity and WiFi geofence from noisy sensor data
title_short	Autonomous learning of speaker identity and WiFi geofence from noisy sensor data
title_sort	autonomous learning of speaker identity and wifi geofence from noisy sensor data
work_keys_str_mv	AT luc autonomouslearningofspeakeridentityandwifigeofencefromnoisysensordata AT xiangliy autonomouslearningofspeakeridentityandwifigeofencefromnoisysensordata AT zhaop autonomouslearningofspeakeridentityandwifigeofencefromnoisysensordata AT chenc autonomouslearningofspeakeridentityandwifigeofencefromnoisysensordata AT trigonin autonomouslearningofspeakeridentityandwifigeofencefromnoisysensordata AT markhama autonomouslearningofspeakeridentityandwifigeofencefromnoisysensordata

Autonomous learning of speaker identity and WiFi geofence from noisy sensor data

Схожие документы