Generating identities with mixture models for speaker anonymization

Speaker anonymization methods are a growing research area, due to the common use of voice interfaces coupled with growing privacy requirements. However, existing systems suffer from several issues, in particular a reduction in the entropy space of the newly created voices. This is problematic as it...

Full description

Bibliographic Details
Main Authors: Turner, H, Lovisotto, G, Martinovic, I
Format: Journal article
Language:English
Published: Elsevier 2021
_version_ 1826308900979212288
author Turner, H
Lovisotto, G
Martinovic, I
author_facet Turner, H
Lovisotto, G
Martinovic, I
author_sort Turner, H
collection OXFORD
description Speaker anonymization methods are a growing research area, due to the common use of voice interfaces coupled with growing privacy requirements. However, existing systems suffer from several issues, in particular a reduction in the entropy space of the newly created voices. This is problematic as it reduces the diversity of the produced anonymous voices, thus making distinguishing between anonymized voices more difficult, and limiting the number of anonymous voices that can be generated. In this work we propose a method for creating the new identity component for anonymized voices, termed an x-vector, which aims to better reflect the natural diversity of voices, in turn increasing the diversity of the voices of anonymized speakers. We combine this identity generation method with existing anonymization schemes, to produce an overall anonymization system, which we evaluate. Our results demonstrate that our scheme creates more diverse anonymized voices than the existing baseline method. Furthermore, our results show that the assumption of perfect de-coupling between identity and non-identity voice components used in existing speaker anonymization frameworks does not hold, highlighting a clear avenue for future work.
first_indexed 2024-03-07T07:26:07Z
format Journal article
id oxford-uuid:be4c1542-7582-43b5-b4b8-334cbaec23a2
institution University of Oxford
language English
last_indexed 2024-03-07T07:26:07Z
publishDate 2021
publisher Elsevier
record_format dspace
spelling oxford-uuid:be4c1542-7582-43b5-b4b8-334cbaec23a22022-11-14T09:15:17ZGenerating identities with mixture models for speaker anonymizationJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:be4c1542-7582-43b5-b4b8-334cbaec23a2EnglishSymplectic ElementsElsevier2021Turner, HLovisotto, GMartinovic, ISpeaker anonymization methods are a growing research area, due to the common use of voice interfaces coupled with growing privacy requirements. However, existing systems suffer from several issues, in particular a reduction in the entropy space of the newly created voices. This is problematic as it reduces the diversity of the produced anonymous voices, thus making distinguishing between anonymized voices more difficult, and limiting the number of anonymous voices that can be generated. In this work we propose a method for creating the new identity component for anonymized voices, termed an x-vector, which aims to better reflect the natural diversity of voices, in turn increasing the diversity of the voices of anonymized speakers. We combine this identity generation method with existing anonymization schemes, to produce an overall anonymization system, which we evaluate. Our results demonstrate that our scheme creates more diverse anonymized voices than the existing baseline method. Furthermore, our results show that the assumption of perfect de-coupling between identity and non-identity voice components used in existing speaker anonymization frameworks does not hold, highlighting a clear avenue for future work.
spellingShingle Turner, H
Lovisotto, G
Martinovic, I
Generating identities with mixture models for speaker anonymization
title Generating identities with mixture models for speaker anonymization
title_full Generating identities with mixture models for speaker anonymization
title_fullStr Generating identities with mixture models for speaker anonymization
title_full_unstemmed Generating identities with mixture models for speaker anonymization
title_short Generating identities with mixture models for speaker anonymization
title_sort generating identities with mixture models for speaker anonymization
work_keys_str_mv AT turnerh generatingidentitieswithmixturemodelsforspeakeranonymization
AT lovisottog generatingidentitieswithmixturemodelsforspeakeranonymization
AT martinovici generatingidentitieswithmixturemodelsforspeakeranonymization