Acoustic research for telecoms: bridging the heritage to the future

Bibliographic Details
Main Authors: Nicol Rozenn, Monfort Jean-Yves
Format: Article
Language: English
Published: EDP Sciences 2023-01-01
Series: Acta Acustica
Online Access: https://acta-acustica.edpsciences.org/articles/aacus/full_html/2023/01/aacus230021/aacus230021.html
author Nicol Rozenn
Monfort Jean-Yves
collection DOAJ
description In its early days, telecommunication focused on voice communication, and acoustics was at the heart of work on speech coding and transmission, automatic speech recognition, and speech synthesis, all aimed at offering better quality (Quality of Experience, or QoE) and enhanced services to users. As technology has evolved, research themes have diversified, but acoustics remains essential. This paper gives an overview of the evolution of acoustic research for telecommunication. Communication was initially (and for a long time) audio only, with monophonic narrow-band sound (i.e. [300–3400 Hz]). After the bandwidth extension (from the wide-band [100–7000 Hz] to the full-band [20 Hz–20 kHz] range), the next major step was the introduction of 3D sound, either to provide telepresence in audioconferencing or videoconferencing, or to enhance the QoE of content such as radio, television, VOD, or video games. Loudspeaker or microphone arrays have been deployed to implement “Holophonic” or “Ambisonic” systems. The interaction between spatialized sounds and 3D images was also investigated. At the end of the 2000s, smartphones invaded our lives. Binaural sound was immediately acknowledged as the most suitable technology for reproducing 3D audio on smartphones. However, to achieve a satisfactory QoE, binaural filters need to be customized to the listener’s morphology. This issue is the main obstacle to mass-market distribution of binaural sound, and solving it has prompted a large amount of work. In parallel with the development of technologies, their perceptual evaluation was an equally important area of research. In addition to conventional methods, innovative approaches have been explored for assessing sound spatialization, such as physiological measurement, neuroscience tools, or Virtual Reality (VR). The latest development is the use of acoustics as a universal sensor for the Internet of Things (IoT) and connected environments. Microphones can be deployed, preferably sparingly, to monitor surrounding sounds, with the goal of detecting information or events using automatic sound recognition models based on neural networks. Applications range from security and personal assistance to acoustic measurement of biodiversity. As for the control of environments or objects, voice commands have become widespread in recent years thanks to the tremendous progress made in speech recognition, but Brain Computer Interfaces (BCIs) propose an even more intuitive mode based on direct control by the mind; they rely on sensory stimulation using different modalities, among which the auditory one offers some advantages.
first_indexed 2024-03-08T17:17:19Z
id doaj.art-da29515caa364a0cbcf2da64f0e40b22
institution Directory Open Access Journal
issn 2681-4617
last_indexed 2024-03-08T17:17:19Z
spelling doaj.art-da29515caa364a0cbcf2da64f0e40b22; 2024-01-03T10:47:20Z; eng; EDP Sciences; Acta Acustica; ISSN 2681-4617; 2023-01-01; volume 7, article 64; DOI 10.1051/aacus/2023056; aacus230021; Nicol Rozenn (Orange Labs; https://orcid.org/0009-0002-7060-1445); Monfort Jean-Yves (President of the Centre de Découverte du Son, Kerouspic)
title Acoustic research for telecoms: bridging the heritage to the future
topic telecommunication
spatial audio (wavefield synthesis – wfs, higher order ambisonics – hoa, binaural)
quality of experience
electroencephalogram (eeg)
automatic sound recognition