3 directional Inception-ResUNet: Deep spatial feature learning for multichannel singing voice separation with distortion.

Singing voice separation on robots faces the problem of interpreting ambiguous auditory signals. The acoustic signal, which the humanoid robot perceives through its onboard microphones, is a mixture of singing voice, music, and noise, with distortion, attenuation, and reverberation. In this paper, w...

Full description

Bibliographic Details
Main Authors: DaDong Wang, Jie Wang, MingChen Sun
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0289453