Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

© 2019 Association for Computing Machinery. Generative audio models based on neural networks have led to considerable improvements across fields including speech enhancement, source separation, and text-to-speech synthesis. These systems are typically trained in a supervised fashion using simple ele...

Bibliographic Details
Main Authors:	Ananthabhotla, Ishwarya; Ewert, Sebastian; Paradiso, Joseph A
Format:	Article
Language:	English
Published:	Association for Computing Machinery (ACM) 2021
Online Access:	https://hdl.handle.net/1721.1/137115

Similar Items

Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models
by: Ananthabhotla, Ishwarya, et al.
Published: (2021)

Using a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networks
by: Ananthabhotla, Ishwarya, et al.
Published: (2021)

Using a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networks
by: Ananthabhotla, I, et al.
Published: (2021)

Towards audio codec-based speech separation
by: Yip, Jia Qi, et al.
Published: (2024)

Cognitive Audio: Enabling Auditory Interfaces with an Understanding of How We Hear
by: Ananthabhotla, Ishwarya
Published: (2022)

Manipulating Causal Uncertainty in Sound Objects
by: Boger, Tal, et al.
Published: (2022)

Perceptual synthesis engine : an audio-driven timbre generator
by: Jehan, Tristan, 1974-
Published: (2011)

Implementation of low bitrate audio codec using spectral band replication
by: Ramachandra Iyer Ananth
Published: (2010)

Companding techniques for high dynamic range audio CODEC receiver path
by: Ma, Yunjie, M. Eng. Massachusetts Institute of Technology
Published: (2010)

Perceptual coding of audio signals
by: Teh, Do Hui.
Published: (2009)

HCU400: an Annotated Dataset for Exploring Aural Phenomenology through Causal Uncertainty
by: Ananthabhotla, Ishwarya, et al.
Published: (2021)

System specific power reduction techniques for wearable navigation technology
by: Ananthabhotla, Ishwarya
Published: (2016)

Learning efficiently with approximate inference via dual losses
by: Meshi, Ofer, et al.
Published: (2011)

Perceptual watermarking and data concealment in audio signals
by: McLoughlin, Ian.
Published: (2008)

Perceptual watermarking and data concealment in audio signals
by: Tio, Cedric Meng Meng.
Published: (2008)

Performance of voice over IP (VoIP) over a wireless LAN (WLAN) for different audio/voice codecs
by: Mohd., Alias, et al.
Published: (2007)

Product perceptual mapping on fashion designs with Gaussian mixture variational autoencoder and triplet loss
by: Wang, Mike, M. Eng. Massachusetts Institute of Technology
Published: (2019)

Probing shallower: perceptual loss trained Phase Extraction Neural Network (PLT-PhENN) for artifact-free reconstruction at low photon budget
by: Deng, Mo, et al.
Published: (2020)

A simple and efficient algorithm for fused lasso signal approximator with convex loss function
by: Wang, Lichun, et al.
Published: (2013)

A simple and efficient algorithm for fused lasso signal approximator with convex loss function
by: You, Yuan, et al.
Published: (2013)

Physics-informed neural networks with non-differentiable loss
by: Yang, Junyan
Published: (2022)

Catalogue of a Loss
by: Berger, Larisa (Larisa A.)
Published: (2013)

Analysis of loss mechanisms in superconducting windings for rotating electric generators
by: Minervini, Joseph Vito
Published: (2009)

Efficiency loss in a class of two-sided market mechanisms
by: Neumayer, Sebastian James
Published: (2007)

Hardware architecture for perceptual watermarking and data concealment in audio signals
by: Robertus Wahendro Ali.
Published: (2008)

Turbo codec for software radio receivers
by: Guan, Yong Liang, et al.
Published: (2008)

Codec design for turbo coding scheme
by: Wang, Lei.
Published: (2008)

Learning with a Wasserstein loss
by: Araya-Polo, Mauricio, et al.
Published: (2017)

Robust generation of frequency combs in a microresonator with strong and narrowband loss
by: Wang, Jing, et al.
Published: (2021)

Audio quality moderates localisation accuracy : two distinct perceptual effects?
by: Lindborg, PerMagnus, et al.
Published: (2015)

Network reconfiguration for loss reduction with distributed generations using PSO
by: Dahalan, W.M., et al.
Published: (2012)

Artificial neural network approach to network reconfiguration for loss minimization in distribution networks
by: Kashem, M.A., et al.
Published: (1998)

When do stop-loss rules stop losses?
by: Kaminski, Kathryn M., et al.
Published: (2018)

Big Picture Codec for Multimedia Conferencing System
by: Ramadass, Sureswaran
Published: (2007)

Performance analysis of voice codec for VoIP
by: Abd. Khuther, Ali
Published: (2008)

Enhancement of GSM codec for different language contexts
by: Ding, Zhong Qiang
Published: (2008)

Indoor path loss modeling for fifth generation applications
by: Majed, Mohammed Bahjat
Published: (2023)

Towards Perceptual Augmentation
by: Chin, Sam
Published: (2024)

A study of consequential loss insurance.
by: Gwee, Chin Chin., et al.
Published: (2013)