Hierarchical Reinforcement Learning for Open-Domain Dialog

<jats:p>Open-domain dialog generation is a challenging problem; maximum likelihood training can lead to repetitive outputs, models have difficulty tracking long-term conversational goals, and training on standard movie or online datasets may lead to the generation of inappropriate, biased, or...

Full description

Bibliographic Details
Main Authors: Saleh, Abdelrhman, Jaques, Natasha, Ghandeharioun, Asma, Shen, Judy, Picard, Rosalind W.
Other Authors: Program in Media Arts and Sciences (Massachusetts Institute of Technology)
Format: Article
Language:English
Published: Association for the Advancement of Artificial Intelligence (AAAI) 2022
Online Access:https://hdl.handle.net/1721.1/146530