Hierarchical Reinforcement Learning for Open-Domain Dialog
<jats:p>Open-domain dialog generation is a challenging problem; maximum likelihood training can lead to repetitive outputs, models have difficulty tracking long-term conversational goals, and training on standard movie or online datasets may lead to the generation of inappropriate, biased, or...
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
Association for the Advancement of Artificial Intelligence (AAAI)
2022
|
Online Access: | https://hdl.handle.net/1721.1/146530 |