Human-centric dialog training via offline reinforcement learning
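Main Authors: | Jaques, Natasha, Shen, Judy Hanwen, Ghandeharioun, Asma, Ferguson, Craig, Lapedriza, Agata, Jones, Noah, Gu, Shixiang, Picard, Rosalind W. |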
---|---|
Other Authors: | Program in Media Arts and Sciences (Massachusetts Institute of Technology) |
Format: | Article |
Language: | English |
Published: | Association for Computational Linguistics (ACL) 2022 |
Online Access: | https://hdl.handle.net/1721.1/146608 |
_version_ | 1826204959510626304 |
---|---|
author | Jaques, Natasha; Shen, Judy Hanwen; Ghandeharioun, Asma; Ferguson, Craig; Lapedriza, Agata; Jones, Noah; Gu, Shixiang; Picard, Rosalind W. |
author2 | Program in Media Arts and Sciences (Massachusetts Institute of Technology) |
author_facet | Program in Media Arts and Sciences (Massachusetts Institute of Technology); Jaques, Natasha; Shen, Judy Hanwen; Ghandeharioun, Asma; Ferguson, Craig; Lapedriza, Agata; Jones, Noah; Gu, Shixiang; Picard, Rosalind W. |
author_sort | Jaques, Natasha |
collection | MIT |
first_indexed | 2024-09-23T13:04:19Z |
format | Article |
id | mit-1721.1/146608 |
institution | Massachusetts Institute of Technology |
language | English |
last_indexed | 2024-09-23T13:04:19Z |
publishDate | 2022 |
publisher | Association for Computational Linguistics (ACL) |
record_format | dspace |
spelling | mit-1721.1/146608 2024-08-09T20:13:52Z Human-centric dialog training via offline reinforcement learning Jaques, Natasha Shen, Judy Hanwen Ghandeharioun, Asma Ferguson, Craig Lapedriza, Agata Jones, Noah Gu, Shixiang Picard, Rosalind W. Program in Media Arts and Sciences (Massachusetts Institute of Technology) 2022-11-23T14:51:55Z 2022-11-23T14:51:55Z 2020 2022-11-23T14:43:22Z Article http://purl.org/eprint/type/ConferencePaper https://hdl.handle.net/1721.1/146608 Jaques, Natasha, Shen, Judy Hanwen, Ghandeharioun, Asma, Ferguson, Craig, Lapedriza, Agata et al. 2020. "Human-centric dialog training via offline reinforcement learning." Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). en 10.18653/V1/2020.EMNLP-MAIN.327 Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. application/pdf Association for Computational Linguistics (ACL) Association for Computational Linguistics |
spellingShingle | Jaques, Natasha Shen, Judy Hanwen Ghandeharioun, Asma Ferguson, Craig Lapedriza, Agata Jones, Noah Gu, Shixiang Picard, Rosalind W. Human-centric dialog training via offline reinforcement learning |
title | Human-centric dialog training via offline reinforcement learning |
title_sort | human centric dialog training via offline reinforcement learning |
url | https://hdl.handle.net/1721.1/146608 |