Model-based Bayesian Reinforcement Learning for Dialogue Management

dc.date.accessioned	2014-07-16T13:22:42Z
dc.date.available	2014-07-16T13:22:42Z
dc.date.created	2013-09-05T11:04:59Z
dc.date.issued	2013
dc.identifier.citation	Lison, Pierre. Model-based Bayesian Reinforcement Learning for Dialogue Management. Proceedings of the International Conference on Spoken Language Processing. 2013
dc.identifier.uri	http://hdl.handle.net/10852/39380
dc.description.abstract	Reinforcement learning methods are increasingly used to optimise dialogue policies from experience. Most current techniques are model-free: they directly estimate the utility of various actions, without an explicit model of the interaction dynamics. In this paper, we investigate an alternative strategy grounded in model-based Bayesian reinforcement learning. Bayesian inference is used to maintain a posterior distribution over the model parameters, reflecting the model uncertainty. This parameter distribution is gradually refined as more data is collected and simultaneously used to plan the agent's actions. Within this learning framework, we carried out experiments with two alternative formalisations of the transition model, one encoded with standard multinomial distributions, and one structured with probabilistic rules. We demonstrate the potential of our approach with empirical results on a user simulator constructed from Wizard-of-Oz data in a human-robot interaction scenario. The results illustrate in particular the benefits of capturing prior domain knowledge with high-level rules. Lison, Pierre (2013): "Model-based Bayesian reinforcement learning for dialogue management", In INTERSPEECH-2013, 475-479, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, ed. by F. Bimbot, C. Cerisara, C. Fougeron, G. Gravier, L. Lamel, F. Pellegrino, and P. Perrier, ISSN 2308-457X; ISCA Archive, http://www.isca-speech.org/archive/interspeech_2013
dc.language	EN
dc.language.iso	en	en_US
dc.publisher	International Speech Communication Association
dc.title	Model-based Bayesian Reinforcement Learning for Dialogue Management	en_US
dc.type	Journal article	en_US
dc.creator.author	Lison, Pierre
cristin.unitcode	185,15,5,56
cristin.unitname	Forskningsgruppen for språkteknologi
cristin.ispublished	true
cristin.fulltext	postprint
cristin.qualitycode	1
dc.identifier.cristin	1047112
dc.identifier.bibliographiccitation	info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Proceedings of the International Conference on Spoken Language Processing&rft.volume=&rft.spage=&rft.date=2013
dc.identifier.jtitle	Proceedings of the International Conference on Spoken Language Processing
dc.identifier.urn	URN:NBN:no-44199
dc.type.document	Journal article	en_US
dc.type.peerreviewed	Peer reviewed
dc.source.issn	1990-9772
dc.identifier.fulltext	Fulltext https://www.duo.uio.no/bitstream/handle/10852/39380/2/mbbrldm-plison-is2013.pdf
dc.type.version	AcceptedVersion

Files in this item

Name: mbbrldm-plison-is2013.pdf
Size: 795.9Kb
Format: application/

Appears in the following Collection

Institutt for informatikk [4944]
CRIStin høstingsarkiv [31417]
