Hide metadata

dc.date.accessioned2014-07-16T13:22:42Z
dc.date.available2014-07-16T13:22:42Z
dc.date.created2013-09-05T11:04:59Z
dc.date.issued2013
dc.identifier.citationLison, Pierre . Model-based Bayesian Reinforcement Learning for Dialogue Management. Proceedings of the International Conference on Spoken Language Processing. 2013
dc.identifier.urihttp://hdl.handle.net/10852/39380
dc.description.abstractReinforcement learning methods are increasingly used to optimise dialogue policies from experience. Most current techniques are model-free: they directly estimate the utility of various actions, without explicit model of the interaction dynamics. In this paper, we investigate an alternative strategy grounded in model-based Bayesian reinforcement learning. Bayesian inference is used to maintain a posterior distribution over the model parameters, reflecting the model uncertainty. This parameter distribution is gradually refined as more data is collected and simultaneously used to plan the agent's actions. Within this learning framework, we carried out experiments with two alternative formalisations of the transition model, one encoded with standard multinomial distributions, and one structured with probabilistic rules. We demonstrate the potential of our approach with empirical results on a user simulator constructed from Wizard-of-Oz data in a human-robot interaction scenario. The results illustrate in particular the benefits of capturing prior domain knowledge with high-level rules. Lison, Pierre (2013): "Model-based Bayesian reinforcement learning for dialogue management", In INTERSPEECH-2013, 475-479, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, ed. by F. Bimbot, C. Cerisara, C. Fougeron, G. Gravier, L. Lamel, F. Pellegrino, and P. Perrier, ISSN 2308-457X; ISCA Archive, http://www.isca-speech.org/archive/interspeech_2013
dc.languageEN
dc.language.isoenen_US
dc.publisherInternational Speech Communication Association
dc.titleModel-based Bayesian Reinforcement Learning for Dialogue Managementen_US
dc.typeJournal articleen_US
dc.creator.authorLison, Pierre
cristin.unitcode185,15,5,56
cristin.unitnameForskningsgruppen for språkteknologi
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1
dc.identifier.cristin1047112
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Proceedings of the International Conference on Spoken Language Processing&rft.volume=&rft.spage=&rft.date=2013
dc.identifier.jtitleProceedings of the International Conference on Spoken Language Processing
dc.identifier.urnURN:NBN:no-44199
dc.type.documentTidsskriftartikkelen_US
dc.type.peerreviewedPeer reviewed
dc.source.issn1990-9772
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/39380/2/mbbrldm-plison-is2013.pdf
dc.type.versionAcceptedVersion


Files in this item

Appears in the following Collection

Hide metadata