A comparative study of methods for estimating model-agnostic Shapley value explanations

dc.date.accessioned	2024-04-11T15:32:56Z
dc.date.available	2024-04-11T15:32:56Z
dc.date.created	2024-04-09T13:42:49Z
dc.date.issued	2024
dc.identifier.citation	Olsen, Lars Henry Berge Glad, Ingrid Kristine Jullum, Martin Aas, Kjersti . A comparative study of methods for estimating model-agnostic Shapley value explanations. Data mining and knowledge discovery. 2024
dc.identifier.uri	http://hdl.handle.net/10852/110572
dc.description.abstract	Shapley values originated in cooperative game theory but are extensively used today as a model-agnostic explanation framework to explain predictions made by complex machine learning models in the industry and academia. There are several algorithmic approaches for computing different versions of Shapley value explanations. Here, we consider Shapley values incorporating feature dependencies, referred to as conditional Shapley values, for predictive models fitted to tabular data. Estimating precise conditional Shapley values is difficult as they require the estimation of non-trivial conditional expectations. In this article, we develop new methods, extend earlier proposed approaches, and systematize the new refined and existing methods into different method classes for comparison and evaluation. The method classes use either Monte Carlo integration or regression to model the conditional expectations. We conduct extensive simulation studies to evaluate how precisely the different method classes estimate the conditional expectations, and thereby the conditional Shapley values, for different setups. We also apply the methods to several real-world data experiments and provide recommendations for when to use the different method classes and approaches. Roughly speaking, we recommend using parametric methods when we can specify the data distribution almost correctly, as they generally produce the most accurate Shapley value explanations. When the distribution is unknown, both generative methods and regression models with a similar form as the underlying predictive model are good and stable options. Regression-based methods are often slow to train but quickly produce the Shapley value explanations once trained. The vice versa is true for Monte Carlo-based methods, making the different methods appropriate in different practical situations.
dc.description.abstract	A comparative study of methods for estimating model-agnostic Shapley value explanations
dc.language	EN
dc.rights	Attribution 4.0 International
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.title	A comparative study of methods for estimating model-agnostic Shapley value explanations
dc.title.alternative	ENEngelskEnglishA comparative study of methods for estimating model-agnostic Shapley value explanations
dc.type	Journal article
dc.creator.author	Olsen, Lars Henry Berge
dc.creator.author	Glad, Ingrid Kristine
dc.creator.author	Jullum, Martin
dc.creator.author	Aas, Kjersti
cristin.unitcode	185,15,13,25
cristin.unitname	Statistikk og Data Science
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	2
dc.identifier.cristin	2260263
dc.identifier.bibliographiccitation	info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Data mining and knowledge discovery&rft.volume=&rft.spage=&rft.date=2024
dc.identifier.jtitle	Data mining and knowledge discovery
dc.identifier.doi	https://doi.org/10.1007/s10618-024-01016-z
dc.type.document	Tidsskriftartikkel
dc.type.peerreviewed	Peer reviewed
dc.source.issn	1384-5810
dc.type.version	PublishedVersion
dc.relation.project	NFR/237718