The fraud loss for selecting the model complexity in fraud detection

dc.date.accessioned	2023-02-09T18:14:18Z
dc.date.available	2023-02-09T18:14:18Z
dc.date.created	2022-08-29T13:56:12Z
dc.date.issued	2022
dc.identifier.citation	Brant, Simon Boge Haff, Ingrid Hobæk . The fraud loss for selecting the model complexity in fraud detection. Journal of Applied Statistics. 2022
dc.identifier.uri	http://hdl.handle.net/10852/99854
dc.description.abstract	Statistical fraud detection consists in making a system that automatically selects a subset of all cases (insurance claims, financial transactions, etc.) that are the most interesting for further investigation. The reason why such a system is needed is that the total number of cases typically is much higher than one realistically could investigate manually and that fraud tends to be quite rare. Further, the investigator is typically limited to controlling a restricted number k of cases, due to limited resources. The most efficient manner of allocating these resources is then to try selecting the k cases with the highest probability of being fraudulent. The prediction model used for this purpose must normally be regularised to avoid overfitting and consequently bad prediction performance. A loss function, denoted the fraud loss, is proposed for selecting the model complexity via a tuning parameter. A simulation study is performed to find the optimal settings for validation. Further, the performance of the proposed procedure is compared to the most relevant competing procedure, based on the area under the receiver operating characteristic curve (AUC), in a set of simulations, as well as on a credit card default dataset. Choosing the complexity of the model by the fraud loss resulted in either comparable or better results in terms of the fraud loss than choosing it according to the AUC.
dc.language	EN
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.title	The fraud loss for selecting the model complexity in fraud detection
dc.title.alternative	ENEngelskEnglishThe fraud loss for selecting the model complexity in fraud detection
dc.type	Journal article
dc.creator.author	Brant, Simon Boge
dc.creator.author	Haff, Ingrid Hobæk
cristin.unitcode	185,15,13,25
cristin.unitname	Statistikk og Data Science
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	1
dc.identifier.cristin	2046770
dc.identifier.bibliographiccitation	info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Journal of Applied Statistics&rft.volume=&rft.spage=&rft.date=2022
dc.identifier.jtitle	Journal of Applied Statistics
dc.identifier.startpage	1
dc.identifier.endpage	19
dc.identifier.pagecount	0
dc.identifier.doi	https://doi.org/10.1080/02664763.2022.2070137
dc.type.document	Tidsskriftartikkel
dc.type.peerreviewed	Peer reviewed
dc.source.issn	0266-4763
dc.type.version	PublishedVersion