Hide metadata

dc.date.accessioned2023-02-09T18:14:18Z
dc.date.available2023-02-09T18:14:18Z
dc.date.created2022-08-29T13:56:12Z
dc.date.issued2022
dc.identifier.citationBrant, Simon Boge Haff, Ingrid Hobæk . The fraud loss for selecting the model complexity in fraud detection. Journal of Applied Statistics. 2022
dc.identifier.urihttp://hdl.handle.net/10852/99854
dc.description.abstractStatistical fraud detection consists in making a system that automatically selects a subset of all cases (insurance claims, financial transactions, etc.) that are the most interesting for further investigation. The reason why such a system is needed is that the total number of cases typically is much higher than one realistically could investigate manually and that fraud tends to be quite rare. Further, the investigator is typically limited to controlling a restricted number k of cases, due to limited resources. The most efficient manner of allocating these resources is then to try selecting the k cases with the highest probability of being fraudulent. The prediction model used for this purpose must normally be regularised to avoid overfitting and consequently bad prediction performance. A loss function, denoted the fraud loss, is proposed for selecting the model complexity via a tuning parameter. A simulation study is performed to find the optimal settings for validation. Further, the performance of the proposed procedure is compared to the most relevant competing procedure, based on the area under the receiver operating characteristic curve (AUC), in a set of simulations, as well as on a credit card default dataset. Choosing the complexity of the model by the fraud loss resulted in either comparable or better results in terms of the fraud loss than choosing it according to the AUC.
dc.languageEN
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleThe fraud loss for selecting the model complexity in fraud detection
dc.title.alternativeENEngelskEnglishThe fraud loss for selecting the model complexity in fraud detection
dc.typeJournal article
dc.creator.authorBrant, Simon Boge
dc.creator.authorHaff, Ingrid Hobæk
cristin.unitcode185,15,13,25
cristin.unitnameStatistikk og Data Science
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1
dc.identifier.cristin2046770
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Journal of Applied Statistics&rft.volume=&rft.spage=&rft.date=2022
dc.identifier.jtitleJournal of Applied Statistics
dc.identifier.startpage1
dc.identifier.endpage19
dc.identifier.pagecount0
dc.identifier.doihttps://doi.org/10.1080/02664763.2022.2070137
dc.type.documentTidsskriftartikkel
dc.type.peerreviewedPeer reviewed
dc.source.issn0266-4763
dc.type.versionPublishedVersion


Files in this item

Appears in the following Collection

Hide metadata

Attribution-NonCommercial-NoDerivatives 4.0 International
This item's license is: Attribution-NonCommercial-NoDerivatives 4.0 International