Hide metadata

dc.date.accessioned2017-12-12T16:36:57Z
dc.date.available2018-11-28T23:31:25Z
dc.date.created2017-11-28T15:46:13Z
dc.date.issued2018
dc.identifier.citationSeibold, Heidi Bernau, Christoph Boulesteix, Anne-Laure De Bin, Riccardo . On the choice and influence of the number of boosting steps for high-dimensional linear Cox-models. Computational statistics (Zeitschrift). 2018
dc.identifier.urihttp://hdl.handle.net/10852/59345
dc.description.abstractIn biomedical research, boosting-based regression approaches have gained much attention in the last decade. Their intrinsic variable selection procedure and ability to shrink the estimates of the regression coefficients toward 0 make these techniques appropriate to fit prediction models in the case of high-dimensional data, e.g. gene expressions. Their prediction performance, however, highly depends on specific tuning parameters, in particular on the number of boosting iterations to perform. This crucial parameter is usually selected via cross-validation. The cross-validation procedure may highly depend on a completely random component, namely the considered fold partition. We empirically study how much this randomness affects the results of the boosting techniques, in terms of selected predictors and prediction ability of the related models. We use four publicly available data sets related to four different diseases. In these studies, the goal is to predict survival end-points when a large number of continuous candidate predictors are available. We focus on two well known boosting approaches implemented in the R-packages CoxBoost and mboost, assuming the validity of the proportional hazards assumption and the linearity of the effects of the predictors. We show that the variability in selected predictors and prediction ability of the model is reduced by averaging over several repetitions of cross-validation in the selection of the tuning parameters. The final version of this research has been published in Computational Statistics. © 2017 Springer Verlagen_US
dc.languageEN
dc.titleOn the choice and influence of the number of boosting steps for high-dimensional linear Cox-modelsen_US
dc.typeJournal articleen_US
dc.creator.authorSeibold, Heidi
dc.creator.authorBernau, Christoph
dc.creator.authorBoulesteix, Anne-Laure
dc.creator.authorDe Bin, Riccardo
cristin.unitcode185,15,13,25
cristin.unitnameStatistikk og biostatistikk
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1
dc.identifier.cristin1519748
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Computational statistics (Zeitschrift)&rft.volume=&rft.spage=&rft.date=2018
dc.identifier.jtitleComputational statistics (Zeitschrift)
dc.identifier.pagecount21
dc.identifier.doihttp://dx.doi.org/10.1007/s00180-017-0773-8) contains
dc.identifier.urnURN:NBN:no-62031
dc.type.documentTidsskriftartikkelen_US
dc.type.peerreviewedPeer reviewed
dc.source.issn0943-4062
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/59345/2/choice-influence-number.pdf
dc.type.versionAcceptedVersion


Files in this item

Appears in the following Collection

Hide metadata