Hide metadata

dc.date.accessioned2022-02-05T18:16:01Z
dc.date.available2022-02-05T18:16:01Z
dc.date.created2021-11-17T13:53:44Z
dc.date.issued2021
dc.identifier.citationSolberg, Lars Erik Dahl, Tobias Gulden Næs, Tormod . Making sense of multiple distance matrices through common and distinct components. Journal of Chemometrics. 2021, 1-24
dc.identifier.urihttp://hdl.handle.net/10852/90556
dc.description.abstractMultiblock analysis attacks the problem of how to combine data from various data sources for purposes such as prediction, classification, clustering, or visual data analysis. A key concept is the distinction between “common” and “distinct” parts, that is, what information repeats itself across the blocks and what is unique to an individual block. The statistical field of multiblock analysis holds many different approaches, which leads to different treatments both of the terms distinct and common themselves and to differences in the numerical results. In this article, we extend the discussion of distinct and common in multiblock analysis to the domain of distance matrices, that is, the situation where data point sets, so-called configurations, are analyzed via relative distances either because configurations are not available directly or because a distance representation is favorable. Situations typical for chemometrics will be highlighted and illustrated in examples. When analyzing different methods, we have focused on three key aspects. First, during the transition from the distance to configuration domains, one needs to consider how multiple distance matrices are treated. Second, when extracting common and distinct parts, one needs to manage a tradeoff between explaining variance and ensuring similarity between subspaces. Third, there is a design choice to be made as to whether the subspace containing the common parts is “shared” between blocks or if separate subspaces are associated with each individual block. The three aspects help to categorize and explain well-known methods in the field. A selection of methods was analyzed and subsequently applied to examples.
dc.languageEN
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleMaking sense of multiple distance matrices through common and distinct components
dc.typeJournal article
dc.creator.authorSolberg, Lars Erik
dc.creator.authorDahl, Tobias Gulden
dc.creator.authorNæs, Tormod
cristin.unitcode185,15,5,0
cristin.unitnameInstitutt for informatikk
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1
dc.identifier.cristin1955569
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Journal of Chemometrics&rft.volume=&rft.spage=1&rft.date=2021
dc.identifier.jtitleJournal of Chemometrics
dc.identifier.volume35
dc.identifier.issue11
dc.identifier.doihttps://doi.org/10.1002/cem.3372
dc.identifier.urnURN:NBN:no-93158
dc.type.documentTidsskriftartikkel
dc.type.peerreviewedPeer reviewed
dc.source.issn0886-9383
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/90556/1/Journal%2Bof%2BChemometrics%2B-%2B2021%2B-%2BSolberg%2B-%2BMaking%2Bsense%2Bof%2Bmultiple%2Bdistance%2Bmatrices%2Bthrough%2Bcommon%2Band%2Bdistinct.pdf
dc.type.versionPublishedVersion
cristin.articleide3372
dc.relation.projectNOFIMA/201702
dc.relation.projectNFR/262308


Files in this item

Appears in the following Collection

Hide metadata

Attribution 4.0 International
This item's license is: Attribution 4.0 International