Hide metadata

dc.date.accessioned2022-08-18T15:08:58Z
dc.date.available2022-08-18T15:08:58Z
dc.date.created2022-07-27T22:58:02Z
dc.date.issued2022
dc.identifier.citationRognes, Torbjørn Scheffer, Lonneke Greiff, Victor Sandve, Geir Kjetil Ferkingstad . CompAIRR: ultra-fast comparison of adaptive immune receptor repertoires by exact and approximate sequence matching. Bioinformatics. 2022
dc.identifier.urihttp://hdl.handle.net/10852/95065
dc.description.abstractAbstract Motivation Adaptive immune receptor (AIR) repertoires (AIRRs) record past immune encounters with exquisite specificity. Therefore, identifying identical or similar AIR sequences across individuals is a key step in AIRR analysis for revealing convergent immune response patterns that may be exploited for diagnostics and therapy. Existing methods for quantifying AIRR overlap scale poorly with increasing dataset numbers and sizes. To address this limitation, we developed CompAIRR, which enables ultra-fast computation of AIRR overlap, based on either exact or approximate sequence matching. Results CompAIRR improves computational speed 1000-fold relative to the state of the art and uses only one-third of the memory: on the same machine, the exact pairwise AIRR overlap of 104 AIRRs with 105 sequences is found in ∼17 min, while the fastest alternative tool requires 10 days. CompAIRR has been integrated with the machine learning ecosystem immuneML to speed up commonly used AIRR-based machine learning applications. Availability and implementation CompAIRR code and documentation are available at https://github.com/uio-bmi/compairr. Docker images are available at https://hub.docker.com/r/torognes/compairr. The code to replicate the synthetic datasets, scripts for benchmarking and creating figures, and all raw data underlying the figures are available at https://github.com/uio-bmi/compairr-benchmarking. Supplementary information Supplementary data are available at Bioinformatics online.
dc.languageEN
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleCompAIRR: ultra-fast comparison of adaptive immune receptor repertoires by exact and approximate sequence matching
dc.title.alternativeENEngelskEnglishCompAIRR: ultra-fast comparison of adaptive immune receptor repertoires by exact and approximate sequence matching
dc.typeJournal article
dc.creator.authorRognes, Torbjørn
dc.creator.authorScheffer, Lonneke
dc.creator.authorGreiff, Victor
dc.creator.authorSandve, Geir Kjetil Ferkingstad
cristin.unitcode185,15,5,43
cristin.unitnameBiomedisinsk Informatikk
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2
dc.identifier.cristin2039858
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Bioinformatics&rft.volume=&rft.spage=&rft.date=2022
dc.identifier.jtitleBioinformatics
dc.identifier.doihttps://doi.org/10.1093/bioinformatics/btac505
dc.identifier.urnURN:NBN:no-97588
dc.type.documentTidsskriftartikkel
dc.type.peerreviewedPeer reviewed
dc.source.issn1367-4803
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/95065/1/btac505.pdf
dc.type.versionPublishedVersion
cristin.articleidbtac505
dc.relation.projectSIGMA2/NN9383K
dc.relation.projectNFR/311341
dc.relation.projectNFR/300740
dc.relation.projectKF/215817
dc.relation.projectSIGMA2/NN9603K,NS9603K


Files in this item

Appears in the following Collection

Hide metadata

Attribution 4.0 International
This item's license is: Attribution 4.0 International