Hide metadata

dc.contributor.authorWold, Sondre
dc.date.accessioned2022-08-23T22:04:18Z
dc.date.issued2022
dc.identifier.citationWold, Sondre. Language Models and World Knowledge: Injecting structured information using masked language modeling and adapters. Master thesis, University of Oslo, 2022
dc.identifier.urihttp://hdl.handle.net/10852/95611
dc.description.abstractCombining structured information with language models is a standing problem in NLP. Building on previous work, we study how lightweight neural networks, known as adapters, can be used to inject information from a knowledge graph into two popular pre-trained language models based on the transformer architecture. The adapters are trained using the masked language modeling objective over extracted triples from ConceptNet, a knowledge graph that captures a range of world knowledge and commonsense concepts and relations. Experiments on three popular NLP benchmarks believed to require world knowledge and commonsense reasoning abilities show that the adapter injection does not increase performance on these tasks. However, probing experiments indicate that the injected models are better at recovering factual information seen during training, and that this can be achieved by introducing a small amount of additional parameters to the overall model. Ablation studies show that the injected knowledge is distributed equally among the layers in the underlying model. Furthermore, using the AdapterFusion framework, we propose and perform initial testing of a two-step learning algorithm that partitions ConceptNet by predicate type and trains a set of disjoint adapters that are later combined using an attention mechanism. For reproducibility, we present a reproduction of the most related previous work and release our code.eng
dc.language.isoeng
dc.subject
dc.titleLanguage Models and World Knowledge: Injecting structured information using masked language modeling and adapterseng
dc.typeMaster thesis
dc.date.updated2022-08-24T22:01:31Z
dc.creator.authorWold, Sondre
dc.identifier.urnURN:NBN:no-98134
dc.type.documentMasteroppgave
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/95611/1/thesis.pdf


Files in this item

Appears in the following Collection

Hide metadata