Hide metadata

dc.date.accessioned2020-06-08T18:22:13Z
dc.date.available2020-06-08T18:22:13Z
dc.date.created2020-02-06T22:49:09Z
dc.date.issued2019
dc.identifier.citationKåsen, Andre Hagen, Kristin Nøklestad, Anders Priestley, Joel . Tagging a Norwegian Dialect Corpus. Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa). 2019, 350-355 Linköping University Electronic Press
dc.identifier.urihttp://hdl.handle.net/10852/76798
dc.description.abstractThis paper describes an evaluation of five data-driven part-of-speech (PoS) taggers for spoken Norwegian. The taggers all rely on different machine learning mechanisms: decision trees, hidden Markov models (HMMs), conditional random fields (CRFs), long-short term memory networks (LSTMs), and convolutional neural networks (CNNs). We go into some of the challenges posed by the task of tagging spoken, as opposed to written, language, and in particular a wide range of dialects as is found in the recordings of the LIA (Language Infrastructure made Accessible) project. The results show that the taggers based on either conditional random fields or neural networks perform much better than the rest, with the LSTM tagger getting the highest score.
dc.languageEN
dc.publisherLinköping University Electronic Press
dc.relation.ispartofLinköping Electronic Conference Proceedings
dc.relation.ispartofseriesLinköping Electronic Conference Proceedings
dc.titleTagging a Norwegian Dialect Corpus
dc.typeChapter
dc.creator.authorKåsen, Andre
dc.creator.authorHagen, Kristin
dc.creator.authorNøklestad, Anders
dc.creator.authorPriestley, Joel
cristin.unitcode185,15,5,0
cristin.unitnameInstitutt for informatikk
cristin.ispublishedtrue
cristin.fulltextoriginal
dc.identifier.cristin1791843
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.btitle=Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa)&rft.spage=350&rft.date=2019
dc.identifier.startpage350
dc.identifier.endpage355
dc.identifier.pagecount410
dc.identifier.urnURN:NBN:no-79896
dc.type.documentBokkapittel
dc.type.peerreviewedPeer reviewed
dc.source.isbn978-91-7929-995-8
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/76798/2/TaggingaNorwegianDialectCorpus.pdf
dc.type.versionPublishedVersion
cristin.btitleProceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa)
dc.relation.projectNFR/225941
dc.relation.projectNFR/223265
dc.relation.projectNOTUR/NORSTORE/NS9014K
dc.relation.projectNOTUR/NORSTORE/NN9139K


Files in this item

Appears in the following Collection

Hide metadata