Hide metadata

dc.date.accessioned2019-03-28T08:45:59Z
dc.date.available2019-03-28T08:45:59Z
dc.date.created2018-05-22T09:30:11Z
dc.date.issued2018
dc.identifier.citationØvrelid, Lilja Kåsen, Andre Hagen, Kristin Nøklestad, Anders Solberg, Per Erik Johannessen, Janne Bondi . The LIA Treebank of Spoken Norwegian Dialects. Proceedings of the Eleventh International Conference on Language Resources and Evaluation. 2018, 4482-4488 European Language Resources Association
dc.identifier.urihttp://hdl.handle.net/10852/67446
dc.description.abstractThis article presents the LIA treebank of transcribed spoken Norwegian dialects. It consists of dialect recordings made in the period between 1950--1990, which have been digitised, transcribed, and subsequently annotated with morphological and dependency-style syntactic analysis as part of the LIA (Language Infrastructure made Accessible) project at the University of Oslo. In this article, we describe the LIA material of dialect recordings and its transcription, transliteration and further morphosyntactic annotation. We focus in particular on the extension of the native NDT annotation scheme to spoken language phenomena, such as pauses and various types of disfluencies, and present the subsequent conversion of the treebank to the Universal Dependencies scheme. The treebank currently consists of 13,608 tokens, distributed over 1396 segments taken from three different dialects of spoken Norwegian. The LIA treebank annotation is an on-going effort and future releases will extend on the current data set.
dc.languageEN
dc.publisherEuropean Language Resources Association
dc.rightsAttribution-NonCommercial 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc/4.0/
dc.titleThe LIA Treebank of Spoken Norwegian Dialects
dc.title.alternativeENEngelskEnglishThe LIA Treebank of Spoken Norwegian Dialects
dc.typeChapter
dc.creator.authorØvrelid, Lilja
dc.creator.authorKåsen, Andre
dc.creator.authorHagen, Kristin
dc.creator.authorNøklestad, Anders
dc.creator.authorSolberg, Per Erik
dc.creator.authorJohannessen, Janne Bondi
cristin.unitcode185,15,5,0
cristin.unitnameInstitutt for informatikk
cristin.ispublishedtrue
cristin.fulltextoriginal
dc.identifier.cristin1585815
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.btitle=Proceedings of the Eleventh International Conference on Language Resources and Evaluation&rft.spage=4482&rft.date=2018
dc.identifier.startpage4482
dc.identifier.endpage4488
dc.identifier.pagecount2240
dc.identifier.urnURN:NBN:no-70649
dc.type.documentBokkapittel
dc.type.peerreviewedPeer reviewed
dc.source.isbn979-10-95546-00-9
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/67446/1/LIA%2BTreebank_2018.pdf
dc.type.versionPublishedVersion
cristin.btitleProceedings of the Eleventh International Conference on Language Resources and Evaluation


Files in this item

Appears in the following Collection

Hide metadata

Attribution-NonCommercial 4.0 International
This item's license is: Attribution-NonCommercial 4.0 International