dc.date.accessioned | 2019-03-28T08:45:59Z | |
dc.date.available | 2019-03-28T08:45:59Z | |
dc.date.created | 2018-05-22T09:30:11Z | |
dc.date.issued | 2018 | |
dc.identifier.citation | Øvrelid, Lilja Kåsen, Andre Hagen, Kristin Nøklestad, Anders Solberg, Per Erik Johannessen, Janne Bondi . The LIA Treebank of Spoken Norwegian Dialects. Proceedings of the Eleventh International Conference on Language Resources and Evaluation. 2018, 4482-4488 European Language Resources Association | |
dc.identifier.uri | http://hdl.handle.net/10852/67446 | |
dc.description.abstract | This article presents the LIA treebank of transcribed spoken Norwegian dialects. It consists of dialect recordings made in the period between 1950--1990, which have been digitised, transcribed, and subsequently annotated with morphological and dependency-style syntactic analysis as part of the LIA (Language Infrastructure made Accessible) project at the University of Oslo. In this article, we describe the LIA material of dialect recordings and its transcription, transliteration and further morphosyntactic annotation. We focus in particular on the extension of the native NDT annotation scheme to spoken language phenomena, such as pauses and various types of disfluencies, and present the subsequent conversion of the treebank to the Universal Dependencies scheme. The treebank currently consists of 13,608 tokens, distributed over 1396 segments taken from three different dialects of spoken Norwegian. The LIA treebank annotation is an on-going effort and future releases will extend on the current data set. | |
dc.language | EN | |
dc.publisher | European Language Resources Association | |
dc.rights | Attribution-NonCommercial 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc/4.0/ | |
dc.title | The LIA Treebank of Spoken Norwegian Dialects | |
dc.title.alternative | ENEngelskEnglishThe LIA Treebank of Spoken Norwegian Dialects | |
dc.type | Chapter | |
dc.creator.author | Øvrelid, Lilja | |
dc.creator.author | Kåsen, Andre | |
dc.creator.author | Hagen, Kristin | |
dc.creator.author | Nøklestad, Anders | |
dc.creator.author | Solberg, Per Erik | |
dc.creator.author | Johannessen, Janne Bondi | |
cristin.unitcode | 185,15,5,0 | |
cristin.unitname | Institutt for informatikk | |
cristin.ispublished | true | |
cristin.fulltext | original | |
dc.identifier.cristin | 1585815 | |
dc.identifier.bibliographiccitation | info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.btitle=Proceedings of the Eleventh International Conference on Language Resources and Evaluation&rft.spage=4482&rft.date=2018 | |
dc.identifier.startpage | 4482 | |
dc.identifier.endpage | 4488 | |
dc.identifier.pagecount | 2240 | |
dc.identifier.urn | URN:NBN:no-70649 | |
dc.type.document | Bokkapittel | |
dc.type.peerreviewed | Peer reviewed | |
dc.source.isbn | 979-10-95546-00-9 | |
dc.identifier.fulltext | Fulltext https://www.duo.uio.no/bitstream/handle/10852/67446/1/LIA%2BTreebank_2018.pdf | |
dc.type.version | PublishedVersion | |
cristin.btitle | Proceedings of the Eleventh International Conference on Language Resources and Evaluation | |