Hide metadata

dc.date.accessioned2021-12-17T19:11:15Z
dc.date.available2021-12-17T19:11:15Z
dc.date.created2021-11-30T11:14:10Z
dc.date.issued2021
dc.identifier.citationPõldvere, Nele Johansson, Victoria Paradis, Carita . On the London–Lund Corpus 2: Design, challenges and innovations. English Language and Linguistics. 2021, 25(3), 459-483
dc.identifier.urihttp://hdl.handle.net/10852/89624
dc.description.abstractThis article describes and critically examines the challenging task of compiling The London–Lund Corpus 2 (LLC–2) from start to end, accounting for the methodological decisions made in each stage and highlighting the innovations. LLC–2 is a half-a-million-word corpus of contemporary spoken British English with recordings from 2014 to 2019. Its size and design are the same as those of the world's first machine-readable spoken corpus, The London–Lund Corpus of Spoken English with data from the 1950s to 1980s. In this way, LLC–2 allows not only for synchronic investigations of contemporary speech but also for principled diachronic research of spoken language across time. Each stage of the compilation of LLC–2 posed its own challenges, ranging from the design of the corpus, the recruitment of the speakers, transcription, markup and annotation procedures, to the release of the corpus to the international research community. The decisions and solutions represent state-of-the-art practices of spoken corpus compilation with important innovations that enhance the value of LLC–2 for spoken corpus research, such as the availability of both the transcriptions and the corresponding time-aligned audio files in a standard compliant format.
dc.languageEN
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleOn the London–Lund Corpus 2: Design, challenges and innovations
dc.typeJournal article
dc.creator.authorPõldvere, Nele
dc.creator.authorJohansson, Victoria
dc.creator.authorParadis, Carita
cristin.unitcode185,14,34,70
cristin.unitnameRussland, Sentral-Europa og Balkan
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2
dc.identifier.cristin1961534
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=English Language and Linguistics&rft.volume=25&rft.spage=459&rft.date=2021
dc.identifier.jtitleEnglish Language and Linguistics
dc.identifier.volume25
dc.identifier.issue3
dc.identifier.startpage459
dc.identifier.endpage483
dc.identifier.doihttps://doi.org/10.1017/S1360674321000186
dc.identifier.urnURN:NBN:no-92212
dc.type.documentTidsskriftartikkel
dc.type.peerreviewedPeer reviewed
dc.source.issn1360-6743
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/89624/1/P%25C3%25B5ldvereetal_2021_LondonLundCorpus2.pdf
dc.type.versionPublishedVersion


Files in this item

Appears in the following Collection

Hide metadata

Attribution 4.0 International
This item's license is: Attribution 4.0 International