  • de Gibert, Ona; Nail, Graeme; Arefev, Nikolay; Bañón, Marta; van der Linde, Jelmer; Ji, Shaoxiong; Zaragoza-Bernabeu, Jaume; Aulamo, Mikko; Ramírez-Sánchez, Gema; Kutuzov, Andrei; Pyysalo, Sampo; Oepen, Stephan; Tiedemann, Jörg (Chapter / Bokkapittel / PublishedVersion; Peer reviewed, 2024)
    We present the HPLT (High Performance Language Technologies) language resources, a new massive multilingual dataset including both monolingual and bilingual corpora extracted from CommonCrawl and previously unused web ...
  • Buljan, Maja; Nivre, Joakim; Oepen, Stephan; Øvrelid, Lilja (Journal article / Tidsskriftartikkel / PublishedVersion; Peer reviewed, 2022)
    We discuss methodological choices in diagnostic evaluation and error analysis in meaning representation parsing (MRP), i.e. mapping from natural language utterances to graph-based encodings of semantic structure. ...
  • Buljan, Maja; Nivre, Joakim; Oepen, Stephan; Øvrelid, Lilja (Chapter / Bokkapittel / PublishedVersion; Peer reviewed, 2020)
    We discuss methodological choices in contrastive and diagnostic evaluation in meaning representation parsing, i.e. mapping from natural language utterances to graph-based encodings of its semantic structure. Drawing ...
  • Lapponi, Emanuele; Velldal, Erik; Vazov, Nikolay; Oepen, Stephan (Chapter / Bokkapittel / AcceptedVersion; Peer reviewed, 2013)
    This demonstration presents a first operable pilot of the Language Analysis Portal (LAP), an ongoing project within the Norwegian CLARINO initiative that aims at providing easy access to Language Technology (LT) tools ...
  • Ivanova, Angelina; Oepen, Stephan; Dridan, Rebecca; Flickinger, Dan; Øvrelid, Lilja (Chapter / Bokkapittel / PublishedVersion; Peer reviewed, 2013)
    We compare three different approaches to parsing into syntactic, bi-lexical dependencies for English: a ‘direct’ data-driven dependency parser, a statistical phrase structure parser, and a hybrid, ‘deep’ grammar-driven ...
  • Kouylekov, Milen; Oepen, Stephan (Chapter / Bokkapittel / PublishedVersion; Peer reviewed, 2014)
    With growing interest in the creation and search of linguistic annotations that form general graphs (in contrast to formally simpler, rooted trees), there also is an increased need for infrastructures that support the ...
  • Lapponi, Emanuele; Søyland, Martin G.; Velldal, Erik; Oepen, Stephan (Journal article / Tidsskriftartikkel / PublishedVersion; Peer reviewed, 2018)
    In this work we present the Talk of Norway (ToN) data set, a collection of Norwegian Parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata harvested from different sources, and augmented ...
  • Lapponi, Emanuele; Velldal, Erik; Vazov, Nikolay Aleksandrov; Oepen, Stephan (Chapter / Bokkapittel / AcceptedVersion; Peer reviewed, 2013)
    This paper documents ongoing work within the Norwegian CLARINO project on building a Language Analysis Portal (LAP). The portal will provide an intuitive and easily accessible web interface to a centralized repository of ...
  • Fares, Murhaf; Oepen, Stephan; Velldal, Erik (Chapter / Bokkapittel / PublishedVersion; Peer reviewed, 2018)
    In this paper, we empirically evaluate the utility of transfer and multi-task learning on a challenging semantic classification task: semantic interpretation of noun-noun compounds. Through a comprehensive series of ...
  • Flickinger, Dan; Oepen, Stephan; Ytrestøl, Gisle (Chapter / Bokkapittel / PublishedVersion; Peer reviewed, 2010)
  • Kutuzov, Andrei; Fares, Murhaf; Oepen, Stephan; Velldal, Erik (Chapter / Bokkapittel / PublishedVersion; Peer reviewed, 2017)
    This paper describes an emerging shared repository of large-text resources for creating word vectors, including pre-processed corpora and pre-trained vectors for a range of frameworks and configurations. This will facilitate ...