dc.date.accessioned | 2017-04-26T10:56:25Z | |
dc.date.available | 2017-04-26T10:56:25Z | |
dc.date.created | 2017-04-19T17:08:30Z | |
dc.date.issued | 2017 | |
dc.identifier.citation | Kutuzov, Andrei; Kuzmenko, Elizaveta; Pivovarova, Lidia. Clustering of Russian Adjective-Noun Constructions using Word Embeddings. Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing. 2017, 3-13. Association for Computational Linguistics | |
dc.identifier.uri | http://hdl.handle.net/10852/55264 | |
dc.description.abstract | This paper presents a method of automatic construction extraction from a large corpus of Russian. The term ‘construction’ here means a multi-word expression in which a variable can be replaced with another word from the same semantic class, for example, a glass of [water/juice/milk]. We deal with constructions that consist of a noun and its adjective modifier. We propose a method of grouping such constructions into semantic classes via 2-step clustering of word vectors in distributional models. We compare it with other clustering techniques and evaluate it against A Russian-English Collocational Dictionary of the Human Body, which contains manually annotated groups of constructions with nouns denoting human body parts. The best-performing method is used to cluster all adjective-noun bigrams in the Russian National Corpus. The results of this procedure are publicly available and can be used to build a Russian construction dictionary, accelerate theoretical studies of constructions, and facilitate teaching Russian as a foreign language. | en_US |
dc.language | EN | |
dc.publisher | Association for Computational Linguistics | |
dc.rights | Attribution 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.title | Clustering of Russian Adjective-Noun Constructions using Word Embeddings | en_US |
dc.type | Chapter | en_US |
dc.creator.author | Kutuzov, Andrei | |
dc.creator.author | Kuzmenko, Elizaveta | |
dc.creator.author | Pivovarova, Lidia | |
cristin.unitcode | 185,15,5,56 | |
cristin.unitname | Forskningsgruppen for språkteknologi | |
cristin.ispublished | true | |
cristin.fulltext | original | |
dc.identifier.cristin | 1465584 | |
dc.identifier.bibliographiccitation | info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.btitle=Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing&rft.spage=3&rft.date=2017 | |
dc.identifier.startpage | 3 | |
dc.identifier.endpage | 13 | |
dc.identifier.pagecount | 125 | |
dc.identifier.urn | URN:NBN:no-58071 | |
dc.type.document | Book chapter | en_US |
dc.type.peerreviewed | Peer reviewed | |
dc.source.isbn | 978-1-945626-45-6 | |
dc.identifier.fulltext | Fulltext https://www.duo.uio.no/bitstream/handle/10852/55264/1/clustering_constructions.pdf | |
dc.type.version | PublishedVersion | |
cristin.btitle | Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing | |