dc.date.accessioned: 2017-04-26T10:56:25Z
dc.date.available: 2017-04-26T10:56:25Z
dc.date.created: 2017-04-19T17:08:30Z
dc.date.issued: 2017
dc.identifier.citation: Kutuzov, Andrei; Kuzmenko, Elizaveta; Pivovarova, Lidia. Clustering of Russian Adjective-Noun Constructions using Word Embeddings. Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing. 2017, 3-13. Association for Computational Linguistics
dc.identifier.uri: http://hdl.handle.net/10852/55264
dc.description.abstract: This paper presents a method of automatic construction extraction from a large corpus of Russian. The term ‘construction’ here means a multi-word expression in which a variable can be replaced with another word from the same semantic class, for example, a glass of [water/juice/milk]. We deal with constructions that consist of a noun and its adjective modifier. We propose a method of grouping such constructions into semantic classes via 2-step clustering of word vectors in distributional models. We compare it with other clustering techniques and evaluate it against A Russian-English Collocational Dictionary of the Human Body, which contains manually annotated groups of constructions with nouns denoting human body parts. The best performing method is used to cluster all adjective-noun bigrams in the Russian National Corpus. Results of this procedure are publicly available and can be used to build a Russian construction dictionary, accelerate theoretical studies of constructions, and facilitate teaching Russian as a foreign language. [en_US]
dc.language: EN
dc.publisher: Association for Computational Linguistics
dc.rights: Attribution 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.title: Clustering of Russian Adjective-Noun Constructions using Word Embeddings [en_US]
dc.type: Chapter [en_US]
dc.creator.author: Kutuzov, Andrei
dc.creator.author: Kuzmenko, Elizaveta
dc.creator.author: Pivovarova, Lidia
cristin.unitcode: 185,15,5,56
cristin.unitname: Forskningsgruppen for språkteknologi
cristin.ispublished: true
cristin.fulltext: original
dc.identifier.cristin: 1465584
dc.identifier.bibliographiccitation: info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.btitle=Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing&rft.spage=3&rft.date=2017
dc.identifier.startpage: 3
dc.identifier.endpage: 13
dc.identifier.pagecount: 125
dc.identifier.urn: URN:NBN:no-58071
dc.type.document: Bokkapittel (book chapter) [en_US]
dc.type.peerreviewed: Peer reviewed
dc.source.isbn: 978-1-945626-45-6
dc.identifier.fulltext: Fulltext https://www.duo.uio.no/bitstream/handle/10852/55264/1/clustering_constructions.pdf
dc.type.version: PublishedVersion
cristin.btitle: Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing


This item's license is: Attribution 4.0 International