
dc.date.accessioned: 2023-01-25T18:00:05Z
dc.date.available: 2023-01-25T18:00:05Z
dc.date.created: 2022-06-13T16:14:45Z
dc.date.issued: 2022
dc.identifier.citation: Bentsen, Lars Ødegaard; Simionato, Riccardo; Wallace, Benedikte; Krzyzaniak, Michael Joseph. Transformer and LSTM Models for Automatic Counterpoint Generation using Raw Audio. Proceedings of the SMC Conferences. 2022
dc.identifier.uri: http://hdl.handle.net/10852/99177
dc.description.abstract: A study investigating Transformer and LSTM models applied to raw audio for the automatic generation of counterpoint was conducted. In particular, the models learned to generate missing voices from an input melody, using a collection of raw audio waveforms of various pieces of Bach's work played on different instruments. The research demonstrated the efficacy and behaviour of the two deep learning (DL) architectures when applied to raw audio data, which are typically characterised by much longer sequences than symbolic music representations such as MIDI. To date, the LSTM has been the quintessential DL model for sequence-based tasks such as generative audio modelling, but the research conducted in this study shows that the Transformer can achieve competitive results on a fairly complex raw audio task. The research therefore aims to spark further investigation into how Transformer models can be used for applications typically dominated by recurrent neural networks (RNNs). In general, both models yielded excellent results and generated sequences with temporal patterns similar to the input targets, both for songs not present in the training data and for a sample taken from a completely different dataset.
dc.language: EN
dc.publisher: Society for Sound and Music Computing
dc.rights: Attribution 3.0 Unported
dc.rights.uri: https://creativecommons.org/licenses/by/3.0/
dc.title: Transformer and LSTM Models for Automatic Counterpoint Generation using Raw Audio
dc.title.alternative: Transformer and LSTM Models for Automatic Counterpoint Generation using Raw Audio
dc.type: Journal article
dc.creator.author: Bentsen, Lars Ødegaard
dc.creator.author: Simionato, Riccardo
dc.creator.author: Wallace, Benedikte
dc.creator.author: Krzyzaniak, Michael Joseph
cristin.unitcode: 185,15,30,0
cristin.unitname: Institutt for teknologisystemer
cristin.ispublished: true
cristin.fulltext: original
cristin.qualitycode: 1
dc.identifier.cristin: 2031494
dc.identifier.bibliographiccitation: info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Proceedings of the SMC Conferences&rft.volume=&rft.spage=&rft.date=2022
dc.identifier.jtitle: Proceedings of the SMC Conferences
dc.identifier.doi: https://doi.org/10.5281/zenodo.6572847
dc.type.document: Tidsskriftartikkel (Journal article)
dc.type.peerreviewed: Peer reviewed
dc.source.issn: 2518-3672
dc.type.version: PublishedVersion





This work is licensed under: Attribution 3.0 Unported