dc.contributor.author: Falch, Arvid Andreas
dc.date.accessioned: 2023-08-21T22:03:16Z
dc.date.available: 2023-08-21T22:03:16Z
dc.date.issued: 2023
dc.identifier.citation: Falch, Arvid Andreas. Raw Audio End-to-End Deep Learning Architectures for Sound Event Detection. Master thesis, University of Oslo, 2023
dc.identifier.uri: http://hdl.handle.net/10852/103569
dc.description.abstract: This thesis proposes deep learning architectures for sound event detection that aim to work fully end-to-end, taking raw audio as input, and that can be directly compared to models using fixed graphical time-frequency representations as input. The primary objective is to assess the effectiveness of raw audio input relative to the conventional fixed graphical time-frequency representations. To this end, pairs of similar models based on convolutional recurrent neural networks, which are commonly used in sound event detection, are trained on either raw audio or fixed graphical time-frequency representations to enable a direct comparison. The findings show that the proposed architectures, operating on raw audio input, can achieve performance comparable to models based on fixed graphical time-frequency representations. Moreover, in specific applications where high temporal resolution is important, the architectures using raw audio input outperform their fixed graphical counterparts. This finding highlights the potential of raw audio end-to-end deep learning architectures to capture the fine-grained temporal information critical for accurate sound event detection. [eng]
dc.language.iso: eng
dc.subject: deep learning
dc.subject: audio analysis
dc.subject: machine learning
dc.subject: raw audio input
dc.subject: Sound event detection
dc.title: Raw Audio End-to-End Deep Learning Architectures for Sound Event Detection [eng]
dc.type: Master thesis
dc.date.updated: 2023-08-22T22:01:25Z
dc.creator.author: Falch, Arvid Andreas
dc.type.document: Master thesis (Masteroppgave)


