IIIT-Delhi Institutional Repository

Learning speaker, emotion, age, and gender information through disentanglement of speech pre-trained representations

Show simple item record

dc.contributor.author Koshal, Devyani
dc.contributor.author Buduru, Arun Balaji (Advisor)
dc.date.accessioned 2024-05-13T11:10:31Z
dc.date.available 2024-05-13T11:10:31Z
dc.date.issued 2023-11-29
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1448
dc.description.abstract Forensic speech science, rooted in acoustics, plays a key role in legal investigations. Among its diverse applications, automatic speaker recognition (ASR) stands as a primary task within forensic speech analysis followed by speech emotion recognition (SER), gender recognition (GR) and age estimation (AE). Expanding beyond conventional identification methods, leveraging multi-task learning and speech-pre-trained models (PTM) representations enhances the scope of analysis and is more resource-friendly. This approach allows simultaneous exploration of multiple facets, including speaker information, emotional cues, gender characterization, and age estimation embedded within speech. Additionally, this modeling prevents training models for tasks individually and resulting in preservation of computational resources as well as time. This multi-dimensional analysis aids in offering insights beyond identification and enriches the depth of the investigations via a comprehensive comparison of representations from various PTMs for the aforementioned tasks. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject Speech Forensics en_US
dc.subject Self-Supervised Learning en_US
dc.subject Pre-Trained Models en_US
dc.subject Multi-Task Learning en_US
dc.subject Convolutional Neural Networks en_US
dc.title Learning speaker, emotion, age, and gender information through disentanglement of speech pre-trained representations en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account