Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1590
Title: Deep learning for multimedia application
Authors: Ahuja, Aditya
Shah, Rajiv Ratn (Advisor)
Keywords: Speech Separation
Deep Learning
Speech Processing
Deep Neural Networks
Issue Date: 29-Nov-2023
Publisher: IIIT-Delhi
Abstract: Deep Learning features prominently in recent speech applications and has driven significant progress in the challenging task of separating individual speech signals from multi-speaker mixtures. Speech Separation models have a wide range of applications, from enhancing the performance of hearing aids, to telecommunications, to serving as a pre-processing stage for automatic speech recognition. In this report, we analyze recent advancements in Deep Learning models for Monaural Speech Separation and discuss ideas for future directions of this work.
URI: http://repository.iiitd.edu.in/xmlui/handle/123456789/1590
Appears in Collections: Year-2023

Files in This Item:
File: BTP_Report_2020275 - Aditya Ahuja.pdf (Restricted Access)
Size: 594.57 kB
Format: Adobe PDF

