Deep learning for multimedia application

Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1590

Title:	Deep learning for multimedia application
Authors:	Ahuja, Aditya Shah, Rajiv Ratn (Advisor)
Keywords:	Speech Separation Deep Learning Speech Processing Deep Neural Networks
Issue Date:	29-Nov-2023
Publisher:	IIIT-Delhi
Abstract:	Recent advancements in speech applications prominently feature Deep Learning, driving significant progress in the challenging task of separating speech signals from multi-speaker speech mixtures. Speech Separation models have a wide range of applications ranging from enhancing the performance of hearing aids, use in telecommunications and serving as a pre-processing model in automatic speech recognition. In the following report, we analyze recent advancements in Deep Learning models for Monaural Speech Separation and discuss some ideas for the future direction of this work.
URI:	http://repository.iiitd.edu.in/xmlui/handle/123456789/1590
Appears in Collections:	Year-2023

Files in This Item:

File	Description	Size	Format
BTP_Report_2020275 - Aditya Ahuja.pdf Restricted Access		594.57 kB	Adobe PDF	View/Open Request a copy

DSpace JSPUI