Please use this identifier to cite or link to this item:
http://repository.iiitd.edu.in/xmlui/handle/123456789/1590

| Title: | Deep learning for multimedia application |
| Authors: | Ahuja, Aditya; Shah, Rajiv Ratn (Advisor) |
| Keywords: | Speech Separation; Deep Learning; Speech Processing; Deep Neural Networks |
| Issue Date: | 29-Nov-2023 |
| Publisher: | IIIT-Delhi |
| Abstract: | Recent advancements in speech applications prominently feature Deep Learning, which has driven significant progress in the challenging task of separating individual speech signals from multi-speaker mixtures. Speech Separation models have a wide range of applications, including hearing-aid enhancement, telecommunications, and pre-processing for automatic speech recognition. In this report, we analyze recent advancements in Deep Learning models for Monaural Speech Separation and discuss some ideas for the future direction of this work. |
| URI: | http://repository.iiitd.edu.in/xmlui/handle/123456789/1590 |
| Appears in Collections: | Year-2023 |
Files in This Item:
| File | Description | Size | Format |
|---|---|---|---|
| BTP_Report_2020275 - Aditya Ahuja.pdf (Restricted Access) | | 594.57 kB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.