Please use this identifier to cite or link to this item:
http://repository.iiitd.edu.in/xmlui/handle/123456789/1536Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Mehrotra, Sparsh | - |
| dc.contributor.author | Roy, Sayan Basu (Advisor) | - |
| dc.date.accessioned | 2024-05-20T09:32:33Z | - |
| dc.date.available | 2024-05-20T09:32:33Z | - |
| dc.date.issued | 2023-12 | - |
| dc.identifier.uri | http://repository.iiitd.edu.in/xmlui/handle/123456789/1536 | - |
| dc.description.abstract | We use different optimizers in everyday Machine Learning and Deep Learning applications. The task for any machine learning algorithm is to πππ π (π₯), where π is the objective function and π₯ is the input parameter. We can use standard algorithms like gradient descent for simple convex functions. Nowadays, more complex state-of-the-art optimizers like ADAM, ADAMSSD, and DADAM are used. Recent advancements have tackled the situation of finding the minimum for possibly non-convex settings. Recent state-of-the-art optimizers aim to solve the minimization problem for online and distributed settings as well. The aim is to develop an optimizer for distributed / online settings using the control theory analysis. Recent work in control theory doesnβt explore the idea of distributed and online settings. | en_US |
| dc.language.iso | en_US | en_US |
| dc.publisher | IIIT-Delhi | en_US |
| dc.subject | AdamSSD | en_US |
| dc.subject | DADAM | en_US |
| dc.subject | G-AdaGrad | en_US |
| dc.title | Design & optimizing distributed learning gradients using control theory | en_US |
| dc.type | Other | en_US |
| Appears in Collections: | Year-2023 | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| BTP_Thesis_Monsoon (1) - Sparsh Mehrotra.pdf Restricted Access | 188.46 kB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.