Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1507
Full metadata record
DC FieldValueLanguage
dc.contributor.authorPareek, Animesh
dc.contributor.authorDarak, Sumit Jagdish (Advisor)
dc.date.accessioned2024-05-17T09:30:39Z
dc.date.available2024-05-17T09:30:39Z
dc.date.issued2023-11-29
dc.identifier.urihttp://repository.iiitd.edu.in/xmlui/handle/123456789/1507
dc.description.abstractThe Multi Armed Bandit (MAB) problem in the Reinforcement Learning field refers to the problem of allocating resources in certain choices for achieving an overall goal with the objective of maximizing expected gain. Our BTP Journey is based on the goal of using MAB algorithms on hardware to effectively achieve better designs of hardware accelerators for various problems. In this semester we have focussed on learning about UCB (Upper confidence Bound) algorithm and its role in Early-Exiting of Deep Neural networks (with split computing) [SplitEE].en_US
dc.language.isoen_USen_US
dc.publisherIIIT-Delhien_US
dc.subjectSplit Computingen_US
dc.subjectEarly-Exitingen_US
dc.subjectDeep Neural Networken_US
dc.subjectMulti Armed Banditen_US
dc.subjectUpper confidence Bounden_US
dc.subjectReinforcement Learningen_US
dc.subjectSplitEEen_US
dc.titleMAB on hardwareen_US
dc.typeOtheren_US
Appears in Collections:Year-2023

Files in This Item:
File Description SizeFormat 
BTP sem-1 Report - Animesh Pareek.pdf
  Restricted Access
897.94 kBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.