MAB on hardware

Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1507

Title:	MAB on hardware
Authors:	Pareek, Animesh Darak, Sumit Jagdish (Advisor)
Keywords:	Split Computing Early-Exiting Deep Neural Network Multi Armed Bandit Upper confidence Bound Reinforcement Learning SplitEE
Issue Date:	29-Nov-2023
Publisher:	IIIT-Delhi
Abstract:	The Multi Armed Bandit (MAB) problem in the Reinforcement Learning field refers to the problem of allocating resources in certain choices for achieving an overall goal with the objective of maximizing expected gain. Our BTP Journey is based on the goal of using MAB algorithms on hardware to effectively achieve better designs of hardware accelerators for various problems. In this semester we have focussed on learning about UCB (Upper confidence Bound) algorithm and its role in Early-Exiting of Deep Neural networks (with split computing) [SplitEE].
URI:	http://repository.iiitd.edu.in/xmlui/handle/123456789/1507
Appears in Collections:	Year-2023

Files in This Item:

File	Description	Size	Format
BTP sem-1 Report - Animesh Pareek.pdf Restricted Access		897.94 kB	Adobe PDF	View/Open Request a copy

DSpace JSPUI