IIIT-Delhi Institutional Repository

Browsing by Subject "Upper confidence Bound"



  • Pareek, Animesh; Darak, Sumit Jagdish (Advisor) (IIIT-Delhi, 2023-11-29)
    The Multi-Armed Bandit (MAB) problem in reinforcement learning refers to the problem of allocating resources among competing choices so as to maximize the expected gain toward an overall goal. Our ...
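The subject heading names the Upper Confidence Bound (UCB) policy, a standard approach to the MAB problem described in the abstract above. A minimal sketch of the classic UCB1 rule follows; the Bernoulli-arm setup, the `pull` interface, and the horizon are illustrative assumptions, not details taken from the thesis itself.

```python
import math
import random

def ucb1(pull, n_arms, horizon):
    """Run the UCB1 policy; pull(arm) returns a reward in [0, 1]."""
    counts = [0] * n_arms    # times each arm was pulled
    sums = [0.0] * n_arms    # cumulative reward per arm
    for t in range(1, horizon + 1):
        if t <= n_arms:
            arm = t - 1      # play each arm once to initialise estimates
        else:
            # pick the arm maximising empirical mean + exploration bonus
            arm = max(range(n_arms),
                      key=lambda a: sums[a] / counts[a]
                      + math.sqrt(2 * math.log(t) / counts[a]))
        reward = pull(arm)
        counts[arm] += 1
        sums[arm] += reward
    return counts, sums

# Toy usage: three Bernoulli arms with hidden success probabilities.
random.seed(0)
probs = [0.2, 0.5, 0.8]
counts, sums = ucb1(lambda a: 1.0 if random.random() < probs[a] else 0.0,
                    n_arms=3, horizon=2000)
```

The exploration bonus shrinks as an arm is sampled more often, so play concentrates on the empirically best arm while still occasionally revisiting the others.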
