Zaid, Kunwar; Ghatak, Gourab (Advisor)
(IIIT-Delhi, 2020)
The stationary multi-armed bandit (MAB) framework is a well-studied problem in literature, with many rigorous mathematical treatments and optimal solutions. However, for a non-stationary environment, i.e., when the reward ...