Santosh, Siripurapu Venkata Sai; Darak, Sumit Jagdish (Advisor)
(IIIT- Delhi, 2020-12)
Multi-armed bandit (MAB) algorithms are designed to identify the best arm among several arms in an unknown environment. They guarantee optimal balance between exploration (select all arms sufficient number of times) and ...