Bandit Algorithm (Online Machine Learning)

Bandit Algorithm (Online Machine Learning) by Indian Institute of Technology Bombay

Beginner
Created by IIT Bombay Staff Last updated Wed, 30-Mar-2022 English


Bandit Algorithm (Online Machine Learning): free videos and free material uploaded by IIT Bombay Staff.

Syllabus / What will I learn?

Week 1: Introduction to Bandit Algorithms. From Batch to Online Setting

Week 2: Adversarial Setting with Full Information (Halving, WM Algorithm)

Week 3: Adversarial Setting with Bandit Information

Week 4: Regret Lower Bounds for the Adversarial Setting

Week 5: Introduction to the Stochastic Setting and Various Regret Notions

Week 6: A Primer on Concentration Inequalities

Week 7: Stochastic Bandit Algorithms: UCB, KL-UCB

Week 8: Lower Bounds for Stochastic Bandits

Week 9: Introduction to Contextual Bandits

Week 10: Overview of Contextual Bandit Algorithms

Week 11: Introduction to Pure Exploration Setups (Fixed Confidence vs. Fixed Budget)

Week 12: Algorithms for Pure Exploration (LUCB, KL-LUCB, lil’UCB)



Description

In many scenarios one faces uncertain environments where the best action to play is not known a priori. How can one obtain the best possible reward or utility in such scenarios? One natural way is to first explore the environment to identify the 'best' actions and then exploit them. However, this gives rise to an exploration vs. exploitation dilemma: on the one hand, we need to explore enough to identify the best action and be confident about its optimality; on the other hand, the best actions need to be exploited as often as possible to obtain higher reward. In this course we will study many bandit algorithms that balance exploration and exploitation well in various random environments to accumulate good rewards over the duration of play. Bandit algorithms find applications in online advertising, recommendation systems, auctions, routing, e-commerce, and any field with online scenarios where information can be gathered incrementally.
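To make the exploration vs. exploitation trade-off concrete, here is a minimal sketch of the UCB1 index policy (UCB is listed under Week 7 of the syllabus). It is an illustration under stated assumptions, not the course's own material: the function name `ucb1`, the Bernoulli reward model, the arm means, and the bonus term sqrt(2 ln t / n) are choices made for this example.

```python
import math
import random


def ucb1(arm_means, horizon, seed=0):
    """Run UCB1 on simulated Bernoulli arms (assumed reward model).

    Returns (pull counts per arm, total reward collected).
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    counts = [0] * n_arms   # how many times each arm has been played
    sums = [0.0] * n_arms   # cumulative reward observed per arm
    total = 0.0

    for t in range(1, horizon + 1):
        if t <= n_arms:
            # Initialisation: play every arm once.
            arm = t - 1
        else:
            # Index = empirical mean (exploitation) + confidence bonus (exploration).
            arm = max(
                range(n_arms),
                key=lambda a: sums[a] / counts[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        # Simulated Bernoulli reward; in a real problem this comes from the environment.
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        total += reward

    return counts, total
```

Calling `ucb1([0.1, 0.9], 2000)` plays the two hypothetical arms for 2000 rounds. The bonus shrinks as an arm is played more, so rarely played arms keep getting occasional exploratory pulls while most of the play concentrates on the arm that looks best.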

INTENDED AUDIENCE: Computer Science, Electrical Engineering, Operations Research, Mathematics and Statistics

PREREQUISITES: Basics of Probability Theory and Optimization

INDUSTRY SUPPORT: All companies related to Internet technologies (e.g., Google, Microsoft, Flipkart, Ola, Amazon, etc.)


Material price:

Free
