Skip to main content

Menu

Enter the terms you wish to search for.

Faculty & Research

Menu

Faculty
Publications
Books
Working Papers
Case Studies
Postdoctoral Scholars
Research Labs & Initiatives
Behavioral Lab
Data, Analytics & Research Computing

Faculty
Publications
Books
Working Papers
Case Studies
Research Labs & Initiatives
Behavioral Lab
DARC

Faculty & Research Working Papers Best Arm Identification in Generalized Linear Bandits

Best Arm Identification in Generalized Linear Bandits

By Abbas KazerouniLawrence M. Wein

May2019| Working Paper No. 3784

Operations, Information & Technology

Download

Submitted to Operations Research

Motivated by drug design, we consider the best-arm identification problem in generalized linear bandits. More specifically, we assume each arm has a vector of covariates, there is an unknown vector of parameters that is common across the arms, and a generalized linear model captures the dependence of rewards on the covariate and parameter vectors. The problem is to minimize the number of arm pulls required to identify an arm that is sufficiently close to optimal with a sufficiently high probability. Building on recent progress in best-arm identification for linear bandits (Xu et al. 2018), we propose the first algorithm for best-arm identification for generalized linear bandits, provide theoretical guarantees on its accuracy and sampling efficiency, and evaluate its performance in various scenarios via simulation.

Keywords

best arm identification

generalized linear bandits

sequential clinical trial

Footer contact links

Contact Us
Visit Us
Stay In Touch

Footer 1

Companies, Organizations & Recruiters
Stanford Community
Newsroom

Footer 2

Library
Jobs
MyGSB

Footer legal links

Accessibility
Non-Discrimination Policy
Privacy Policy
Terms of Use
Stanford University