Overlapping Multi-Bandit Best Arm Identification

Scarlett, Jonathan; Bogunovic, Ilija; Cevher, Volkan

doi:10.1109/ISIT.2019.8849327

conference paper

2019

2019 IEEE International Symposium on Information Theory (ISIT)


In the multi-armed bandit literature, the multi-bandit best-arm identification problem consists of determining each best arm in a number of disjoint groups of arms, with as few total arm pulls as possible. In this paper, we introduce a variant of the multi-bandit problem with overlapping groups, and present two algorithms for this problem based on successive elimination and lower/upper confidence bounds (LUCB). We bound the number of total arm pulls required for high-probability best-arm identification in every group, and we complement these bounds with a near-matching algorithm-independent lower bound. In addition, we show that a specific choice of the groups recovers the top-k ranking problem.
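To make the setting concrete, the following is a minimal sketch of a successive-elimination style procedure for overlapping groups. It is an illustration only and is not claimed to be the algorithm analyzed in the paper: the function names, the Hoeffding-type confidence radius (which assumes rewards in [0, 1]), and the rule that each active arm is pulled once per round and its samples are shared across all groups containing it are assumptions made here.

```python
import math
from typing import Callable, List, Sequence, Set, Tuple


def confidence_radius(t: int, n_arms: int, delta: float) -> float:
    """Hoeffding-type radius after t pulls (assumes rewards in [0, 1])."""
    return math.sqrt(math.log(4 * n_arms * t * t / delta) / (2 * t))


def overlapping_successive_elimination(
    pull: Callable[[int], float],
    n_arms: int,
    groups: Sequence[Sequence[int]],
    delta: float = 0.05,
    max_rounds: int = 100_000,
) -> Tuple[List[int], int]:
    """Return (estimated best arm of each group, total number of pulls).

    Hypothetical sketch: groups may share arms, and every sample of a
    shared arm is reused by all groups that contain it.
    """
    candidates: List[Set[int]] = [set(g) for g in groups]
    sums = [0.0] * n_arms
    counts = [0] * n_arms
    total_pulls = 0

    for _ in range(max_rounds):
        unresolved = [c for c in candidates if len(c) > 1]
        if not unresolved:
            return [next(iter(c)) for c in candidates], total_pulls

        # Pull each arm that is still a candidate in an unresolved group once.
        active = set().union(*unresolved)
        for arm in active:
            sums[arm] += pull(arm)
            counts[arm] += 1
            total_pulls += 1

        def lcb(arm: int) -> float:
            return sums[arm] / counts[arm] - confidence_radius(counts[arm], n_arms, delta)

        def ucb(arm: int) -> float:
            return sums[arm] / counts[arm] + confidence_radius(counts[arm], n_arms, delta)

        # Within each group, drop arms whose upper bound falls below the
        # best lower bound among that group's remaining candidates.
        for c in unresolved:
            best_lcb = max(lcb(arm) for arm in c)
            c.intersection_update({arm for arm in c if ucb(arm) >= best_lcb})

    raise RuntimeError("round budget exhausted before every group was resolved")
```

As a usage illustration, with four arms and the overlapping groups `[[0, 1], [1, 2, 3]]`, arm 1 belongs to both groups, so it is sampled once per round and the same samples feed both elimination tests. This sharing is where the saving in total pulls over treating the groups as disjoint bandits would come from in this sketch.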

Name: MultiBandit_postprint.pdf
Type: Postprint
Access type: openaccess
Size: 236.79 KB
Format: Adobe PDF
Checksum (MD5): abefe0910805c11ad3249fb043dab353

Name: MultiBandit_FULL.pdf
Type: publisher
Access type: openaccess
License Condition: copyright
Size: 297.13 KB
Format: Adobe PDF
Checksum (MD5): f85aab2bcd449516be9887688432255c