An Algorithm for Stochastic and Adversarial Bandits with Switching Costs

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Documents

  • Fulltext: Final published version, 978 KB, PDF document

We propose an algorithm for stochastic and adversarial multi-armed bandits with switching costs, where the algorithm pays a price $\lambda$ every time it switches the arm being played. Our algorithm is based on an adaptation of the Tsallis-INF algorithm of Zimmert and Seldin (2021) and requires no prior knowledge of the regime or time horizon. In the oblivious adversarial setting it achieves the minimax optimal regret bound of $O\big((\lambda K)^{1/3} T^{2/3} + \sqrt{KT}\big)$, where $T$ is the time horizon and $K$ is the number of arms. In the stochastically constrained adversarial regime, which includes the stochastic regime as a special case, it achieves a regret bound of $O\big((\lambda K)^{2/3} T^{1/3} + \ln T \sum_{i \neq i^*} \Delta_i^{-1}\big)$, where $\Delta_i$ are the suboptimality gaps and $i^*$ is the unique optimal arm. In the special case of $\lambda = 0$ (no switching costs), both bounds are minimax optimal within constants. We also explore variants of the problem where the switching cost is allowed to change over time. We provide an experimental evaluation showing the competitiveness of our algorithm with the relevant baselines in the stochastic, stochastically constrained adversarial, and adversarial regimes with fixed switching cost.
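For readers who want a concrete starting point, the sketch below implements the base Tsallis-INF update of Zimmert and Seldin (2021) with power $\alpha = 1/2$, which the proposed algorithm adapts. It is a minimal illustration under stated assumptions, not the paper's switching-cost variant: the learning-rate constant, the Newton-solver tolerances, and the plain importance-weighted loss estimator are illustrative choices.

```python
import numpy as np

def tsallis_inf_distribution(L_hat, eta, iters=100, tol=1e-12):
    """Sampling distribution of Tsallis-INF with power alpha = 1/2:
    p_i = 4 / (eta * (L_hat_i - x))^2, where the normalizer
    x < min_i L_hat_i is found by Newton's method so that sum_i p_i = 1."""
    x = np.min(L_hat) - 2.0 / eta   # at this start p_min = 1, so sum_i p_i >= 1
    for _ in range(iters):
        p = 4.0 / (eta * (L_hat - x)) ** 2
        s = p.sum()
        if abs(s - 1.0) < tol:
            break
        # Newton step on f(x) = sum_i p_i(x) - 1, using d p_i / dx = eta * p_i^{3/2}
        x -= (s - 1.0) / (eta * np.sum(p ** 1.5))
    return p / p.sum()

def run_tsallis_inf(losses, seed=0):
    """Run Tsallis-INF on a T x K matrix of per-round losses in [0, 1],
    with learning rate eta_t = 1 / sqrt(t) (one common convention)."""
    T, K = losses.shape
    L_hat = np.zeros(K)                      # cumulative loss estimates
    rng = np.random.default_rng(seed)
    arms = np.empty(T, dtype=int)
    for t in range(1, T + 1):
        p = tsallis_inf_distribution(L_hat, eta=1.0 / np.sqrt(t))
        i = rng.choice(K, p=p)
        L_hat[i] += losses[t - 1, i] / p[i]  # importance-weighted loss estimate
        arms[t - 1] = i
    return arms
```

Handling switching costs on top of this update (for example, by limiting how often the sampled arm changes) is the core of the paper's contribution; the sketch above covers only the base algorithm being adapted.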
Original language: English
Title of host publication: Proceedings of the 38th International Conference on Machine Learning (ICML)
Publisher: PMLR
Publication date: 2021
Pages: 1-11
Publication status: Published - 2021
Event: 38th International Conference on Machine Learning (ICML) - Virtual
Duration: 18 Jul 2021 – 24 Jul 2021

Conference

Conference: 38th International Conference on Machine Learning (ICML)
City: Virtual
Period: 18/07/2021 – 24/07/2021
Series: Proceedings of Machine Learning Research
Volume: 139
ISSN: 1938-7228

ID: 305176750