Van Roy, Benjamin
Statistics - Machine Learning | Computer Science - Artificial Intelligence | Computer Science - Learning
Thompson sampling has emerged as an effective heuristic for a broad range of online decision problems. In its basic form, the algorithm requires computing and sampling from a posterior distribution over models, which is tractable only for simple special cases. This pape...