
arXiv: 1709.03093
We revisit the problem of online linear optimization in the case where the set of feasible actions is accessible through an approximated linear optimization oracle with a factor α multiplicative approximation guarantee. This setting in particular is interesting because it captures natural online extensions of well-studied offline linear optimization problems that are NP-hard yet admit efficient approximation algorithms. The goal here is to minimize the α-regret, which is the natural extension to this setting of the standard regret in online learning. We present new algorithms with significantly improved oracle complexity for both the full-information and bandit variants of the problem. Mainly, for both variants, we present α-regret bounds of [Formula: see text], were T is the number of prediction rounds, using only [Formula: see text] calls to the approximation oracle per iteration, on average. These are the first results to obtain both the average oracle complexity of [Formula: see text] (or even polylogarithmic in T) and α -regret bound [Formula: see text] for a constant c > 0 for both variants.
FOS: Computer and information sciences, Computer Science - Machine Learning, online algorithms, online learning, Learning and adaptive systems in artificial intelligence, Approximation methods and heuristics in mathematical programming, Approximation algorithms, online linear optimization, Machine Learning (cs.LG), Optimization and Control (math.OC), Linear programming, FOS: Mathematics, Online algorithms; streaming algorithms, Rationality and learning in game theory, approximation algorithms, Mathematics - Optimization and Control, regret minimization
FOS: Computer and information sciences, Computer Science - Machine Learning, online algorithms, online learning, Learning and adaptive systems in artificial intelligence, Approximation methods and heuristics in mathematical programming, Approximation algorithms, online linear optimization, Machine Learning (cs.LG), Optimization and Control (math.OC), Linear programming, FOS: Mathematics, Online algorithms; streaming algorithms, Rationality and learning in game theory, approximation algorithms, Mathematics - Optimization and Control, regret minimization
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 5 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
