|
TR-747-06
Logarithmic Regret Algorithms for Online Convex Optimization |
|
| Authors: | Hazan, Elad, Kalai, Adam, Kale, Satyen, Agarwal, Amit |
| Date: | February 2006 |
| Pages: | 15 |
| Download Formats: | [PDF] |
In an online convex optimization problem a decision-maker makes a sequence of decisions, i.e., choose a sequence of points in Euclidean space, from a fixed feasible set. After each point is chosen, it encounters an sequence of (possibly unrelated) convex cost functions. Zinkevich introduced this framework, which models many natural repeated decision-making problems and generalizes many existing problems such as Prediction from Expert Advice and Cover's Universal Portfolios. Zinkevich showed that a simple online gradient descent algorithm achieves additive regret O(sqrt{T}), for an arbitrary sequence of T convex cost functions (of bounded gradients), with respect to the best single decision in hindsight. |
|