Home . Bio . Publications . Code & Data

 

Publications

Conference and Journal papers

  • On Stochastic and Worst-case Models for Investing. (with S. Kale) To appear NIPS 2009.
    In practice, most investing is done assuming a probabilistic model of stock price returns known as the Geometric Brownian Motion (GBM). While a very good approximation, the GBM model is not always valid empirically. This motivates a worst-case approach to investing, called universal portfolio management, where the objective is to maximize wealth relative to the wealth earned by the best fixed portfolio in hindsight. In this paper we tie the two approaches, and design an investment strategy which is universal in the worst-case, and yet capable of exploiting the mostly valid GBM model.

  • Online Submodular Minimization. (with S. Kale) To appear NIPS 2009.
    We consider an online decision problem over a discrete space in which the loss function is submodular. We give algorithms which are computationally efficient and are Hannan-consistent in both the full information and bandit settings.

  • An Efficient Interior-Point Method for Minimum-Regret Learning in Online Convex Optimization. (with N. Megiddo) To appear NIPS 2009.
    A new algorithm for regret minimization in online convex optimization is described. The regret of the algorithm after T time periods is almost the minimum possible. However, in n-dimensional space, during each iteration the new algorithm essentially solves a system of linear equations of order n, whereas previous algorithms have to solve some constrained convex optimization problem in n dimensions and possibly many constraints. Thus, the new algorithm improves running time by a factor of at least √ n, and much more for nontrivial domains.

  • Efficient learning algorithms for changing environments. (with C. Seshadhri) The 26th International Conference on Machine Learning (ICML 2009).
    We describe a series of reductions, based upon data streaming techniques, which transform algorithms for minimizing (standard) regret into adaptive algorithms albeit incurring only poly-logarithmic computational overhead. Using this method, we obtain efficient low adaptive-regret algorithms for the problem of general online convex optimization.

  • How hard is it to approximate the best Nash equilibrium? (with Robert Krauthgamer) ACM-SIAM Symposium on Discrete Algorithms (SODA09).
    The quest for a PTAS for Nash equilibrium in a two-player game is a major open question in Algorithmic Game Theory. We show that the optimization version of problem, namely, finding in a two-player game an approximate equilibrium achieving large social welfare is unlikely to have a polynomial time algorithm. Technically, our result is a reduction from a seemingly unrelated yet notoriously difficult problem in modern Combinatorics, of finding a planted (but hidden) clique in a random graph.

  • Better Algorithms for Benign Bandits. (with Satyen Kale) ACM-SIAM Symposium on Discrete Algorithms (SODA09).
    We describe a novel algorithm for the classical Multi-Armed-Bandit problem and its generalizations, which attains scenario-dependent regret bounds thereby improving the state of the art. The new method attains regret of √Q, where Q is the total variation in the cost functions. This regret bound, previously conjectured to hold in the full information case, shows that it is possible to incur much less regret in a slowly changing environment even in the bandit setting. Our algorithm is efficient and applies several new ideas to bandit optimization such as reservoir sampling.

  • An Efficient Algorithm for Bandit Linear Optimization. (with Jacob Abernethy and Alexander Rakhlin) The 21st Annual Conference on Learning Theory (COLT 2008).
    We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal √T regret. The setting is a natural generalization of the non-stochastic multi-armed bandit problem, and the existence of an efficient optimal algorithm has been posed as an open problem in a number of recent papers. We show how the difficulties encountered by previous approaches are overcome by the use of a self-concordant potential function.
    IBM 2008 Pat Goldberg Memorial Best Paper Award
    COLT 2008 Machine Learning Journal Best Student Paper Award

  • Extracting Certainty from Uncertainty: Regret Bounded by Variation in Costs. (with Satyen Kale) The 21st Annual Conference on Learning Theory (COLT 2008). Invited to special issue of Machine Learning Journal.
    Prediction from expert advice is a fundamental problem in machine learning. A major pillar of the field is the existence of learning algorithms whose average loss approaches that of the best expert in hindsight (in other words, whose average regret approaches zero). Traditionally the regret of online algorithms was bounded in terms of the number of prediction rounds. Cesa-Bianchi, Mansour and Stoltz posed the question whether it is possible to bound the regret of an online algorithm by the variation of the observed costs. In this paper we resolve this question, and prove such bounds in the fully adversarial setting, in two important online learning scenarios: prediction from expert advice, and online linear optimization.

  • Sparse Approximate Solutions to Semidefinite Programs. LATIN 2008.
    We analyze a generalization of the Frank-Wolfe algorithm for approximately maximizing a concave function over the bounded semi-definite cone which produces sparse solutions (sparsity for SDP corresponds to low rank matrices).

  • Adaptive Online Gradient Descent. (with Peter Bartlett and Alexander Rakhlin) Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, (NIPS) 2007.
    We study the rates of growth of the regret in online convex optimization, and propose an algorithm which interpolates between previous regret bounds for linear and strongly convex loss functions, achieving intermediate rates between root(T) and logarithmic in T.

  • Computational Equivalence of Fixed Points and No Regret Algorithms, and Convergence to Equilibria. (with Satyen Kale) Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, (NIPS) 2007.
    We study some general notions of game-theoretic equilibria, and show that it is possible to efficiently approximate fixed points corresponding to certain equilibria if and only if there exist efficient no regret algorithms. (the interested reader should also check out the recent Stoltz and Lugosi paper)

  • Online Learning with Prior Information.(with Nimrod Megiddo) Proceedings of 20th Annual Conference on Learning Theory, (COLT) 2007.
    We extend the standard so-called ``experts`` framework by allowing an experts algorithm to rely on state information, namely, partial information about the future which is revealed to the decision maker before the latter chooses an action. This extension is very natural in prediction problems. For illustration, an experts algorithm, which is supposed to predict whether the next day will be rainy, can be extended to predicting the same given the current temperature. We introduce new algorithms, which attain almost optimal performance in the new framework, and apply to more general settings than variants of regression that have been considered in the statistics literature.

  • Logarithmic Regret Algorithms for Online Convex Optimization. (with Amit Agarwal and Satyen Kale) In Machine Learning Volume 69 , Issue 2-3 Pages: 169 - 192 (December 2007)
    We present new algorithms for the problem of online convex optimization, that use second derivatives, analogous to Newton's method. These new algorithms attain logarithmic regret (versus polynomial regret of other methods) and are computationally efficient. We also show a surprising connection to the intuitive "follow the leader" approach, and prove on the affirmative the conjecture about universal portfolios made by Ordentlich and independently by Kalai and Vempala, that the FTL method attains low regret for certain convex loss functions. This paper is a journal version, combining results from a conference paper that appeared in COLT 2006 (with Adam Kalai, Satyen Kale and Amit Agarwal) and a ECCC report (with Amit Agarwal).

  • Algorithms for Portfolio Management based on the Newton Method. (with Amit Agarwal, Satyen Kale and Robert E. Schapire) ICML 2006: 23rd International Conference on Machine Learning.
    We experimentally study the application of the on-line algorithms proposed in the above paper to portfolio management.

  • A Fast Random Sampling Algorithm for Sparsifying Matrices. (with Sanjeev Arora and Satyen Kale) In APPROX 2006
    We present a simple procedure for sparsifying matrices. Unlike previous results, our performance guarantees can be proved with nothing more than simple Chernoff bounds.

  • Fast Algorithms for Approximate Semidefinite Programming using the Multiplicative Weights Update Method. In Proceedings of the 46th Annual Symposium on the Foundations of Computer Science, (FOCS) 2005 (with Sanjeev Arora and Satyen Kale)
    Semidefinite programming (SDP) relaxations appear in many recent approximation algorithms but the only general technique for solving such SDP relaxations is via interior point methods. We use a Lagrangian-relaxation based technique (modified from the papers of Plotkin, Shmoys, and Tardos (PST), and Klein and Lu) to derive faster algorithms for approximately solving several families of SDP relaxations. The algorithms are based upon some improvements to the PST ideas --- which lead to new results even for their framework--- as well as improvements in approximate eigenvalue computations by using random sampling.

  • On Non-Approximability for Quadratic Programs (with Sanjeev Arora, Eli Berger, Guy Kindler, and Muli Safra.) In Proceedings of the 46th Annual Symposium on the Foundations of Computer Science, (FOCS) 2005
    This paper studies the computational complexity of the following type of quadratic programs: given an arbitrary matrix whose diagonal elements are zero, find x in {-1,1}^n that maximizes x^T M x. This problem recently attracted attention due to its application in various clustering settings, as well as an intriguing connection to the famous Grothendieck inequality. It is approximable to within a factor of O(log n), and known to be NP-hard to approximate within a certain constant factor. We show that it is quasi-NP-hard to approximate to a poly-logarithmic factor. The integrality gap of the natural semidefinite relaxation for this problem is known as the Grothendieck constant of the complete graph, and known to be Theta(log n). The proof of this fact was nonconstructive, and did not yield an explicit problem instance where this integrality gap is achieved. Our techniques yield an explicit instance for which the integrality gap is tight up to a log(log(n)) factor, essentially answering one of the open problems of Alon et al.

  • HAPLOFREQ - Estimating Haplotype Frequencies Efficiently(with Eran Halperin) Special issue of Journal of Computational Biology,March 2006, Vol. 13, No. 2: 481-500. Extended abstract appeared in RECOMB 2005.
    This paper gives efficient algorithms for a computational problem in bio-informatics. We give theoretical analysis as well as experiments which show the algorithm's advantage over previous methods.

  • (√(log n)) approximation to SPARSEST CUT can be found in Õ(n²) time. (with Sanjeev Arora and Satyen Kale.) In Proc. 45th FOCS, 238-247, 2004. Submitted to SIAM Journal on Computing.
    We show how to compute O(√log n)-approximations to SPARSEST CUT and BALANCED SEPARATOR problems in Õ(n²) time, improving upon the algorithm of Arora, Rao and Vazirani (2004) which uses semidefinite programming and required Õ(n4.5) time. Our algorithm relies on efficiently finding expander flows in the graph and does not solve semidefinite programs.

  • On the Complexity of Approximating k-Set Packing. (with Muli Safra and Oded Schwartz.) In Computational Complexity 15(1): 20-39 (2006). Extended abstract appeared in APPROX 2003
    We study the fundamental computational problem of set packing in bounded-degree graphs, and prove near-optimal lower bounds.

Technical Reports

Thesis and Surveys