Deepak Chawla, Lin Li, and
Stephen D. Scott.
On approximating weighted sums with exponentially many terms.
Technical report UNL-CSE-2003-1, University of Nebraska, 2003.
Abstract
350Kb PDF
Abstract
Multiplicative weight-update algorithms such as Winnow and Weighted Majority have been studied extensively due to their on-line mistake bounds' logarithmic dependence on N, the total number of inputs, which allows them to be applied to problems where N is exponential. However, a large N requires techniques to efficiently compute the weighted sums of inputs to these algorithms. In special cases, the weighted sum can be exactly computed efficiently, but for numerous problems such an approach seems infeasible. Thus we explore applications of Markov chain Monte Carlo (MCMC) methods to estimate the total weight. Our methods are very general and applicable to any representation of a learning problem for which the inputs to a linear learning algorithm can be represented as states in a completely connected, untruncated Markov chain. We give theoretical worst-case guarantees on our technique and then apply it to two problems: learning DNF formulas using Winnow, and pruning classifier ensembles using Weighted Majority. We then present empirical results on simulated data indicating that in practice, the time complexity is much better than what is implied by our worst-case theoretical analysis.Keywords: Markov chain Monte Carlo approximation, Winnow, Weighted Majority, multiplicative weight updates, DNF learning, boosting
To Stephen D. Scott's home page
Last modified 28 March 2003.