Analysis of Perceptron-Based Active Learning
Author(s)
Dasgupta, Sanjoy; Kalai, Adam Tauman; Monteleoni, Claire
DownloadMIT-CSAIL-TR-2005-075.ps (11222Kb)
Additional downloads
Metadata
Show full item recordAbstract
We start by showing that in an active learning setting, the Perceptron algorithm needs $\Omega(\frac{1}{\epsilon^2})$ labels to learn linear separators within generalization error $\epsilon$. We then present a simple selective sampling algorithm for this problem, which combines a modification of the perceptron update with an adaptive filtering rule for deciding which points to query. For data distributed uniformly over the unit sphere, we show that our algorithm reaches generalization error $\epsilon$ after asking for just $\tilde{O}(d \log \frac{1}{\epsilon})$ labels. This exponential improvement over the usual sample complexity of supervised learning has previously been demonstrated only for the computationally more complex query-by-committee algorithm.
Date issued
2005-11-17Other identifiers
MIT-CSAIL-TR-2005-075
AIM-2005-033
Series/Report no.
Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory
Keywords
AI, active learning, perceptron, label-complexity, mistake bound, selective sampling