Analysis of Perceptron-Based Active Learning

Dasgupta, Sanjoy; Kalai, Adam Tauman; Monteleoni, Claire

Author(s)

Dasgupta, Sanjoy; Kalai, Adam Tauman; Monteleoni, Claire

DownloadMIT-CSAIL-TR-2005-075.ps (11222Kb)

Additional downloads

Metadata

Show full item record

Abstract

We start by showing that in an active learning setting, the Perceptron algorithm needs $\Omega(\frac{1}{\epsilon^2})$ labels to learn linear separators within generalization error $\epsilon$. We then present a simple selective sampling algorithm for this problem, which combines a modification of the perceptron update with an adaptive filtering rule for deciding which points to query. For data distributed uniformly over the unit sphere, we show that our algorithm reaches generalization error $\epsilon$ after asking for just $\tilde{O}(d \log \frac{1}{\epsilon})$ labels. This exponential improvement over the usual sample complexity of supervised learning has previously been demonstrated only for the computationally more complex query-by-committee algorithm.

Date issued

2005-11-17

URI

http://hdl.handle.net/1721.1/30585

Other identifiers

MIT-CSAIL-TR-2005-075

AIM-2005-033

Series/Report no.

Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory

Keywords

AI, active learning, perceptron, label-complexity, mistake bound, selective sampling

Collections

CSAIL Technical Reports (July 1, 2003 - present)