The true sample complexity of active learning

Machine Learning - Volume 80 - Pages 111-139 - 2010
Maria-Florina Balcan¹, Steve Hanneke², Jennifer Wortman Vaughan³
¹ College of Computing, School of Computer Science, Georgia Institute of Technology, Atlanta, USA
² Department of Statistics, Carnegie Mellon University, Pittsburgh, USA
³ School of Engineering and Applied Sciences, Harvard University, Cambridge, USA

Abstract

We describe and explore a new perspective on the sample complexity of active learning. In many situations where it was generally believed that active learning does not help, we show that active learning does help in the limit, often with exponential improvements in sample complexity. This contrasts with the traditional analysis of active learning problems such as non-homogeneous linear separators or depth-limited decision trees, in which Ω(1/ε) lower bounds are common. Such lower bounds should be interpreted carefully; indeed, we prove that it is always possible to learn an ε-good classifier with a number of samples asymptotically smaller than this. These new insights arise from a subtle variation on the traditional definition of sample complexity, not previously recognized in the active learning literature.
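For a sense of what an exponential improvement in label requests can look like, the sketch below uses the classic textbook case of threshold classifiers on [0, 1]. It is an illustration only, not the algorithm or the analysis of this paper. It assumes noise-free labels, a uniform data distribution, and a learner that may query the label of any point it chooses; the names `active_learn_threshold` and `make_oracle` are hypothetical. Under these assumptions, binary search needs about log2(1/ε) label requests to reach error at most ε, whereas a passive learner typically needs on the order of 1/ε labeled examples.

```python
def make_oracle(threshold):
    """Hypothetical noise-free labeler: returns True iff x >= threshold."""
    return lambda x: x >= threshold


def active_learn_threshold(oracle, epsilon):
    """Binary search for an unknown threshold on [0, 1].

    Returns (estimate, number_of_label_requests). The interval [lo, hi]
    always contains the true threshold, so once its width is at most
    epsilon the midpoint is within epsilon / 2 of the threshold. Under a
    uniform distribution that distance equals the classifier's error rate.
    """
    lo, hi = 0.0, 1.0
    queries = 0
    while hi - lo > epsilon:
        mid = (lo + hi) / 2
        queries += 1
        if oracle(mid):
            hi = mid  # threshold lies at or below mid
        else:
            lo = mid  # threshold lies above mid
    return (lo + hi) / 2, queries
```

Calling `active_learn_threshold(make_oracle(0.3141), 0.001)` returns an estimate of the threshold together with the number of labels requested, which grows only logarithmically as `epsilon` shrinks.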

References