Multi-category classifiers and sample width
References
Anthony, 2000, Function learning from interpolation, Comb. Probab. Comput., 9, 213, 10.1017/S0963548300004247
Anthony, 1999
Alon, 1997, Scale-sensitive dimensions, uniform convergence, and learnability, J. ACM, 44, 615, 10.1145/263867.263927
Anthony, 2010, Maximal width learning of binary functions, Theor. Comput. Sci., 411, 138, 10.1016/j.tcs.2009.09.020
Anthony, 2014, Learning bounds via sample width for classifiers on finite metric spaces, Theor. Comput. Sci., 529, 2, 10.1016/j.tcs.2013.07.004
Anthony, 2012, Analysis of a multi-category classifier, Discrete Appl. Math., 160, 2329, 10.1016/j.dam.2012.07.010
Anthony, 2015, A probabilistic approach to case-based inference, Theor. Comput. Sci., 589, 61, 10.1016/j.tcs.2015.04.016
Bartlett, 1998, The sample complexity of pattern classification with neural networks: the size of the weights is more important than the size of the network, IEEE Trans. Inf. Theory, 44, 525, 10.1109/18.661502
Blumer, 1989, Learnability and the Vapnik–Chervonenkis dimension, J. ACM, 36, 929, 10.1145/76359.76371
Dudley, 1999, Uniform Central Limit Theorems, vol. 63
Guermeur, 2007, VC theory of large margin multi-category classifiers, J. Mach. Learn. Res., 8, 2551
Haussler, 1992, Decision theoretic generalizations of the PAC model for neural net and other learning applications, Inf. Comput., 100, 78, 10.1016/0890-5401(92)90010-D
Pollard, 1984
Shawe-Taylor, 1998, Structural risk minimization over data-dependent hierarchies, IEEE Trans. Inf. Theory, 44, 1926, 10.1109/18.705570
Smola, 2000
Vapnik, 1998
Vapnik, 1971, On the uniform convergence of relative frequencies of events to their probabilities, Theory Probab. Appl., 16, 264, 10.1137/1116025