Gradient-based learning applied to document recognition

Proceedings of the IEEE - Volume 86, Issue 11 - Pages 2278-2324 - 1998
Yann LeCun (1), Léon Bottou (1), Yoshua Bengio (1, 2), Patrick Haffner (1)
(1) Speech and Image Processing Services Research Laboratory
(2) Université de Montréal
