Learning long-term dependencies with gradient descent is difficult
Tóm tắt
Từ khóa
Tài liệu tham khảo
grossman, 0, Learning by choice of internal representation, Neural Information Processing Systems, 1, 73
kirkpatrick, 1983, Optimization by simulated annealing, Science, 220, 671, 10.1126/science.220.4598.671
kuhn, 1987, A first look at phonetic discrimination using connectionist models with recurrent links
lang, 1988, The development of the time-delay neural network architecture for speech recognition
mozer, 1989, A focused back-propagation algorithm for temporal pattern recognition, Complex Systems, 3, 349
mozer, 1992, Advances in neural information processing systems, 4, 275
bengio, 1991, Artificial neural networks and their application to sequence recognition
frasconi, 0, Unified integration of explicit rules and learning by example in recurrent networks, IEEE Trans on Knowledge and Data Engineering
becker, 0, Improving the convergence of backpropagation learning with second order methods, Proceedings of the 1988 Connectionist Models Summer School, 29
ortega, 1960, Iterative Solution of Non-linear Equations in Several Variables and Systems of Equations
rumelhart, 1986, Parallel Distributed Processing, 1, 318
rohwer, 1990, Advances in neural information processing systems, 2, 558