Deep Learning for Computer Vision: A Brief Review
Tóm tắt
Over the last years deep learning methods have been shown to outperform previous state-of-the-art machine learning techniques in several fields, with computer vision being one of the most prominent cases. This review paper provides a brief overview of some of the most significant deep learning schemes used in computer vision problems, that is, Convolutional Neural Networks, Deep Boltzmann Machines and Deep Belief Networks, and Stacked Denoising Autoencoders. A brief account of their history, structure, advantages, and limitations is given, followed by a description of their applications in various computer vision tasks, such as object detection, face recognition, action and activity recognition, and human pose estimation. Finally, a brief overview is given of future directions in designing deep learning schemes for computer vision problems and the challenges involved therein.
Từ khóa
Tài liệu tham khảo
1990, Handwritten digit recognition with a back-propagation network
2012, Theano: new features and speed improvements
1986, Information processing in dynamical systems: Foundations of harmony theory, 1, 194
1986, Learning and Relearning in Boltzmann Machines, 1, 4.2
2010, Momentum, 9, 926
2014, Journal of Machine Learning Research, 15, 2949
2007, Greedy layer-wise training of deep networks, 19, 153
2017, IEEE Transactions on Image Processing