Deep learning with long short-term memory networks for financial market predictions

European Journal of Operational Research - Tập 270 Số 2 - Trang 654-669 - 2018
Thomas Fischer1, Christopher Krauß1
1Department of Statistics and Econometrics, University of Erlangen-Nürnberg, Lange Gasse 20, 90403 Nürnberg, Germany

Tóm tắt

Từ khóa


Tài liệu tham khảo

Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., et al., (2015). TensorFlow: Large-scale machine learning on heterogeneous systems. Software available from tensorflow.org, http://tensorflow.org/.

Atsalakis, 2009, Surveying stock market forecasting techniques – Part II: Soft computing methods, Expert Systems with Applications, 36, 5932, 10.1016/j.eswa.2008.07.006

Avellaneda, 2010, Statistical arbitrage in the US equities market, Quantitative Finance, 10, 761, 10.1080/14697680903124632

Baker, 2011, Benchmarks as limits to arbitrage: Understanding the low-volatility anomaly, Financial Analysts Journal, 67, 40, 10.2469/faj.v67.n1.4

Bali, 2011, Maxing out: Stocks as lotteries and the cross-section of expected returns, Journal of Financial Economics, 99, 427, 10.1016/j.jfineco.2010.08.014

Bogomolov, 2013, Pairs trading based on statistical variability of the spread process, Quantitative Finance, 13, 1411, 10.1080/14697688.2012.748934

Boyer, 2010, Expected idiosyncratic skewness, Review of Financial Studies, 23, 169, 10.1093/rfs/hhp041

Breiman, 2001, Random forests, Machine learning, 45, 5, 10.1023/A:1010933404324

Britz, D. (2015). Recurrent neural network tutorial, part 4 – Implementing a GRU/LSTM RNN with Python and Theano. http://www.wildml.com/2015/10/recurrent-neural-network-tutorial-part-4-implementing-a-grulstm-rnn-with-python-and-theano/.

Carhart, 1997, On persistence in mutual fund performance, The Journal of Finance, 52, 57, 10.1111/j.1540-6261.1997.tb03808.x

Chollet, F. (2016). Keras. https://github.com/fchollet/keras

Clegg, 2018, Pairs trading with partial cointegration, Quantitative Finance, 18, 121, 10.1080/14697688.2017.1370122

Conrad, 1989, Mean reversion in short-horizon expected returns, Review of Financial Studies, 2, 225, 10.1093/rfs/2.2.225

Diebold, 1995, Comparing predictive accuracy, Journal of Business & Economic Statistics, 13, 253

Dixon, 2015, Implementing deep neural networks for financial market prediction on the Intel Xeon Phi, 1

Engelberg, 2017, Short selling risk, Journal of Finance

Fama, 1970, Efficient capital markets: A review of theory and empirical work, The Journal of Finance, 25, 383, 10.2307/2325486

Fama, 1996, Multifactor explanations of asset pricing anomalies, The Journal of Finance, 51, 55, 10.1111/j.1540-6261.1996.tb05202.x

Frazzini, 2014, Betting against beta, Journal of Financial Economics, 111, 1, 10.1016/j.jfineco.2013.10.005

Gal, 2016, A theoretically grounded application of dropout in recurrent neural networks, 1019

Gatev, 2006, Pairs trading: Performance of a relative-value arbitrage rule, Review of Financial Studies, 19, 797, 10.1093/rfs/hhj020

Gers, 2000, Learning to forget: Continual prediction with LSTM, Neural Computation, 12, 2451, 10.1162/089976600300015015

Giles, 2001, Noisy time series prediction using recurrent neural networks and grammatical inference, Machine Learning, 44, 161, 10.1023/A:1010884214864

Goodfellow, 2013, Maxout networks, 1319

Granger, 1993, Strategies for modelling nonlinear time-series relationships, Economic Record, 69, 233, 10.1111/j.1475-4932.1993.tb02103.x

Graves, A. (2013). Generating sequences with recurrent neural networks. CoRR, arXiv preprint arXiv:1308.0850.

Graves, 2009, A novel connectionist system for unconstrained handwriting recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 31, 855, 10.1109/TPAMI.2008.137

Graves, 2013, Speech recognition with deep recurrent neural networks, 6645

Graves, 2005, Framewise phoneme classification with bidirectional LSTM and other neural network architectures, Neural Networks, 18, 602, 10.1016/j.neunet.2005.06.042

Green, 2017, The characteristics that provide independent information about average U.S. monthly stock returns, The Review of Financial Studies, 30, 4389, 10.1093/rfs/hhx019

Green, 2013, The supraview of return predictive signals, Review of Accounting Studies, 18, 692, 10.1007/s11142-013-9231-1

Gregoriou, 2012

H2O (2016). H2O documentation. http://h2o.ai/docs, http://h2o-release.s3.amazonaws.com/h2o/rel-tukey/4/docs-website/h2o-docs/index.html.

Ho, 1995, Random decision forests, 1, 278

Hochreiter, 1997, Long short-term memory, Neural computation, 9, 1735, 10.1162/neco.1997.9.8.1735

Hong, 2016, Speculative betas, The Journal of Finance, 71, 2095, 10.1111/jofi.12431

Huck, 2009, Pairs selection and outranking: An application to the S&P 100 index, European Journal of Operational Research, 196, 819, 10.1016/j.ejor.2008.03.025

Huck, 2010, Pairs trading and outranking: The multi-step-ahead forecasting case, European Journal of Operational Research, 207, 1702, 10.1016/j.ejor.2010.06.043

Jacobs, 2015, What explains the dynamics of 100 anomalies?, Journal of Banking & Finance, 57, 65, 10.1016/j.jbankfin.2015.03.006

Jacobs, 2015, On the determinants of pairs trading profitability, Journal of Financial Markets, 23, 75, 10.1016/j.finmar.2014.12.001

Jegadeesh, 1990, Evidence of predictable behavior of security returns, The Journal of Finance, 45, 881, 10.1111/j.1540-6261.1990.tb05110.x

Jegadeesh, 1993, Returns to buying winners and selling losers: Implications for stock market efficiency, The Journal of Finance, 48, 65, 10.1111/j.1540-6261.1993.tb04702.x

Jegadeesh, 1995, Overreaction, delayed reaction, and contrarian profits, Review of Financial Studies, 8, 973, 10.1093/rfs/8.4.973

Jha, 2016, Timing equity quant positions with short-horizon alphas, The Journal of Trading, 11, 53, 10.3905/jot.2016.11.3.053

Karpathy, A. (2015). The unreasonable effectiveness of recurrent neural networks. http://karpathy.github.io/2015/05/21/rnn-effectiveness/.

Krauss, 2017, Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500, European Journal of Operational Research, 259, 689, 10.1016/j.ejor.2016.10.031

Kumar, 2009, Who gambles in the stock market?, The Journal of Finance, 64, 1889, 10.1111/j.1540-6261.2009.01483.x

LeBaron, 1992, Some relations between volatility and serial correlations in stock market returns, The Journal of Business, 65, 199, 10.1086/296565

LeCun, 2015, Deep learning, Nature, 521, 436, 10.1038/nature14539

Lee, 2003, Short-term contrarian investing – Is it profitable?... Yes and No, Journal of Multinational Financial Management, 13, 385, 10.1016/S1042-444X(03)00017-3

Lehmann, 1990, Fads, martingales, and market efficiency, The Quarterly Journal of Economics, 105, 1, 10.2307/2937816

Lo, 1990, When are contrarian profits due to stock market overreaction?, Review of Financial Studies, 3, 175, 10.1093/rfs/3.2.175

Maechler, M. (2016). Rmpfr: R MPFR – multiple precision floating-point reliable. R package. https://cran.r-project.org/package=Rmpfr.

Malkiel, 2007

McKinney, 2010, Data structures for statistical computing in Python, 445, 51, 10.25080/Majora-92bf1922-00a

Medsker, 2000, Recurrent neural networks: Design and applications

Moritz, 2014, Deep conditional portfolio sorts: The relation between past and future stock returns

Olah, C. (2015). Understanding LSTM Networks. http://colah.github.io/posts/2015-08-Understanding-LSTMs/.

Pedregosa, 2011, Scikit-learn: Machine learning in python, Journal of Machine Learning Research, 12, 2825

Peterson, B. G., & Carl, P. (2014). PerformanceAnalytics: Econometric tools for performance and risk analysis. R package, http://CRAN.R-project.org/package=PerformanceAnalytics.

Python Software Foundation (2016). Python 3.5.2 documentation. Available at https://docs.python.org/3.5/.

R Core Team (2016). R: A language and environment for statistical computing. http://www.R-project.org/.

Rad, 2016, The profitability of pairs trading strategies: Distance, cointegration and copula methods, Quantitative Finance, 16, 1541, 10.1080/14697688.2016.1164337

Sak, H., Senior, A.W., & Beaufays, F. (2014). Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition. CoRR, arXiv preprint arXiv:1402.1128.

Schmidhuber, 2015, Deep learning in neural networks: An overview, Neural Networks, 61, 85, 10.1016/j.neunet.2014.09.003

Sermpinis, 2013, Forecasting foreign exchange rates with adaptive neural networks using radial-basis functions and particle swarm optimization, European Journal of Operational Research, 225, 528, 10.1016/j.ejor.2012.10.020

Siah, K. W., & Myers, P. (2016). Stock market prediction through technical and public sentiment analysis. http://kienwei.mit.edu/sites/default/files/images/stock-market-prediction.pdf.

Takeuchi, L., & Lee, Y.-Y. (2013). Applying deep learning to enhance momentum trading strategies in stocks. Working paper, Stanford University.

Tieleman, 2012, Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude, COURSERA: Neural Networks for Machine Learning, 4, 26

Van Der Walt, 2011, The numpy array: a structure for efficient numerical computation, Computing in Science & Engineering, 13, 22, 10.1109/MCSE.2011.37

Xiong, R., Nichols, E. P., & Shen, Y. (2015). Deep learning stock volatility with Google domestic trends. arXiv e-prints arXiv:1512.04916.