GINN: gradient interpretable neural networks for visualizing financial texts
Abstract
This study aims to visualize financial documents in such a way that even non-experts can understand the sentiments they contain. To achieve this, we propose a novel text visualization method based on an interpretable neural network (NN) architecture called a gradient interpretable NN (GINN). GINN visualizes the market sentiment score of an entire financial document together with sentiment gradient scores at both the word and concept level. Moreover, GINN highlights important concepts as they appear in varying sentence contexts. Such visualization helps non-experts understand financial documents easily. We theoretically analyze the validity of GINN and experimentally demonstrate the text visualizations it produces on real financial texts.
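To make the idea of word-level sentiment gradient scores concrete, the sketch below illustrates gradient-based attribution in its simplest form: a toy scorer maps a bag-of-words vector to a document-level sentiment score, and the gradient of that score with respect to each input dimension is read as a per-word contribution. This is a minimal illustration only, not the GINN architecture itself; the names (`toy_scorer`, `vocab`, `document_vector`) are hypothetical, and a trained GINN would additionally aggregate words into concept units.

```python
# Minimal sketch of gradient-based sentiment attribution (illustrative only,
# not the GINN architecture): the gradient of the document-level sentiment
# score with respect to each input word count acts as that word's
# "sentiment gradient score".
import torch
import torch.nn as nn

vocab = ["profit", "loss", "growth", "decline"]         # hypothetical vocabulary
document_vector = torch.tensor([[1.0, 0.0, 2.0, 1.0]],  # word counts for one document
                               requires_grad=True)

toy_scorer = nn.Sequential(                              # stand-in for a trained sentiment model
    nn.Linear(len(vocab), 8),
    nn.Tanh(),
    nn.Linear(8, 1),
)

score = toy_scorer(document_vector)                      # document-level sentiment score
score.backward()                                         # backpropagate to the input

# Each gradient entry indicates how strongly the corresponding word
# pushes the document-level sentiment score up or down.
for word, grad in zip(vocab, document_vector.grad[0].tolist()):
    print(f"{word}: {grad:+.4f}")
```

In the same spirit, concept-level scores can be obtained by summing or averaging the word-level gradients over the words assigned to each concept cluster.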