Deep neural network for hierarchical extreme multi-label text classification

Applied Soft Computing - Tập 79 - Trang 125-138 - 2019
Francesco Gargiulo1, Stefano Silvestri2,1, Mario Ciampi1, Giuseppe De Pietro1
1Institute for High Performance Computing and Networking of National Research Council, ICAR-CNR, Via Pietro Castellino 111 - 80131, Naples, Italy
2Department of Engineering, University of Naples “Parthenope”, Centro Direzionale di Napoli, Isola C4 - 80143, Naples, Italy

Tóm tắt

Từ khóa


Tài liệu tham khảo

Mirończuk, 2018, A recent overview of the state-of-the-art elements of text classification, Expert Syst. Appl., 106, 36, 10.1016/j.eswa.2018.03.058

Alicante, 2016, A distributed architecture to integrate ontological knowledge into information extraction, Int. J. Grid Utility Comput., 7, 245, 10.1504/IJGUC.2016.081011

Liu, 2017, Deep learning for extreme multi-label text classification, 115

Wang, 2016, Biomedical event trigger detection by dependency-based word embedding, BMC Med. Genomics, 9, 45, 10.1186/s12920-016-0203-8

Komninos, 2016, Dependency based embeddings for sentence classification tasks, 1490

Levy, 2014, Dependency-based word embeddings, 302

A. Trask, P. Michalak, J. Liu, sense2vec - A fast and accurate method for word sense disambiguation in neural word embeddings, arXiv preprint.

Nentidis, 2017, Results of the fifth edition of the BioASQ Challenge, 48

Holzinger, 2018, Current advances, trends and challenges of machine learning and knowledge extraction: from machine learning to explainable AI, vol. 11015, 1

Goebel, 2018, Explainable AI: the new 42?, vol. 11015, 295

Zhang, 2017, LF-LDA: a topic model for multi-label classification, 618

Pavlinek, 2017, Text classification method based on self-training and lda topic models, Expert Syst. Appl., 80, 83, 10.1016/j.eswa.2017.03.020

Nam, 2014, Large-scale multi-label text classification - revisiting neural networks, 437

Hughes, 2017, Medical text classification using convolutional neural networks, CoRR, 235, 246

D. Yogatama, C. Dyer, W. Ling, P. Blunsom, Generative and Discriminative Text Classification with Recurrent Neural Networks, CoRR abs/1703.01898.

Schwenk, 2017, Very deep convolutional networks for text classification, 1107

Yan, 2018, LSTM2 : multi-label ranking for document classification, Neural Process. Lett., 47, 117, 10.1007/s11063-017-9636-0

Wang, 2016, Recurrent residual learning for sequence classification, 938

Chen, 2017, Ensemble application of convolutional and recurrent neural networks for multi-label text categorization, 2377

Baumel, 2018, Multi-label classification of patient notes: case study on ICD code assignment, 409

Nigam, 2017

Peng, 2018, Large-scale hierarchical text classification with recursively regularized deep graph-cnn, 1063

Mork, 2014, Recent enhancements to the NLM medical text indexer, 1328

Zavorin, 2016, Using learning-to-rank to enhance NLM medical text indexer results, 8

Aronson, 2001, Effective mapping of biomedical text to the UMLS metathesaurus: the metamap program, 17

Lin, 2007, Pubmed related articles: a probabilistic topic-based model for content similarity, BMC Bioinformatics, 8, 423, 10.1186/1471-2105-8-423

Peng, 2016, Deepmesh: deep semantic representation for improving large-scale mesh indexing, Bioinformatics, 32, 70, 10.1093/bioinformatics/btw294

Peng, 2018, Meshlabeler and DeepMeSH: recent progress in large-scale mesh indexing, 203

Le, 2014, Distributed representations of sentences and documents, 1188

Papagiannopoulou, 2016, Large-scale semantic indexing and question answering in biomedicine, 50

Papanikolaou, 2017, Large-scale online semantic indexing of biomedical articles via an ensemble of multi-label classification models, J. Biomed. Semant., 8, 43:1, 10.1186/s13326-017-0150-0

Mao, 2017, MeSH Now: automatic MeSH indexing at PubMed scale via learning to rank, J. Biomed. Semant., 8, 15:1, 10.1186/s13326-017-0123-3

Liu, 2015, MeSHLAbeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence, Bioinformatics, 31, 339, 10.1093/bioinformatics/btv237

Ribadas-Pena, 2015, CoLe and UTAI at BioASQ 2015: experiments with similarity based descriptor assignment

Du, 2017, A novel serial deep multi-task learning model for large scale biomedical semantic indexing, 533

Tanenblatt, 2010, The conceptmapper approach to named entity recognition, 546

Mikolov, 2013, Distributed representations of words and phrases and their compositionality, 3111

Ilievski, 2017, Efficient hyperparameter optimization for deep learning algorithms using deterministic rbf surrogates, 822

Sutton, 1986, Two problems with backpropagation and other steepest-descent learning procedures for networks, 823

Polyak, 1964, Some methods of speeding up the convergence of iteration methods, USSR Comput. Math. Math. Phys., 4, 1, 10.1016/0041-5553(64)90137-5

Nesterov, 1983, A method for unconstrained convex minimization problem with the rate of convergence O(1/k̂ 2), 543

Gargiulo, 2018, Deep convolution neural network for extreme multi-label text classification, 641

Alicante, 2016, Semantic cluster labeling for medical relations, 183

Gargiulo, 2018, A clustering based methodology to support the translation of medical specifications to software models, Appl. Soft Comput., 71, 199, 10.1016/j.asoc.2018.03.057

Mikolov, 2013, Efficient estimation of word representations in vector space

A. Joulin, E. Grave, P. Bojanowski, M. Douze, H. Jégou, T. Mikolov, FastText.zip: Compressing text classification models, arXiv preprint arXiv:1612.03651.

S.M. Rezaeinia, A. Ghodsi, R. Rahmani, Improving the Accuracy of Pre-trained Word Embeddings for Sentiment Analysis, CoRR abs/1711.08609.

Gargiulo, 2017, A deep learning approach for scientific paper semantic ranking, 471

I. Pavlopoulos, A. Kosmopoulos, I. Androutsopoulos, Continuous Space Word Vectors Obtained by Applying Word2Vec to Abstracts of Biomedical Articles, Tech. rep., NLP Group, Department of Informatics, Athens University of Economics and Business, Greece Institute of Informatics and Telecommunications, NCRS Demokritos, Greece, 2014.

Q. Liu, Z. Ling, H. Jiang, Y. Hu, Part-of-Speech Relevance Weights for Learning Word Embeddings, CoRR abs/1603.07695.

Bojanowski, 2017, Enriching word vectors with subword information, Trans. Assoc. Comput. Linguist., 5, 135, 10.1162/tacl_a_00051

Joulin, 2017, Bag of tricks for efficient text classification, 427

Melamud, 2015, A simple word embedding model for lexical substitution, 1

Gargiulo, 2017, A big data architecture for knowledge discovery in PubMed articles, 82

Demsar, 2006, Statistical comparisons of classifiers over multiple data sets, J. Mach. Learn. Res., 7, 1

Manning, 2014, The stanford coreNLP natural language processing toolkit, 55

Řehůřek, 2010, Software framework for topic modelling with large corpora, 45

Bird, 2009

Tsoumakas, 2010, Mining multi-label data, 667

Özgür, 2005, Text categorization with class-based and corpus-based keyword selection, Comput. Inf. Sci.-ISCIS 2005, 606

Manning, 2010

Sokolova, 2009, A systematic analysis of performance measures for classification tasks, Inf. Process. Manage., 45, 427, 10.1016/j.ipm.2009.03.002

Godbole, 2004, Discriminative methods for multi-labeled classification, 22

Moskovitch, 2006, Multiple hierarchical classification of free-text clinical guidelines, Artif. Intell. Med., 37, 177, 10.1016/j.artmed.2006.04.001

Kosmopoulos, 2015, Evaluation measures for hierarchical classification: a unified view and novel approaches, Data Min. Knowl. Discov., 29, 820, 10.1007/s10618-014-0382-x

Aho, 1976, On finding lowest common ancestors in trees, SIAM J. Comput., 5, 115, 10.1137/0205011

S. Ramamoorthy, S. Murugan, An Attentive Sequence Model for Adverse Drug Event Extraction from Biomedical Text, CoRR abs/1801.00625.