Text classification using a few labeled examples

Computers in Human Behavior - Tập 30 - Trang 689-697 - 2014
Francesco Colace1, Massimo De Santo1, Luca Greco1, Paolo Napoletano2
1DIEM - Department of Information Engineering, Electrical Engineering and Applied Mathematics University of Salerno, 84084 Fisciano, Italy
2Department of Informatics, Systems and Communication University of Milan, Bicocca, 20126 Milan, Italy

Tài liệu tham khảo

Adam Grzywaczewski, 2012, Task-specific information retrieval systems for software engineers, Journal of Computer and System Sciences, 78–4, 1204, 10.1016/j.jcss.2011.10.009 Aggarwal, 2012, A survey of text classification algorithms, 163 Berkhin, 2006, A survey of clustering data mining techniques, 25 Bishop, 1995 Bishop, 2006 Blei, 2003, Latent Dirichlet allocation, Journal of Machine Learning Research, 3 Blum, 1997, Selection of relevant features and examples in machine learning, Artificial Intelligence, 97, 245, 10.1016/S0004-3702(97)00063-5 Christopher, 2008 Clarizia, 2011, A new text classification technique using small training sets, 1038 Clarizia, 2011, Mixed graph of terms for query expansion, 581 Colace, 2013, Improving text retrieval accuracy by using a minimal relevance feedback, Vol. 348, 126 Cortes, 1995, Support-vector networks, Machine Learning, 20, 273, 10.1007/BF00994018 Fodor, I. (2002). A survey of dimension reduction techniques, technical report. Griffiths, 2007, Topics in semantic representation, Psychological Review, 114, 211, 10.1037/0033-295X.114.2.211 Hai Dong, 2011, A framework for discovering and classifying ubiquitous services in digital health ecosystems, Journal of Computer and System Sciences, 78–4, 687, 10.1016/j.jcss.2010.02.009 Hastie, 2009 Karavasilis, 2010, A model for investigating e-governance adoption using tam and doi, International Journal of Knowledge Society Research, 3, 71, 10.4018/jksr.2010070106 Ko, 2009, Text classification from unlabeled documents with bootstrapping and feature projection techniques, Information Processing Management, 45, 70, 10.1016/j.ipm.2008.07.004 Lewis, 2004, Rcv1: A new benchmark collection for text categorization research, Journal of Machine Learning Research, 5, 361 Liu, 2006 Liu, 1999, Mining interesting knowledge using DM-II, 430 Magdalini Eirinaki, 2012, Feature-based opinion mining and ranking, Journal of Computer and System Sciences, 78–4, 1175, 10.1016/j.jcss.2011.10.007 McCallum, A. K. (1996). Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering. <http://www.cs.cmu.edu/?mccallum/bow>. McCallum, A. K. (2002). Mallet: A machine learning for language toolkit. <http://mallet.cs.umass.edu>. McCallum, 1998, A comparison of event models for naive Bayes text classification McCallum, 1999, A machine learning approach to building domain-specific search engines, Vol. 2, 662 Napoletano, P., Colace, F., De Santo, M., & Greco, L. (2012). Text classification using a graph of terms. In 2012 Sixth international conference on complex, intelligent and software intensive systems (CISIS) (pp. 1030 –1035). Ng, A.Y., Jordan, M. I. (2002). On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. Noam, S. Naftali, T. (2001). The power of word clusters for text classification. In 23rd European colloquium on information retrieval research. Palus, 2011, Evaluation of organization structure based on email interactions, International Journal of Knowledge Society Research, 2, 1, 10.4018/jksr.2011010101 Quinlan, 1986, Induction of decision trees, Machine Learning, 1, 81, 10.1007/BF00116251 Rahat Iqbal, 2012, Information retrieval, decision making process and user needs, Journal of Computer and System Sciences, 78–4, 1158, 10.1016/j.jcss.2011.10.005 Salton, 1983 Sebastiani, 2002, Machine learning in automated text categorization, ACM Computing Surverys, 34, 1, 10.1145/505282.505283 Tsoumakas, 2007, Multi-label classification: An overview, International Journal Data Warehousing and Mining, 1, 10.4018/jdwm.2007070101 Yang, 1999, A re-examination of text categorization methods, 42