Semantic Text Classification for Supporting Automated Compliance Checking in Construction
Tóm tắt
Từ khóa
Tài liệu tham khảo
Abu Sheikha F. and Inkpen D. (2010). “Automatic classification of documents by formality.” Proc. 6th Int. Conf. on Natural Language Processing and Knowledge Engineering (NLP-KE) IEEE Washington DC 1–5.
Amor R. and Xu K. (2005). “Automated classification of A/E/C web content.” Proc. CIB W78 22nd Int. on IT in Construction Dresden Univ. of Tech. Dresden Germany 315–319.
Ayyasamy R. K. Tahayna B. Alhashmi S. Eugene S. and Egerton S. (2010). “Mining Wikipedia knowledge to improve document indexing and classification.” Proc. 10th Int. Conf. on Info. Science Signal Processing and their Applications (ISSPA) Vol. 10 IEEE Washington DC 806–809.
Bi W. and Kwok J. (2011). “Multi-label classification on tree- and DAG- structured hierarchies.” Proc. 28th Int. Conf. on Machine Learning (ICML) Vol. 28 ACM New York 17–24.
Cherman E., 2011, Multi-label problem transformation methods: A case study, Lat. Am. Center Informat. Stud. (CLEI), 14, 1
Dasgupta A. Petros D. Harb B. Josifovski V. and Mahoney M. (2007). “Feature selection methods for text classification.” Proc. 13th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining ACM New York 230–239.
Forman G., 2003, An extensive empirical study of feature selection metrics for text classification, J. Mach. Learn. Res., 3, 1289
Ghamrawi N. and McCallum A. (2005). “Collective multi-label classification.” Proc. 14th Int. Conf. on Information and Knowledge Management (CIKM ’05) ACM New York 195–200.
Grefenstette G., 1999, Syntactic world class tagging
Heath D. Zitzelberger A. and Giraud-Carrier C. (2010). “A multiple domain comparison of multi-label classification methods.” Proc. 2nd Int. Workshop on Learning from Multi-Label Data International Machine Learning Society (IMLS) Princeton NJ.
Janik M. and Kochut K. (2008). “Training-less ontology-based text categorization.” Proc. 30th Eur. Conf. on Information Retrieval (ESAIR 2008) Workshop on Exploiting Semantic Annotations in Information Retrieval ACM New York.
Jia Z. and Mu J. (2010). “Web text categorization for large-scale corpus.” Proc. Int. Conf. on Computer Application and System Modeling (ICCASM) Vol. 8 IEEE Washington DC 188–191.
Joachims T. (1998). “Text categorization with support vector machines.” Proc. European Conf. on Machine Learning (ECML) Springer New York.
Kumar E., 2011, Natural language processing
Liu Y. Jin R. and Yang L. (2006). “Semi-supervised multi-label learning by constrained non-negative matrix factorization.” Proc. 21st National Conf. on Artificial Intelligence Vol. 1 AAAI Menlo Park CA 421–426.
Machine Learning for Language Toolkit (MALLET). (2009). “University of Massachusetts Amherst.” 〈http://mallet.cs.umass.edu/〉 (Jan. 15 2010).
Mahfouz T. (2011). “Unstructured construction document classification model through support vector machine (SVM).” Proc. Int. Workshop on J. Comput. Civ. Eng. ASCE Reston VA.
Mladenik D. and Grobelnik M. (1999). “Feature selection for unbalanced class distribution and naïve Bayes.” Proc. 16th Intl. Conf. on Machine Learning Vol. 16 Morgan Kaufmann San Francisco 258–267.
Moens M. (2000). Automatic indexing and abstracting of document texts Kluwer Academic Dordrecht Netherlands.
Nigam K. Lafferty J. and McCallum A. (1999). “Using maximum entropy for text classification.” Proc. Int. Joint Conf. on Artificial Intelligence Workshop on Machine Learning for Information Filtering AAAI Menlo Park CA 61–67.
Qui Y. Yang G. and Tan Z. (2010). “Chinese text classification based on extended naïve Bayes model with weighed positive features.” Proc. 1st Int. Conf. on Pervasive Computing Signal Processing and Applications Vol. 1 IEEE Washington DC 243–246.
Rennie J. Shih L. Teevan J. and Karger D. (2003). “Tackling the poor assumptions of naïve Bayes text classifiers.” Proc. 20th Int. Conf. on Machine Learning (ICML) Vol. 20 ACM New York 616–623.
Rizzolo N. and Roth D. (2010). “Learning based Java for rapid development of NLP systems.” Proc. Int. Conf. on Language Resources and Evaluation (LREC) European Language Resources Association (ELRA) Paris France.
Rogati M. and Yang Y. (2002). “High-performing feature selection for text classification.” Proc. Conf. on Information and Knowledge Management ACM New York 659–661.
Russel S., 2010, Artificial intelligence: A modern approach, 3
Shein K. and Nyunt T. (2010). “Sentiment classification based on ontology and SVM classifier.” Proc. 2nd Int. Conf. on Communication Software and Networks Vol. 2 IEEE Washington DC 169–172.
Silva C. and Ribeiro B. (2003). “The Importance of stop word removal on recall values in text categorization.” Proc. Int. Joint Conf. on Neural Networks Vol. 3 IEEE Washington DC 1661–1666.
Toman M. Tesar R. and Jezek K. (2006). “Influence of word normalization on text classification.” Proc. Int. Conf. on Multidisciplinary Info. Sciences and Tech. Open Institute of Knowledge Merida Spain 354–358.
Wang H. Wang L. and Yi L. (2010). “Maximum entropy framework used in text classification.” Proc. Int. Conf. on Intelligent Computing and Intelligent Systems (ICIS) Vol. 2 IEEE Washington DC 828–833.
Watt S., 2009, Information retrieval: Searching in the 21st century
Xu Y., 2007, A study on mutual information-based feature selection for text categorization, J. Comput. Inf. Syst., 3, 1007
Yang S. Wu X. Deng Z. Zhang M. and Yang D. (2002). “Relative term-frequency based feature selection for text categorization.” Proc. 1st Int. Conf. on Machine Learning and Cybernetics Vol. 1 IEEE Washington DC 1432–1436.
Yang Y. and Pederson J. (1997). “A comparative study on feature selection in text categorization.” Proc. 14th Int. Conf. on Machine Learning Morgan Kaufmann San Francisco 412–420.
