Semantic Text Classification for Supporting Automated Compliance Checking in Construction

Journal of Computing in Civil Engineering - Tập 30 Số 1 - 2016

D. M. Salama¹, Nora El-Gohary²

¹Graduate Student, Dept. of Civil and Environmental Engineering, Univ. of Illinois at Urbana-Champaign, 205 N. Mathews Ave., Urbana, IL 61801.

²Assistant Professor, Dept. of Civil and Environmental Engineering, Univ. of Illinois at Urbana-Champaign, 205 N. Mathews Ave., Urbana, IL 61801 (corresponding author).

Tóm tắt

Từ khóa

Tài liệu tham khảo

Abu Sheikha F. and Inkpen D. (2010). “Automatic classification of documents by formality.” Proc. 6th Int. Conf. on Natural Language Processing and Knowledge Engineering (NLP-KE) IEEE Washington DC 1–5.

Amor R. and Xu K. (2005). “Automated classification of A/E/C web content.” Proc. CIB W78 22nd Int. on IT in Construction Dresden Univ. of Tech. Dresden Germany 315–319.

Ayyasamy R. K. Tahayna B. Alhashmi S. Eugene S. and Egerton S. (2010). “Mining Wikipedia knowledge to improve document indexing and classification.” Proc. 10th Int. Conf. on Info. Science Signal Processing and their Applications (ISSPA) Vol. 10 IEEE Washington DC 806–809.

Bi W. and Kwok J. (2011). “Multi-label classification on tree- and DAG- structured hierarchies.” Proc. 28th Int. Conf. on Machine Learning (ICML) Vol. 28 ACM New York 17–24.

10.1002/(SICI)1097-4571(199401)45:1<12::AID-ASI2>3.0.CO;2-L

10.1016/S0926-5805(03)00004-9

10.1061/(ASCE)0887-3801(2002)16:4(234)

10.1145/1961189.1961199

Cherman E., 2011, Multi-label problem transformation methods: A case study, Lat. Am. Center Informat. Stud. (CLEI), 14, 1

10.1002/aris.1440370103

Dasgupta A. Petros D. Harb B. Josifovski V. and Mahoney M. (2007). “Feature selection methods for text classification.” Proc. 13th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining ACM New York 230–239.

10.1016/j.autcon.2009.07.002

Forman G., 2003, An extensive empirical study of feature selection metrics for text classification, J. Mach. Learn. Res., 3, 1289

10.1201/9781584888796.pt4

Ghamrawi N. and McCallum A. (2005). “Collective multi-label classification.” Proc. 14th Int. Conf. on Information and Knowledge Management (CIKM ’05) ACM New York 195–200.

Grefenstette G., 1999, Syntactic world class tagging

10.1007/978-3-642-17187-1_35

Heath D. Zitzelberger A. and Giraud-Carrier C. (2010). “A multiple domain comparison of multi-label classification methods.” Proc. 2nd Int. Workshop on Learning from Multi-Label Data International Machine Learning Society (IMLS) Princeton NJ.

Janik M. and Kochut K. (2008). “Training-less ontology-based text categorization.” Proc. 30th Eur. Conf. on Information Retrieval (ESAIR 2008) Workshop on Exploiting Semantic Annotations in Information Retrieval ACM New York.

Jia Z. and Mu J. (2010). “Web text categorization for large-scale corpus.” Proc. Int. Conf. on Computer Application and System Modeling (ICCASM) Vol. 8 IEEE Washington DC 188–191.

Joachims T. (1998). “Text categorization with support vector machines.” Proc. European Conf. on Machine Learning (ECML) Springer New York.

10.1061/(ASCE)0887-3801(2008)22:1(3)

Kumar E., 2011, Natural language processing

10.1007/s10791-009-9093-0

10.1007/978-3-642-01256-3_17

Liu Y. Jin R. and Yang L. (2006). “Semi-supervised multi-label learning by constrained non-negative matrix factorization.” Proc. 21st National Conf. on Artificial Intelligence Vol. 1 AAAI Menlo Park CA 421–426.

Machine Learning for Language Toolkit (MALLET). (2009). “University of Massachusetts Amherst.” 〈http://mallet.cs.umass.edu/〉 (Jan. 15 2010).

Mahfouz T. (2011). “Unstructured construction document classification model through support vector machine (SVM).” Proc. Int. Workshop on J. Comput. Civ. Eng. ASCE Reston VA.

10.1017/CBO9780511809071

Mladenik D. and Grobelnik M. (1999). “Feature selection for unbalanced class distribution and naïve Bayes.” Proc. 16th Intl. Conf. on Machine Learning Vol. 16 Morgan Kaufmann San Francisco 258–267.

Moens M. (2000). Automatic indexing and abstracting of document texts Kluwer Academic Dordrecht Netherlands.

10.1061/(ASCE)1076-0342(2006)12:1(50)

Nigam K. Lafferty J. and McCallum A. (1999). “Using maximum entropy for text classification.” Proc. Int. Joint Conf. on Artificial Intelligence Workshop on Machine Learning for Information Filtering AAAI Menlo Park CA 61–67.

Qui Y. Yang G. and Tan Z. (2010). “Chinese text classification based on extended naïve Bayes model with weighed positive features.” Proc. 1st Int. Conf. on Pervasive Computing Signal Processing and Applications Vol. 1 IEEE Washington DC 243–246.

Rennie J. Shih L. Teevan J. and Karger D. (2003). “Tackling the poor assumptions of naïve Bayes text classifiers.” Proc. 20th Int. Conf. on Machine Learning (ICML) Vol. 20 ACM New York 616–623.

Rizzolo N. and Roth D. (2010). “Learning based Java for rapid development of NLP systems.” Proc. Int. Conf. on Language Resources and Evaluation (LREC) European Language Resources Association (ELRA) Paris France.

Rogati M. and Yang Y. (2002). “High-performing feature selection for text classification.” Proc. Conf. on Information and Knowledge Management ACM New York 659–661.

10.1023/A:1012782908347

Russel S., 2010, Artificial intelligence: A modern approach, 3

10.1061/(ASCE)CP.1943-5487.0000298

10.1016/0306-4573(88)90021-0

10.1145/505282.505283

Shein K. and Nyunt T. (2010). “Sentiment classification based on ontology and SVM classifier.” Proc. 2nd Int. Conf. on Communication Software and Networks Vol. 2 IEEE Washington DC 169–172.

Silva C. and Ribeiro B. (2003). “The Importance of stop word removal on recall values in text categorization.” Proc. Int. Joint Conf. on Neural Networks Vol. 3 IEEE Washington DC 1661–1666.

10.1007/3-540-44886-1_41

10.1061/(ASCE)0887-3801(2010)24:2(203)

Toman M. Tesar R. and Jezek K. (2006). “Influence of word normalization on text classification.” Proc. Int. Conf. on Multidisciplinary Info. Sciences and Tech. Open Institute of Knowledge Merida Spain 354–358.

10.4018/jdwm.2007070101

Wang H. Wang L. and Yi L. (2010). “Maximum entropy framework used in text classification.” Proc. Int. Conf. on Intelligent Computing and Intelligent Systems (ICIS) Vol. 2 IEEE Washington DC 828–833.

Watt S., 2009, Information retrieval: Searching in the 21st century

Xu Y., 2007, A study on mutual information-based feature selection for text categorization, J. Comput. Inf. Syst., 3, 1007

Yang S. Wu X. Deng Z. Zhang M. and Yang D. (2002). “Relative term-frequency based feature selection for text categorization.” Proc. 1st Int. Conf. on Machine Learning and Cybernetics Vol. 1 IEEE Washington DC 1432–1436.

Yang Y. and Pederson J. (1997). “A comparative study on feature selection in text categorization.” Proc. 14th Int. Conf. on Machine Learning Morgan Kaufmann San Francisco 412–420.

10.1016/S0031-3203(01)00210-2

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích ảnh hưởng của các bài báo, công bố khoa học Việt Nam và Quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Hệ thống CSDL Khoa học & Công nghệ SciBase

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Hệ thống hội thảo khoa học Việt Nam

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA

Thông tin liên hệ & hỗ trợ

Đơn vị chủ quản, phát triển và vận hành: Công ty Cổ phần Metis

Địa chỉ liên hệ: 26A Lê Đức Thọ, Phường Từ Liêm, Thành phố Hà Nội

Số giấy chứng nhận ĐKKD: 0109293202 cấp ngày 03/08/2020 tại Sở Kế hoạch và Đầu tư thành phố Hà Nội

Người quản lý và chịu trách nhiệm nội dung: Nguyễn Ngọc Sơn

Hotline: 0566.685.688

Email: [email protected]