Three-level learning for improving cross-project logging prediction for if-blocks

Sangeeta Lal1, Neetu Sardana1, Ashish Sureka2
1Jaypee Institute of Information Technology, Noida, Uttar-Pradesh, India
2ABB Corporate Research, Bangalore, India

Tài liệu tham khảo

Antoniol, 2008, Is it a bug or an enhancement?: a text-based approach to classify change requests, 23 Apache. Apache project homepage. https://commons.apache.org/proper/commons-logging/. [Online; accessed 18-March-2016]. Blackberry enterprise server logs submission. https://salesforce.services.blackberry.com/webforms/beslogs. (Online; accessed 4-June-2016). Chawla, 2002, Smote: synthetic minority over-sampling technique, J. Artif. Intell. Res., 16, 321, 10.1613/jair.953 Chawla, 2003, Smoteboost: Improving prediction of the minority class in boosting, 107 Cloudstack. Cloudstack project homepage. https://cloudstack.apache.org/. (Online; Accessed 18-March-2016). Dai, 2007, Boosting for transfer learning, 193 Ding, 2015, Log2: a cost-aware logging mechanism for performance diagnosis, 139 Freund, 1999, The alternating decision tree learning algorithm, In Icml, 99, 124 Fu, 2009, Execution anomaly detection in distributed systems through unstructured log analysis, 10.1109/ICDM.2009.60 Fu, 2014, Where do developers log? an empirical study on logging practices in industry, 24 Guo, 2004, Learning from imbalanced data sets with boosting and data generation: the databoost-im approach, ACM SIGKDD Explor. Newsletter, 6, 30, 10.1145/1007730.1007736 Guyon, 2003, An introduction to variable and feature selection, J. Mach. Learn. Res., 3, 1157 Hadoop. Hadoop bug. https://issues.apache.org/jira/browse/MAPREDUCE-5501. (Online; Accessed 18-may-2016). Hadoop. Hadoopt project homepage. http://hadoop.apache.org/. (Online; accessed 18-March-2016). Hall, 2009, The weka data mining software: an update, SIGKDD Explor. Newsl., 11, 10, 10.1145/1656274.1656278 Han, 2011 He, 2009, Learning from imbalanced data, IEEE Trans. Knowl. Data Eng., 21, 1263, 10.1109/TKDE.2008.239 Jonsson, 2016, Automated bug assignment: ensemble-based machine learning in large scale industrial contexts, Empirical Softw. Eng., 21, 1533, 10.1007/s10664-015-9401-9 Kim, 2008, Classifying software changes: clean or buggy?, IEEE Trans. Software Eng., 34, 181, 10.1109/TSE.2007.70773 Kubat, M., Matwin S., et al., 1997. Addressing the curse of imbalanced training sets: one-sided selection. In: ICML, vol. 97. Nashville, USA. pp. 179–186. Lal, 2016, Logopt: Static feature extraction from source code for automated catch block logging prediction, 151 Lal, 2016, Improving logging prediction on imbalanced datasets: a case study on open source java projects, Int. J. Open Source Softw. Processes (IJOSSP), 7, 43, 10.4018/IJOSSP.2016040103 Lal, 2016, Logoptplus: Learning to optimize logging in catch and if programming constructs, vol. 1, 215 Lal, 2017, Eclogger: cross-project catch-block logging prediction using ensemble of classifiers, E-inf. Softw. Eng., 11 Liu, 2005, Toward integrating feature selection algorithms for classification and clustering, IEEE Trans. Knowl. Data Eng., 17, 491, 10.1109/TKDE.2005.66 Manning, 2008 Nagaraj, 2012, Structured comparative analysis of systems logs to diagnose performance problems, 26 Nam, 2013, Transfer defect learning, 382 Pan, 2010, A survey on transfer learning, IEEE Trans. Knowl. Data Eng., 22, 1345, 10.1109/TKDE.2009.191 Pan, 2011, Domain adaptation via transfer component analysis, IEEE Trans. Neural Networks, 22, 199, 10.1109/TNN.2010.2091281 Panichella, 2015, How can i improve my app? classifying user reviews for software maintenance and evolution, 281 Shivaji, 2013, Reducing features to improve code change-based bug prediction, IEEE Trans. Software Eng., 39, 552, 10.1109/TSE.2012.43 Sigelman, B.H., Barroso, L.A., Burrows, M., Stephenson, P., Plakal, M., Beaver, D., Jaspan, S., Shanbhag, C., 2010. Dapper, a large-scale distributed systems tracing infrastructure. Technical report, Technical report. Google. Tan, 2015, Online defect prediction for imbalanced data, vol. 2, 99 Tomcat. Tomcate project homepage. http://tomcat.apache.org//. (Online; Accessed 18-March-2016). Valdivia Garcia, 2014, Characterizing and predicting blocking bugs in open source projects, 72 Witten, I.H., Frank, E., Hall, M.A. Data mining: Practical machine learning tools and techniques. 2011. Xia, X., Lo, D., McIntosh, S., Shihab, E., Hassan, A.E., 2015. Cross-project build co-change prediction. In: 2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER). March 2015. pp. 311–320. Xia, 2015, Elblocker: predicting blocking bugs with ensemble imbalance learning, Inf. Softw. Technol., 61, 93, 10.1016/j.infsof.2014.12.006 Xia, 2016, It takes two to tango: Deleted stack overflow question prediction with text and meta features, vol. 1, 73 Yu, 2004, Efficient feature selection via analysis of relevance and redundancy, J. Mach. Learn. Res., 5, 1205 Zhang, 2015, An empirical study of classifier combination for cross-project defect prediction, vol. 2, 264 Zhu, 2015, Learning to log: helping developers make informed logging decisions, vol. 1, 415