LASSO-based feature selection and naïve Bayes classifier for crime prediction and its type

Springer Science and Business Media LLC - Volume 13 - Pages 187-197 - 2019
Gnaneswara Rao Nitta (1), B. Yogeshwara Rao (2), T. Sravani (3), N. Ramakrishiah (4), M. BalaAnand (5)
(1) Computer Science Department, Vignan University, Vadlamudi, India
(2) Vignan University, Vadlamudi, India
(3) Tata Consultancy Services, Mumbai, India
(4) JNTU Kakinada, Kakinada, India
(5) V.R.S. College of Engineering and Technology, Villupuram, India

Abstract

For centuries, crime has been viewed as random because it depends on human behavior; even now, it involves too many factors for current machine learning models to forecast accurately. In this work, we discuss early crime prediction results from a model developed using the Chicago crime dataset. Predicting future crime accurately and with good performance remains a challenging task because of the growing number of crimes in recent years. A crime prediction method is therefore essential: by identifying likely future crimes, it can help reduce the number of crimes. In this paper, we build a model that predicts future crime occurrences and the type of crime likely to occur in a given area. First, we analyze crime features such as date and time, together with geographically relevant features such as latitude and longitude. Second, we discuss the analytics techniques we used to find meaning in our data, namely LASSO feature selection and classification models such as naïve Bayes and SVM. Finally, we select the best model for predicting the type and seriousness of a crime from the given features.
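
As a rough illustration of the two-stage approach outlined in the abstract (LASSO feature selection followed by a naïve Bayes classifier), the sketch below runs both stages on synthetic data. It is not the authors' implementation. The data generator, the feature names (`hour`, `latitude`, `longitude`, `noise`), the binary crime-type label, the penalty `lam=0.08`, and all helper function names are assumptions made for the example. LASSO is fitted here by plain cyclic coordinate descent with the 0/1 label as a regression target, which is a common simplification, and a Gaussian likelihood is assumed for the naïve Bayes stage.

```python
import math
import random

def standardize(X):
    """Scale every column to zero mean and unit variance."""
    n, p = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(p)]
    stds = [math.sqrt(sum((row[j] - means[j]) ** 2 for row in X) / n) or 1.0
            for j in range(p)]
    return [[(row[j] - means[j]) / stds[j] for j in range(p)] for row in X]

def soft_threshold(rho, lam):
    """Shrink rho toward zero by lam; values within [-lam, lam] become 0."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0

def lasso_select(X, y, lam, n_iter=100):
    """Return indices of features whose LASSO coefficient is non-zero.

    Minimizes (1/2n)*||y - Zw||^2 + lam*||w||_1 by cyclic coordinate
    descent, where Z is the standardized copy of X and y is centered.
    """
    n, p = len(X), len(X[0])
    Z = standardize(X)
    y_mean = sum(y) / n
    residual = [yi - y_mean for yi in y]  # residual for w = 0
    w = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            # Covariance of column j with the residual that excludes feature j.
            rho = sum(Z[i][j] * (residual[i] + w[j] * Z[i][j]) for i in range(n)) / n
            new_w = soft_threshold(rho, lam)  # unit-variance column: no rescaling
            delta = new_w - w[j]
            if delta:
                for i in range(n):
                    residual[i] -= delta * Z[i][j]
                w[j] = new_w
    return [j for j in range(p) if w[j] != 0.0]

def fit_gaussian_nb(X, y):
    """Estimate a log prior and per-feature mean/variance for each class."""
    model = {}
    for c in set(y):
        rows = [x for x, label in zip(X, y) if label == c]
        k = len(rows[0])
        means = [sum(r[j] for r in rows) / len(rows) for j in range(k)]
        variances = [sum((r[j] - means[j]) ** 2 for r in rows) / len(rows) + 1e-9
                     for j in range(k)]
        model[c] = (math.log(len(rows) / len(X)), means, variances)
    return model

def predict_gaussian_nb(model, x):
    """Return the class with the highest log posterior for sample x."""
    def log_posterior(c):
        log_prior, means, variances = model[c]
        return log_prior + sum(
            -0.5 * math.log(2 * math.pi * v) - (xi - m) ** 2 / (2 * v)
            for xi, m, v in zip(x, means, variances))
    return max(model, key=log_posterior)

# Synthetic stand-in for crime records (NOT the Chicago dataset).
random.seed(0)
FEATURES = ["hour", "latitude", "longitude", "noise"]
X, y = [], []
for _ in range(300):
    label = random.randint(0, 1)  # hypothetical binary crime type
    X.append([
        random.gauss(20 if label else 12, 4),        # informative by construction
        random.gauss(41.9 if label else 41.7, 0.1),  # informative by construction
        random.gauss(-87.65, 0.1),                   # uninformative by construction
        random.gauss(0, 1),                          # pure noise
    ])
    y.append(label)

# Stage 1: feature selection on the training split only.
selected = lasso_select(X[:200], y[:200], lam=0.08)

def project(rows):
    """Keep only the columns chosen by LASSO."""
    return [[row[j] for j in selected] for row in rows]

# Stage 2: naive Bayes on the selected features, scored on held-out rows.
model = fit_gaussian_nb(project(X[:200]), y[:200])
predictions = [predict_gaussian_nb(model, x) for x in project(X[200:])]
accuracy = sum(pred == true for pred, true in zip(predictions, y[200:])) / 100
print("selected features:", [FEATURES[j] for j in selected])
print("held-out accuracy:", accuracy)
```

Selecting features on the training split alone keeps the held-out score free of leakage from the selection step. A larger `lam` drops more features, so in practice the penalty would be tuned, for example by cross-validation.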
