LIMITLESS — LIght-weight MonItoring Tool for LargE Scale Systems

Microprocessors and Microsystems - Tập 93 - Trang 104586 - 2022
Alberto Cascajo1, David E. Singh1, Jesus Carretero1
1University Carlos III of Madrid Leganés, Madrid, Spain

Tài liệu tham khảo

F. Isaila, J. Carretero, R. Ross, CLARISSE: A middleware for data-staging coordination and control on large-scale HPC platforms, in: 16th International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2016, pp. 346–355. Martín, 2015, Enhancing the performance of malleable MPI applications by using performance-aware dynamic reconfiguration, Parallel Comput., 46, 60, 10.1016/j.parco.2015.04.003 Gormley, 2015 Arnold, 2006, Tree-based overlay networks for scalable applications Cascajo, 2019, Performance-aware scheduling of parallel applications on non-dedicated clusters, Electronics, 8, 982, 10.3390/electronics8090982 Gupta, 2015 Cunningham, 2020 Wiebe, 2015, Quantum algorithms for nearest-neighbor methods for supervised and unsupervised learning, Quantum Inf. Comput., 15, 316 Varga, 2010, OMNeT++, 35 Massie, 2004, The ganglia distributed monitoring system: design, implementation, and experience, Parallel Comput., 30, 817, 10.1016/j.parco.2004.04.001 collectd – The system statistics collection daemon, URL https://collectd.org/. Böhm, 2010, Aggregation of real-time system monitoring data for analyzing large-scale parallel and distributed computing environments, 72 Agelastos, 2016, Continuous whole-system monitoring toward rapid understanding of production HPC applications and systems, Parallel Comput., 58, 90, 10.1016/j.parco.2016.05.009 Izadpanah, 2018, Integrating low-latency analysis into HPC system monitoring Netti, 2019, DCDB wintermute: Enabling online and holistic operational data analytics on HPC systems, 101 Sperhac, 2018, Federating XDMoD to monitor affiliated computing resources, 580 Rohl, 2017, LIKWID monitoring stack: A flexible framework enabling job specific performance monitoring for the masses, 2017-September, 781 Yu, 2018, A cross-layer security monitoring selection algorithm based on traffic prediction, IEEE Access, 6, 35382, 10.1109/ACCESS.2018.2851993 S.M. Rashti, M. Mollanoori, M.S. Nia, N.M. Charkari, A prediction-based algorithm for target tracking in wireless sensor networks, in: 2009 International Conference on Ultra Modern Telecommunications and Workshops, 2009. Tang, 2016, Prediction of the bridge monitoring data based on support vector machine, 2016-January, 781 X. Kang, M. Xu, Explore of monitoring data pattern prediction of gas tunnel, in: 2011 International Conference on Remote Sensing, Environment and Transportation Engineering, RSETE 2011 — Proceedings, 2011, pp. 4046–4049. R. Lijia, L. Hong, L. Yan, On-line monitoring and prediction for transmission line sag, in: Proceedings of 2012 IEEE International Conference on Condition Monitoring and Diagnosis, CMD 2012, 2012, pp. 813–817. Bhulai, 2015, Nearest neighbour algorithms for forecasting call arrivals in call centers, Vol. 39, 77, 10.1007/978-3-319-19857-6_8 Ates, 2018, Taxonomist: Application detection through rich monitoring data, 11014 LNCS, 92 Selvathi, 2019, Segmentation of brain tumor tissues in MR images using multiresolution transforms and random forest classifier with adaboost technique Ma, 2019, Real-time foot-ground contact detection for inertial motion capture based on an adaptive weighted Naive Bayes model, IEEE Access, 7, 130312, 10.1109/ACCESS.2019.2939839 Bamler, 2020 Lee, 2020, Bootstrap aggregating and random forest, 389, 10.1007/978-3-030-31150-6_13 Lu, 2020, Hybrid decision tree-based machine learning models for short-term water quality prediction, Chemosphere, 249, 10.1016/j.chemosphere.2020.126169 Mor, 2021, A systematic review of hidden Markov models and their applications, Arch. Comput. Methods Eng. Vol., 28, 1429, 10.1007/s11831-020-09422-4 Li, 2020, Adaptively constrained dynamic time warping for time series classification and clustering, Inform. Sci., 534, 97, 10.1016/j.ins.2020.04.009 Shaban, 2020, A new COVID-19 patients detection strategy (CPDS) based on hybrid feature selection and enhanced KNN classifier, Knowl.-Based Syst., 205, 10.1016/j.knosys.2020.106270 Shahin, 2020, Novel cascaded Gaussian mixture model-deep neural network classifier for speaker identification in emotional talking environments, Neural Comput. Appl., 32, 2575, 10.1007/s00521-018-3760-2 Alsghaier, 2020, Software fault prediction using particle swarm algorithm with genetic algorithm and support vector machine classifier, Softw. - Pract. Exp., 50, 407, 10.1002/spe.2784 Ekanadham, 1995, Application oriented resource management on large scale parallel systems, IBM Research, Yorktown Heights, 56 Fan, 2019, Scheduling beyond CPUs for HPC