ALSketch: An adaptive learning-based sketch for accurate network measurement under dynamic traffic distribution

Journal of Network and Computer Applications - Tập 216 - Trang 103659 - 2023
Xiaojun Cheng1, Xuyang Jing2, Zheng Yan2, Xian Li1, Pu Wang3, Wei Wu3
1Institute for Future, School of Automation, Qingdao University, China
2School of Cyber Engineering, Xidian University, China
3Academy of Cyberspace Studies, Beijing, China

Tài liệu tham khảo

CAIDA, 2016 Charikar, 2004, Finding frequent items in data streams, Theoret. Comput. Sci., 312, 3, 10.1016/S0304-3975(03)00400-6 CIC, ., (2016) Cicflowmeter, https://www.unb.ca/cic/research/applications.html. Cormode, 2005, An improved data stream summary: the count-min sketch and its applications, J. Algorithms, 55, 58, 10.1016/j.jalgor.2003.12.001 Dalt, 2022, Bayesian sketches for volume estimation in data streams, Proc. VLDB Endow., 16, 657, 10.14778/3574245.3574252 Deng, 2007 Domingos, P., Hulten, G., 2000. Mining high-speed data streams. In: Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. pp. 71–80. Du, 2021, Putting the “learning, 2860 Estan, C., Varghese, G., 2002. New directions in traffic measurement and accounting. In: Proceedings of the 2002 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications. pp. 323–336. Ferragina, 2020, Learned data structures, 5 Flajolet, 1985, Approximate counting: a detailed analysis, BIT Numer. Math., 25, 113, 10.1007/BF01934993 Fu, 2019 Gama, 2014, A survey on concept drift adaptation, ACM Comput. Surv., 46, 1, 10.1145/2523813 Hoeffding, 1963, Probability inequalities for sums of bounded random variables, J. Amer. Statist. Assoc., 58, 13, 10.1080/01621459.1963.10500830 Hoi, 2021, Online learning: A comprehensive survey, Neurocomputing, 459, 249, 10.1016/j.neucom.2021.04.112 Hoi, 2014, Libol: A library for online learning algorithms, J. Mach. Learn. Res., 15, 495 Hsu, C.-Y., Indyk, P., Katabi, D., Vakilian, A., 2019. Learning-Based Frequency Estimation Algorithms. In: Proceedings of International Conference on Learning Representations. Huang, Q., Sheng, S., Chen, X., Bao, Y., Zhang, R., Xu, Y., Zhang, G., 2021. Toward Nearly-Zero-Error Sketching via Compressive Sensing. In: Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation. pp. 1027–1044. Jia, 2021, Loglog filter: Filtering cold items within a large range over high speed data streams, 804 Jiang, T., Li, Y., Lin, H., Ruan, Y., Woodruff, D.P., 2020. Learning-augmented data stream algorithms. In: ICLR. Jing, 2021, SuperSketch: A multi-dimensional reversible data structure for super host identification, IEEE Trans. Dependable Secure Comput., 19, 2741, 10.1109/TDSC.2021.3072295 Jing, 2021, ExtendedSketch: Fusing network traffic for super host identification with a memory efficient sketch, IEEE Trans. Dependable Secure Comput., 19, 3913, 10.1109/TDSC.2021.3111328 Jing, 2019, Network traffic fusion and analysis against DDoS flooding attacks with a novel reversible sketch, Inf. Fusion, 51, 100, 10.1016/j.inffus.2018.10.013 Jing, 2018, Security data collection and data analytics in the internet: A survey, IEEE Commun. Surv. Tutor., 21, 586, 10.1109/COMST.2018.2863942 Li, J., Li, Z., Xu, Y., Jiang, S., Yang, T., Cui, B., Dai, Y., Zhang, G., 2020a. Wavingsketch: An unbiased and generic sketch for finding top-k items in data streams. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. pp. 1574–1584. Li, 2020 Lou, 2022, An efficient adaptive denoising sketch for per-flow traffic measurement, 161 Lu, 2022, LUSketch: A fast and precise sketch for top-k finding in data streams, 1 Manapragada, C., Webb, G.I., Salehi, M., 2018. Extremely fast decision tree. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. pp. 1953–1962. Montiel, 2021, River: machine learning for streaming data in python, J. Mach. Learn. Res., 22, 4945 Morris, 1978, Counting large numbers of events in small registers, Commun. ACM, 21, 840, 10.1145/359619.359627 Pitel, G., Fouquier, G., 2015. Count-min-log sketch: Approximately counting with approximate counters. In: Proceedings of International Symposium on Web AlGorithms. Ravichandran, 2019 Roy, P., Khan, A., Alonso, G., 2016. Augmented sketch: Faster and more accurate stream processing. In: Proceedings of the 2016 International Conference on Management of Data. pp. 1449–1463. Singh, 2020, Probabilistic data structures for big data analytics: A comprehensive review, Knowl.-Based Syst., 188, 10.1016/j.knosys.2019.104987 Sivaraman, V., Narayana, S., Rottenstreich, O., Muthukrishnan, S., Rexford, J., 2017. Heavy-hitter detection entirely in the data plane. In: Proceedings of the Symposium on SDN Research. pp. 164–176. Tsymbal, 2004, The problem of concept drift: definitions and related work, Comput. Sci. Dep. Trinity Coll. Dublin, 106, 58 Xiao, 2023, Universal and accurate sketch for estimating heavy hitters and moments in data streams, IEEE/ACM Trans. Netw., 10.1109/TNET.2022.3216025 Xiao, 2017, Cardinality estimation for elephant flows: A compact solution based on virtual register sharing, IEEE/ACM Trans. Netw., 25, 3738, 10.1109/TNET.2017.2753842 Xiao, 2022, Accurately identify time-decaying heavy hitters by decay-aware cuckoo filter along kicking path, 1 Yan, 2022, TalentSketch: LSTM-based sketch for adaptive and high-precision network measurement, 1 Yang, 2019, Diamond sketch: Accurate per-flow measurement for big streaming data, IEEE Trans. Parallel Distrib. Syst., 30, 2650, 10.1109/TPDS.2019.2923772 Yang, T., Gong, J., Zhang, H., Zou, L., Shi, L., Li, X., 2018a. Heavyguardian: Separate and guard hot items in data streams. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. pp. 2584–2593. Yang, T., Jiang, J., Liu, P., Huang, Q., Gong, J., Zhou, Y., Miao, R., Li, X., Uhlig, S., 2018b. Elastic sketch: Adaptive and fast network-wide measurements. In: Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication. pp. 561–575. Yang, 2022, Light: A component enhances faster and more accurate traffic measurement, 3673 Yang, 2019, HeavyKeeper: an accurate algorithm for finding Top-k elephant flows, IEEE/ACM Trans. Netw., 27, 1845, 10.1109/TNET.2019.2933868 Zhang, 2020, Learned sketches for frequency estimation, Inform. Sci., 507, 365, 10.1016/j.ins.2019.08.045 Zhou, Y., Yang, T., Jiang, J., Cui, B., Yu, M., Li, X., Uhlig, S., 2018. Cold filter: A meta-framework for faster and more accurate stream processing. In: Proceedings of the 2018 International Conference on Management of Data. pp. 741–756.