Cooperative multi-camera vehicle tracking and traffic surveillance with edge artificial intelligence and representation learning

Hao (Frank) Yang1, Jiarui Cai2, Chenxi Liu1, Ruimin Ke3, Yinhai Wang1,2
1Department of Civil and Environmental Engineering, University of Washington, Seattle, WA 98195, USA
2Department of Electrical and Computer Engineering, University of Washington, Seattle, WA 98195, USA
3Department of Civil Engineering, The University of Texas at El Paso, El Paso, TX 79968, USA

Tài liệu tham khảo

Ansari, 2018, The earth ain’t flat: Monocular reconstruction of vehicles on steep and graded roads from a moving camera, 8404 Bergmann, Philipp, Meinhardt, Tim, Leal-Taixe, Laura, 2019. Tracking without bells and whistles. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 941–951. Bochkovskiy, 2020 Chen, 2019, Deep learning with edge computing: A review, Proc. IEEE, 107, 1655, 10.1109/JPROC.2019.2921977 Cui, 2016, New progress of DRIVE net: An E-science transportation platform for data sharing, visualization, modeling, and analysis, 1 Deng, 2009, Imagenet: A large-scale hierarchical image database, 248 Dou, 2020, Edge computing-enabled deep learning for real-time video optimization in IIoT, IEEE Trans. Ind. Inform., 17, 2842, 10.1109/TII.2020.3020386 Eckstein, Viktor, Schumann, Arne, Specker, Andreas, 2020. Large Scale Vehicle Re-Identification by Knowledge Transfer From Simulated Data and Temporal Attention. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. pp. 616–617. Feichtenhofer, Christoph, Pinz, Axel, Zisserman, Andrew, 2017. Detect to track and track to detect. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 3038–3046. GeekAlexis, 2020 Girshick, Ross, 2015. Fast r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 1440–1448. González, 2011, Automatic traffic signs and panels inspection system using computer vision, IEEE Trans. Intell. Transp. Syst., 12, 485, 10.1109/TITS.2010.2098029 Guo, Hongpeng, Yao, Shuochao, Yang, Zhe, Zhou, Qian, Nahrstedt, Klara, 2021. CrossRoI: cross-camera region of interest optimization for efficient real time video analytics at scale. In: Proceedings of the 12th ACM Multimedia Systems Conference. pp. 186–199. He, Kaiming, Gkioxari, Georgia, Dollár, Piotr, Girshick, Ross, 2017. Mask r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 2961–2969. He, Yuhang, Han, Jie, Yu, Wentao, Hong, Xiaopeng, Wei, Xing, Gong, Yihong, 2020a. City-scale multi-camera vehicle tracking by semantic attribute parsing and cross-camera tracklet matching. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. pp. 576–577. He, 2020 He, 2015, Spatial pyramid pooling in deep convolutional networks for visual recognition, IEEE Trans. Pattern Anal. Mach. Intell., 37, 1904, 10.1109/TPAMI.2015.2389824 He, Kaiming, Zhang, Xiangyu, Ren, Shaoqing, Sun, Jian, 2016. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 770–778. Horn, 1981, Determining optical flow, 319 Hsu, Hung-Min, Huang, Tsung-Wei, Wang, Gaoang, Cai, Jiarui, Lei, Zhichao, Hwang, Jenq-Neng, 2019. Multi-Camera Tracking of Vehicles based on Deep Features Re-ID and Trajectory-Based Camera Link Models. In: AI City Challenge Workshop, IEEE/CVF Computer Vision and Pattern Recognition (CVPR) Conference, Long Beach, California. Huang, Tsung-Wei, Cai, Jiarui, Yang, Hao, Hsu, Hung-Min, Hwang, Jenq-Neng, 2019. Multi-View Vehicle Re-Identification using Temporal Attention Model and Metadata Re-ranking. In: AI City Challenge Workshop, IEEE/CVF Computer Vision and Pattern Recognition (CVPR) Conference, Long Beach, California. Huang, 2020, DotSCN: Group re-identification via domain-transferred single and couple representation learning, IEEE Trans. Circuits Syst. Video Technol., 31, 2739, 10.1109/TCSVT.2020.3031303 Jain, 2020, Spatula: Efficient cross-camera video analytics on large camera networks, 110 Jin, 2014, An information framework for creating a smart city through internet of things, IEEE Internet Things J., 1, 112, 10.1109/JIOT.2013.2296516 Ke, 2016, Real-time bidirectional traffic flow parameter estimation from aerial videos, IEEE Trans. Intell. Transp. Syst., 18, 890, 10.1109/TITS.2016.2595526 Ke, 2022, Real-time traffic and road surveillance with parallel edge intelligence, IEEE J. Radio Freq. Identif., 10.1109/JRFID.2022.3211262 Ke, 2020, A smart, efficient, and reliable parking surveillance system with edge artificial intelligence on IoT devices, IEEE Trans. Intell. Transp. Syst. Khan, 2019, A survey of advances in vision-based vehicle re-identification, Comput. Vis. Image Underst., 182, 50, 10.1016/j.cviu.2019.03.001 Koubaa, 2021, Cloud versus edge deployment strategies of real-time face recognition inference, IEEE Trans. Netw. Sci. Eng., 9, 143, 10.1109/TNSE.2021.3055835 Kuma, 2019, Vehicle re-identification: an efficient baseline using triplet embedding, 1 Lam, Siu Kwan, Pitrou, Antoine, Seibert, Stanley, 2015. Numba: A llvm-based python jit compiler. In: Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC. pp. 1–6. Li, 2017, Policy-based secure and trustworthy sensing for internet of things in smart cities, IEEE Internet Things J., 5, 716, 10.1109/JIOT.2017.2720635 Liang, 2018, Enhancing video rate adaptation with mobile edge computing and caching in software-defined mobile networks, IEEE Trans. Wireless Commun., 17, 7013, 10.1109/TWC.2018.2865354 Lin, 2019, Refinenet: Multi-path refinement networks for dense prediction, IEEE Trans. Pattern Anal. Mach. Intell., 42, 1228 Lin, 2014, Microsoft coco: Common objects in context, 740 Liu, 2016, Ssd: Single shot multibox detector, 21 Lou, 2019, Embedding adversarial learning for vehicle re-identification, IEEE Trans. Image Process., 28, 3794, 10.1109/TIP.2019.2902112 Luo, 2018, MIO-TCD: A new benchmark dataset for vehicle classification and localization, IEEE Trans. Image Process., 27, 5129, 10.1109/TIP.2018.2848705 Luo, Hao, Gu, Youzhi, Liao, Xingyu, Lai, Shenqi, Jiang, Wei, 2019. Bag of tricks and a strong baseline for deep person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Malinovskiy, 2009, Video-based vehicle detection and tracking using spatiotemporal maps, Transp. Res. Rec., 2121, 81, 10.3141/2121-09 Naphade, Milind, Wang, Shuo, Anastasiu, David C, Tang, Zheng, Chang, Ming-Ching, Yang, Xiaodong, Zheng, Liang, Sharma, Anuj, Chellappa, Rama, Chakraborty, Pranamesh, 2020. The 4th AI City Challenge. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. pp. 626–627. Neff, 2019, REVAMP 2 T: Real-time edge video analytics for multicamera privacy-aware pedestrian tracking, IEEE Internet Things J., 7, 2591, 10.1109/JIOT.2019.2954804 Nguyen, 2017, Improving poor GPS area localization for intelligent vehicles, 417 Nikodem, 2020, Multi-camera vehicle tracking using edge computing and low-power communication, Sensors, 20, 3334, 10.3390/s20113334 Pan, Xingang, Luo, Ping, Shi, Jianping, Tang, Xiaoou, 2018. Two at once: Enhancing learning and generalization capacities via ibn-net. In: Proceedings of the European Conference on Computer Vision. ECCV, pp. 464–479. Qian, Yijun, Yu, Lijun, Liu, Wenhe, Hauptmann, Alexander G., 2020. ELECTRICITY: An efficient multi-camera vehicle tracking system for intelligent city. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. pp. 588–589. Redmon, Joseph, Divvala, Santosh, Girshick, Ross, Farhadi, Ali, 2016. You only look once: Unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 779–788. Redmon, 2018 Ren, 2015, Faster r-cnn: Towards real-time object detection with region proposal networks, 91 Schrank, 2019 Schroff, Florian, Kalenichenko, Dmitry, Philbin, James, 2015. Facenet: A unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 815–823. Shakeri, Moein, Zhang, Hong, 2017. Moving Object Detection in Time-Lapse or Motion Trigger Image Sequences Using Low-Rank and Invariant Sparse Decomposition. In: Proceedings of the IEEE International Conference on Computer Vision. ICCV. Suprem, 2020 Tang, 2019, Moana: An online learned adaptive appearance model for robust multiple object tracking in 3d, IEEE Access, 7, 31934, 10.1109/ACCESS.2019.2903121 Tang, Zheng, Naphade, Milind, Liu, Ming-Yu, Yang, Xiaodong, Birchfield, Stan, Wang, Shuo, Kumar, Ratnesh, Anastasiu, David, Hwang, Jenq-Neng, 2019. Cityflow: A city-scale benchmark for multi-target multi-camera vehicle tracking and re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 8797–8806. Wang, Gaoang, Wang, Yizhou, Zhang, Haotian, Gu, Renshu, Hwang, Jenq-Neng, 2019. Exploit the connectivity: Multi-object tracking with trackletnet. In: Proceedings of the 27th ACM International Conference on Multimedia. pp. 482–490. Wojke, 2017, Simple online and realtime tracking with a deep association metric, 3645 Wu, 2018, A light cnn for deep face representation with noisy labels, IEEE Trans. Inf. Forensics Secur., 13, 2884, 10.1109/TIFS.2018.2833032 Xu, Jing, Zhao, Rui, Zhu, Feng, Wang, Huaming, Ouyang, Wanli, 2018. Attention-aware compositional network for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 2119–2128. Yang, 2020 Yang, 2022, Traffic-informed multi-camera sensing (TIMS) system based on vehicle re-identification, IEEE Trans. Intell. Transp. Syst. Yang, 2022, Toward a real-time smart parking data management and prediction (SPDMP) system by attributes representation learning, Int. J. Intell. Syst., 37, 4437, 10.1002/int.22725 Yang, 2022, Cooperative traffic signal assistance system for non-motorized users and disabilities empowered by computer vision and edge artificial intelligence, Transp. Res. C, 145, 10.1016/j.trc.2022.103896 Yang, 2021, How fast you will drive? Predicting speed of customized paths by deep neural network, IEEE Trans. Intell. Transp. Syst., 23, 2045, 10.1109/TITS.2020.3031026 Yang, 2021, Truck parking pattern aggregation and availability prediction by deep learning, IEEE Trans. Intell. Transp. Syst. Yang, 2018, Vehicle detection in intelligent transportation systems and its applications under varying environments: A review, Image Vis. Comput., 69, 143, 10.1016/j.imavis.2017.09.008 Ye, 2020 Ye, 2021, Deep learning for person re-identification: A survey and outlook, IEEE Trans. Pattern Anal. Mach. Intell. Yun, 2019, Minimum sampling size of floating cars for urban link travel time distribution estimation, Transp. Res. Rec., 2673, 24, 10.1177/0361198119834297 Zhang, 2007, Video-based vehicle detection and classification system for real-time traffic data collection using uncalibrated video cameras, Transp. Res. Rec., 1993, 138, 10.3141/1993-19 Zhou, 2021, When intelligent transportation systems sensing meets edge computing: Vision and challenges, Appl. Sci., 11, 9680, 10.3390/app11209680 Zhou, Kaiyang, Yang, Yongxin, Cavallaro, Andrea, Xiang, Tao, 2019. Omni-scale feature learning for person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 3702–3712. Zhu, Xiangyu, Luo, Zhenbo, Fu, Pei, Ji, Xiang, 2020. VOC-ReID: Vehicle Re-identification based on Vehicle-Orientation-Camera. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. pp. 602–603. Zhuang, 2022, Edge-artificial intelligence-powered parking surveillance with quantized neural networks, IEEE Intell. Transp. Syst. Mag., 10.1109/MITS.2022.3182358