Pedestrian detection using first- and second-order aggregate channel features

Blossom Treesa Bastian1, Jiji C.V.1
1College of Engineering Trivandrum, Trivandrum, India

Tóm tắt

The content-based analysis of visual multimedia like images and videos are urgently needed to empower human society for the automation of difficult tasks. Pedestrian detection serves as a backbone for a multitude of image processing and machine learning algorithms and secures quite a lot of real-world applications. Keeping this fact in mind, here, we deal with the fabrication of suitable features to identify human/pedestrian instances from images with near accuracy. Accordingly, we introduce second-order aggregate channel features (SOACF) to enhance the performance of much-celebrated pedestrian detection algorithm which was mainly based on the first-order information in an image—aggregate channel features detector (ACF detector). We experimentally proved the complementary nature of ACF and SOACF. Designed to garner both these features together, instead of simple concatenation, or direct merging of the two detectors, we employed a weighted non-maximum suppression merging algorithm. The prospective detector not only performed well on INRIA, Caltech and KITTI pedestrian data set but also, mitigate the miss rate by $$\sim 4\%$$ in Caltech data set and $$\sim 2\%$$ in KITTI data set in comparison with ACF detector. Despite the fact that our in-house generated detector uses only a few channels, it surpasses many state-of-the-art methods based on baseline ACF detector. Moreover, the detection speed is 100 times faster than the topmost pedestrian detector based on ACF.

Tài liệu tham khảo

Kaur A, Dhir R, Lehal GS (2017) A survey on camera-captured scene text detection and extraction: towards gurmukhi script. Int J Multimed Inf Retr 6(2):115–142 Shirahama K, Grzegorzek M, Uehara K (2015) Weakly supervised detection of video events using hidden conditional random fields. Int J Multimed Inf Retr 4(1):17–32 Saadna Y, Behloul A (2017) An overview of traffic sign detection and classification methods. Int J Multimed Inf Retr 6(3):193–210 Sathish PK, Balaji S (2018) A complete person re-identification model using Kernel-pca-based Gabor-filtered hybrid descriptors. Int J Multimed Inf Retr 7(4):221–229 Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 1, pp 886–893. IEEE Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645 Benenson R, Mathias M, Timofte R, Van Gool L (2012) Pedestrian detection at 100 frames per second. In: CVPR Viola P, Jones MJ, Snow D (2005) Detecting pedestrian using patterns of motion and appearance. Int J Comput Vis 63(2):153–161 Zhang S, Benenson R, Omran M, Hosang J, Schiele B (2016) How far are we from solving pedestrian detection? In: CVPR Dollár P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545 Nam W, Dollár P, Han JH (2014) Local decorrelation for improved pedestrian detection. In: Advances in neural information processing systems, pp 424–432 Yang B, Yan J, Lei Z, Li SZ (2015) Convolutional channel features. In: Proceedings of the IEEE international conference on computer vision, pp 82–90 Paisitkriangkrai S, Shen C, van den Hengel A (2014) Strengthening the effectiveness of pedestrian detection with spatially pooled features. In: European conference on computer vision, pp 546–561. Springer Zhang S, Benenson R, Schiele B (2015) Filtered channel features for pedestrian detection. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 1751–1760. IEEE Dollár P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. PAMI 34:743–761 Geiger A, Lenz P, Urtasun R (2012) Are we ready for autonomous driving? The kitti vision benchmark suite. In: Conference on computer vision and pattern recognition (CVPR) Zhang S, Bauckhage C, Cremers AB (2014) Informed haar-like features improve pedestrian detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 947–954 Lim JJ, Zitnick CL, Dollr, P (2013) Sketch tokens: a learned mid-level representation for contour and object detection. In: 2013 IEEE conference on computer vision and pattern recognition, pp 3158–3165 Cao H, Yamaguchi K, Naito T, Ninomiya Y (2009) Pedestrian recognition using second-order hog feature. In: Asian conference on computer vision, pp 628–634. Springer Jiang Y, Ma J (2015) Combination features and models for human detection. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition 07 June 2012, pp 240–248. 10.1109/CVPR.2015.7298620 Huang D, Zhu C, Wang Y, Chen L (2014) Hsog: a novel local image descriptor based on histograms of the second-order gradients. IEEE Trans Image Process 23(11):4680–4695 Benenson R, Mathias M, Tuytelaars T, Van Gool L (2013) Seeking the strongest rigid detector. In: CVPR