ObjectFusion: An object detection and segmentation framework with RGB-D SLAM and convolutional neural networks

Neurocomputing - Tập 345 - Trang 3-14 - 2019
Guanzhong Tian1, Liang Liu2, JongHyok Ri2, Yong Liu3, Yiran Sun1
1State Key Laboratory of Industrial Control Technology, Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou 310027, China
2State Key Laboratory of Industrial Control Technology, Institute of Cyber Systems and Control, Zhejiang University, Hangzhou, 310027, China
3Institute of Information Technology, Kim Il Song University, Pyongyang 190016, Republic of Korea

Tài liệu tham khảo

Luan, 2017, Fast task-specific target detection via graph based constraints representation and checking, 3984 Liu, 2015, Detection based object labeling of 3D point cloud for indoor scenes, Neurocomputing, 174 Long, 2015, Fully convolutional networks for semantic segmentation, 3431 Zheng, 2015, Conditional random fields as recurrent neural networks, 1529 Whelan, 2015, ElasticFusion: dense SLAM without a pose graph Whelan, 2015, Real-time large-scale dense RGB-D slam with volumetric fusion, Int. J. Robot. Res., 34, 598, 10.1177/0278364914551008 Zuo, 2017, Robust visual slam with point and line features, 1775 Kundu, 2014, Joint semantic segmentation and 3D reconstruction from monocular video, Vol. 8694, 703 Redmon, 2016, You only look once: Unified, real-time object detection W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, A.C. Berg, SSD: single shot multibox detector, Springer International Publishing, Cham, pp. 21–37, 10.1007/978-3-319-46448-0_2. Girshick, 2015, Fast R-CNN Ren, 2015, Faster R-CNN: towards real-time object detection with region proposal networks, 91 Stückler, 2014, Multi-resolution surfel maps for efficient dense 3D modeling and tracking, J. Vis. Commun. Image Rep., 25, 137, 10.1016/j.jvcir.2013.02.008 McCormac, 2017, Semanticfusion: dense 3D semantic mapping with convolutional neural networks, 4628 Sünderhauf, 2017, Meaningful maps with object-oriented semantic mapping, 5079 Engel, 2014, LSD-SLAM: large-scale direct monocular slam Mur-Artal, 2015, ORB-SLAM: a versatile and accurate monocular slam system, IEEE Trans. Robot., 31, 1147, 10.1109/TRO.2015.2463671 Newcombe, 2011, Dtam: dense tracking and mapping in real-time, 2320 Keller, 2013, Real-time 3D reconstruction in dynamic scenes using point-based fusion, 1 Newcombe, 2011, Kinectfusion: real-time dense surface mapping and tracking, 127 Engel, 2013, Semi-dense visual odometry for a monocular camera, 1449 Klein, 2007, Parallel tracking and mapping for small ar workspaces, 1 Klein, 2008, Improving the agility of keyframe-based slam Salas-Moreno, 2013, Slam++: simultaneous localisation and mapping at the level of objects, 1352 Pillai, 2015, Monocular slam supported object recognition Dalal, 2005, Histograms of oriented gradients for human detection, 886 Lowe, 2004, Distinctive image features from scale-invariant keypoints, Int. J. Comput. Vis., 60, 91, 10.1023/B:VISI.0000029664.99615.94 Shi, 2000, Normalized cuts and image segmentation, IEEE Trans. Pattern Anal. Mach. Intell., 22, 888, 10.1109/34.868688 Rother, 2004, “GrabCut”–Interactive foreground extraction using iterated graph cuts, ACM Trans. Gr., 23, 309, 10.1145/1015706.1015720 Jia, 2014, Caffe: convolutional architecture for fast feature embedding, 675 Pfister, 2000, Surfels: surface elements as rendering primitives, 335 Sumner, 2007, Embedded deformation for shape manipulation, 26, 80 Ozuysal, 2010, Fast keypoint recognition using random ferns, IEEE Trans. Pattern Anal. Mach. Intell., 32, 448, 10.1109/TPAMI.2009.23 Fischler, 1981, Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, Commun. ACM, 24, 381, 10.1145/358669.358692 Hoiem, 2009, Pascal VOC 2008 challenge, PASCAL challenge workshop in ECCV Silberman, 2012, Indoor segmentation and support inference from RGBD images, 746 He, 2017, Mask R-CNN, 2980 Chen, 2018, Encoder–decoder with atrous separable convolution for semantic image segmentation Everingham, 2015, The pascal visual object classes challenge: a retrospective, International Journal of Computer Vision (IJCV), 111, 98, 10.1007/s11263-014-0733-5