ObjectFusion: An object detection and segmentation framework with RGB-D SLAM and convolutional neural networks

Neurocomputing - Tập 345 - Trang 3-14 - 2019

Guanzhong Tian¹, Liang Liu², JongHyok Ri², Yong Liu³, Yiran Sun¹

¹State Key Laboratory of Industrial Control Technology, Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou 310027, China

²State Key Laboratory of Industrial Control Technology, Institute of Cyber Systems and Control, Zhejiang University, Hangzhou, 310027, China

³Institute of Information Technology, Kim Il Song University, Pyongyang 190016, Republic of Korea

Tài liệu tham khảo

Luan, 2017, Fast task-specific target detection via graph based constraints representation and checking, 3984 Liu, 2015, Detection based object labeling of 3D point cloud for indoor scenes, Neurocomputing, 174 Long, 2015, Fully convolutional networks for semantic segmentation, 3431 Zheng, 2015, Conditional random fields as recurrent neural networks, 1529 Whelan, 2015, ElasticFusion: dense SLAM without a pose graph Whelan, 2015, Real-time large-scale dense RGB-D slam with volumetric fusion, Int. J. Robot. Res., 34, 598, 10.1177/0278364914551008 Zuo, 2017, Robust visual slam with point and line features, 1775 Kundu, 2014, Joint semantic segmentation and 3D reconstruction from monocular video, Vol. 8694, 703 Redmon, 2016, You only look once: Unified, real-time object detection W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, A.C. Berg, SSD: single shot multibox detector, Springer International Publishing, Cham, pp. 21–37, 10.1007/978-3-319-46448-0_2. Girshick, 2015, Fast R-CNN Ren, 2015, Faster R-CNN: towards real-time object detection with region proposal networks, 91 Stückler, 2014, Multi-resolution surfel maps for efficient dense 3D modeling and tracking, J. Vis. Commun. Image Rep., 25, 137, 10.1016/j.jvcir.2013.02.008 McCormac, 2017, Semanticfusion: dense 3D semantic mapping with convolutional neural networks, 4628 Sünderhauf, 2017, Meaningful maps with object-oriented semantic mapping, 5079 Engel, 2014, LSD-SLAM: large-scale direct monocular slam Mur-Artal, 2015, ORB-SLAM: a versatile and accurate monocular slam system, IEEE Trans. Robot., 31, 1147, 10.1109/TRO.2015.2463671 Newcombe, 2011, Dtam: dense tracking and mapping in real-time, 2320 Keller, 2013, Real-time 3D reconstruction in dynamic scenes using point-based fusion, 1 Newcombe, 2011, Kinectfusion: real-time dense surface mapping and tracking, 127 Engel, 2013, Semi-dense visual odometry for a monocular camera, 1449 Klein, 2007, Parallel tracking and mapping for small ar workspaces, 1 Klein, 2008, Improving the agility of keyframe-based slam Salas-Moreno, 2013, Slam++: simultaneous localisation and mapping at the level of objects, 1352 Pillai, 2015, Monocular slam supported object recognition Dalal, 2005, Histograms of oriented gradients for human detection, 886 Lowe, 2004, Distinctive image features from scale-invariant keypoints, Int. J. Comput. Vis., 60, 91, 10.1023/B:VISI.0000029664.99615.94 Shi, 2000, Normalized cuts and image segmentation, IEEE Trans. Pattern Anal. Mach. Intell., 22, 888, 10.1109/34.868688 Rother, 2004, “GrabCut”–Interactive foreground extraction using iterated graph cuts, ACM Trans. Gr., 23, 309, 10.1145/1015706.1015720 Jia, 2014, Caffe: convolutional architecture for fast feature embedding, 675 Pfister, 2000, Surfels: surface elements as rendering primitives, 335 Sumner, 2007, Embedded deformation for shape manipulation, 26, 80 Ozuysal, 2010, Fast keypoint recognition using random ferns, IEEE Trans. Pattern Anal. Mach. Intell., 32, 448, 10.1109/TPAMI.2009.23 Fischler, 1981, Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, Commun. ACM, 24, 381, 10.1145/358669.358692 Hoiem, 2009, Pascal VOC 2008 challenge, PASCAL challenge workshop in ECCV Silberman, 2012, Indoor segmentation and support inference from RGBD images, 746 He, 2017, Mask R-CNN, 2980 Chen, 2018, Encoder–decoder with atrous separable convolution for semantic image segmentation Everingham, 2015, The pascal visual object classes challenge: a retrospective, International Journal of Computer Vision (IJCV), 111, 98, 10.1007/s11263-014-0733-5

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Về chúng tôi

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích các bài báo, công bố khoa học Việt Nam. Công cụ trợ giúp người nghiên cứu, tạp chí, đơn vị nghiên cứu tra cứu, phân tích và thống kê dữ liệu nghiên cứu khoa học tại Việt Nam và quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia vào Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Công cụ kiểm tra chính tả và thể thức Viver

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA