ObjectFusion: An object detection and segmentation framework with RGB-D SLAM and convolutional neural networks
Tài liệu tham khảo
Luan, 2017, Fast task-specific target detection via graph based constraints representation and checking, 3984
Liu, 2015, Detection based object labeling of 3D point cloud for indoor scenes, Neurocomputing, 174
Long, 2015, Fully convolutional networks for semantic segmentation, 3431
Zheng, 2015, Conditional random fields as recurrent neural networks, 1529
Whelan, 2015, ElasticFusion: dense SLAM without a pose graph
Whelan, 2015, Real-time large-scale dense RGB-D slam with volumetric fusion, Int. J. Robot. Res., 34, 598, 10.1177/0278364914551008
Zuo, 2017, Robust visual slam with point and line features, 1775
Kundu, 2014, Joint semantic segmentation and 3D reconstruction from monocular video, Vol. 8694, 703
Redmon, 2016, You only look once: Unified, real-time object detection
W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, A.C. Berg, SSD: single shot multibox detector, Springer International Publishing, Cham, pp. 21–37, 10.1007/978-3-319-46448-0_2.
Girshick, 2015, Fast R-CNN
Ren, 2015, Faster R-CNN: towards real-time object detection with region proposal networks, 91
Stückler, 2014, Multi-resolution surfel maps for efficient dense 3D modeling and tracking, J. Vis. Commun. Image Rep., 25, 137, 10.1016/j.jvcir.2013.02.008
McCormac, 2017, Semanticfusion: dense 3D semantic mapping with convolutional neural networks, 4628
Sünderhauf, 2017, Meaningful maps with object-oriented semantic mapping, 5079
Engel, 2014, LSD-SLAM: large-scale direct monocular slam
Mur-Artal, 2015, ORB-SLAM: a versatile and accurate monocular slam system, IEEE Trans. Robot., 31, 1147, 10.1109/TRO.2015.2463671
Newcombe, 2011, Dtam: dense tracking and mapping in real-time, 2320
Keller, 2013, Real-time 3D reconstruction in dynamic scenes using point-based fusion, 1
Newcombe, 2011, Kinectfusion: real-time dense surface mapping and tracking, 127
Engel, 2013, Semi-dense visual odometry for a monocular camera, 1449
Klein, 2007, Parallel tracking and mapping for small ar workspaces, 1
Klein, 2008, Improving the agility of keyframe-based slam
Salas-Moreno, 2013, Slam++: simultaneous localisation and mapping at the level of objects, 1352
Pillai, 2015, Monocular slam supported object recognition
Dalal, 2005, Histograms of oriented gradients for human detection, 886
Lowe, 2004, Distinctive image features from scale-invariant keypoints, Int. J. Comput. Vis., 60, 91, 10.1023/B:VISI.0000029664.99615.94
Shi, 2000, Normalized cuts and image segmentation, IEEE Trans. Pattern Anal. Mach. Intell., 22, 888, 10.1109/34.868688
Rother, 2004, “GrabCut”–Interactive foreground extraction using iterated graph cuts, ACM Trans. Gr., 23, 309, 10.1145/1015706.1015720
Jia, 2014, Caffe: convolutional architecture for fast feature embedding, 675
Pfister, 2000, Surfels: surface elements as rendering primitives, 335
Sumner, 2007, Embedded deformation for shape manipulation, 26, 80
Ozuysal, 2010, Fast keypoint recognition using random ferns, IEEE Trans. Pattern Anal. Mach. Intell., 32, 448, 10.1109/TPAMI.2009.23
Fischler, 1981, Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, Commun. ACM, 24, 381, 10.1145/358669.358692
Hoiem, 2009, Pascal VOC 2008 challenge, PASCAL challenge workshop in ECCV
Silberman, 2012, Indoor segmentation and support inference from RGBD images, 746
He, 2017, Mask R-CNN, 2980
Chen, 2018, Encoder–decoder with atrous separable convolution for semantic image segmentation
Everingham, 2015, The pascal visual object classes challenge: a retrospective, International Journal of Computer Vision (IJCV), 111, 98, 10.1007/s11263-014-0733-5