A multi-scale fully convolutional network for semantic labeling of 3D point clouds
Tóm tắt
Từ khóa
Tài liệu tham khảo
Audebert, N., Saux, B.L., Lefèvre, S., 21–23 November 2016. Semantic segmentation of earth observation data using multimodal and multi-scale deep networks. In: Proceedings of the Asian Conference on Computer Vision (ACCV). Taipei, Taiwan.
Axelsson, P., 2000. DEM generation from laser scanner data using adaptive TIN models. In: ISPRS International Archives of Photogrammetry and Remote Sensing, vol. XXXIII-Part B4/1. pp. 111–118.
Badrinarayanan, 2017, Segnet: a deep convolutional encoder-decoder architecture for image segmentation, IEEE Trans. Pattern Anal. Mach. Intell., 39, 2481, 10.1109/TPAMI.2016.2644615
Bai, S., Bai, X., Zhou, Z., Zhang, Z., Jan Latecki, L., 2016. GIFT: a real-time and scalable 3D shape search engine. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 5023–5032.
Blomley, R., Jutzi, B., Weinmann, M., September 2016. 3D semantic labeling of ALS point clouds by exploiting multi-scale, multi-type neighborhoods for feature extraction. In: Proceedings of the International Conference on Geographic Object-Based Image Analysis (GEOBIA). Enschede, The Netherlands, pp. 1–8.
Boulch, A., Saux, B.L., Audebert, N., April 2017. Unstructured point cloud semantic labeling using deep segmentation networks. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval, vol. 2. pp. 17–24.
Boykov, 2001, Fast approximate energy minimization via graph cuts, IEEE Trans. Pattern Anal. Mach. Intell., 23, 1222, 10.1109/34.969114
Caltagirone, L., Scheidegger, S., Svensson, L., Wahde, M., June 2017. Fast LIDAR-based road detection using fully convolutional neural networks. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV). pp. 1019–1024.
Chehata, N., Guo, L., Mallet, C., 2009. Airborne lidar feature selection for urban classification using random forests. In: International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XXXVIII-Part 3.
Chollet, F., et al., 2015. Keras. <https://github.com/fchollet/keras>.
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L., 2009. Imagenet: a large-scale hierarchical image database. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 248–255.
Geiger, 2013, Vision meets robotics: the KITTI dataset, Int. J. Robot. Res., 32, 1231, 10.1177/0278364913491297
Girshick, R., Donahue, J., Darrell, T., Malik, J., 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 580–587.
Glorot, X., Bengio, Y., 2010. Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, vol. 9. pp. 249–256.
Golovinskiy, A., Kim, V.G., Funkhouser, T., September 2009. Shape-based recognition of 3D point clouds in urban environments. In: Proceedings of the 12th International Conference on Computer Vision (ICCV). pp. 2154–2161.
Grilli, E., Menna, F., Remondino, F., 2017. A review of point clouds segmentation and classification algorithms. In: ISPRS International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XLII-2/W3. Nafplio, Greece, pp. 339–344.
Haala, N., Brenner, C., Anders, K.-H., 1998. 3D urban GIS from laser altimeter and 2D map data. In: ISPRS International Archives of Photogrammetry, Remote Sensing & Spatial Information Sciences, vol. 32. pp. 339–346.
Hackel, T., Savinov, N., Ladicky, L., Wegner, J.D., Schindler, K., Pollefeys, M., 2017. SEMANTIC3D.NET: a new large-scale point cloud classification benchmark. In: ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. IV-1-W1. pp. 91–98.
Hackel, T., Wegner, J.D., Schindler, K., 2016. Fast semantic segmentation of 3D point clouds with strongly varying density. In: ISPRS Annals of Photogrammetry, Remote Sensing & Spatial Information Sciences, vol. III-3. Prague, Czech Republic, pp. 177–184.
He, K., Zhang, X., Ren, S., Sun, J., December 2015. Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV). pp. 1026–1034.
He, K., Zhang, X., Ren, S., Sun, J., 2016. Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 770–778.
Huang, 2016, Point cloud labeling using 3D convolutional neural network, 2670
Hug, C., Wehr, A., 1997. Detecting and identifying topographic objects in imaging laser altimeter data. In: ISPRS International Archives of Photogrammetry and Remote Sensing, vol. 32, Part 3–4W2. pp. 19–26.
Ioffe, S., Szegedy, C., 2015. Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on Machine Learning, vol. 37. pp. 448–456.
Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K., 2015. Spatial transformer networks. In: Advances in Neural Information Processing Systems 28. pp. 2017–2025.
Jochem, A., Höfle, B., Hollaus, M., Rutzinger, M., September 2009. Object detection in airborne LIDAR data for improved solar radiation modeling in urban areas. In: International Archives of Photogrammetry, Remote Sensing, and Spatial Information Sciences, vol. 38, Part 3/W8. Paris, France, pp. 1–6.
Kingma, D.P., Ba, J., 2015. Adam: a method for stochastic optimization. In: International Conference on Learning Representations.
Krizhevsky, A., Sutskever, I., Hinton, G.E., 2012. Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, vol. 25. pp. 1097–1105.
LeCun, 1989, Backpropagation applied to handwritten zip code recognition, Neural Comput., 1, 541, 10.1162/neco.1989.1.4.541
Li, B., 2017. 3D fully convolutional network for vehicle detection in point cloud. In: Proceedings of the International Conference on Intelligent Robots and Systems (IROS).
Lin, 2014, Eigen-feature analysis of weighted covariance matrices for LiDAR point cloud classification, ISPRS J. Photogramm. Remote Sens., 94, 70, 10.1016/j.isprsjprs.2014.04.016
Lin, M., Chen, Q., Yan, S., 2014b. Network in network. In: International Conference on Learning Representations (ICLR).
Liu, Y., Piramanayagam, S., Monteiro, S.T., Saber, E., July 2017. Dense semantic labeling of very-high-resolution aerial imagery and lidar with fully-convolutional neural networks and higher-order CRFS. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). pp. 1561–1570.
Long, J., Shelhamer, E., Darrell, T., 2015. Fully convolutional networks for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, (CVPR). Boston, MA, pp. 3431–3440.
Mallet, C., 2010. Analysis of Full-waveform Lidar Data for Urban Area Mapping (Ph.D. thesis). Télécom ParisTech.
Maturana, 2015, VoxDet: a 3D convolutional neural network for real-time object recognition, 922
Moussa, A., El-Sheimy, N., September 2010. Automatic classification and 3D modeling of lidar data. In: Proceedings of the ISPRS Commission III symposium, vol. 38. ISPRS, Saint-Mand, France, pp. 155–159.
Niemeyer, 2014, Contextual classification of lidar data and building object detection in urban areas, ISPRS J. Photogramm. Remote Sens., 87, 152, 10.1016/j.isprsjprs.2013.11.001
Niemeyer, J., Rottensteiner, F., Soergel, U., Heipke, C., July 2016. Hierarchical higher order CRF for the classification of airborne LIDAR point clouds in urban areas. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XLI-B3. Czech Republic, pp. 655–662.
Qi, C.R., Su, H., Mo, K., Guibas, L.J., 2017. PointNet: deep learning on point sets for 3D classification and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
Qi, C.R., Su, H., Nießner, M., Dai, A., Yan, M., Guibas, L.J., 2016. Volumetric and multi-view CNNs for object classification on 3D data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 5648–5656.
Ramiya, A., Nidamanuri, R.R., Krishnan, R., December 2014. Semantic labelling of urban point cloud data. In: International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XL-8. Hyderbad, India, pp. 907–911.
Ryoo, M.S., Rothrock, B., Matthies, L.H., 2015. Pooled motion features for first-person videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 896–904.
Savva, M., Yu, F., Su, H., Aono, M., Chen, B., Cohen-Or, D., Deng, W., Su, H., Bai, S., Bai, X., et al., 2016. SHREC16 track: large-scale 3d shape retrieval from ShapeNet core55. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval. pp. 89–98.
Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E., 2015. Multi-view convolutional neural networks for 3D shape recognition. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 945–953.
Tapper, G., 2016. Extraction of DTM from Satellite Images Using Neural Networks (Ph.D. thesis). Linkoping University.
Wang, N., Yeung, D.-Y., 2013. Learning a deep compact image representation for visual tracking. In: Advances in Neural Information Processing Systems, vol. 26. pp. 809–817.
Weinmann, M., Jutzi, B., Mallet, C., Aug. 2014. Semantic 3D scene interpretation: a framework combining optimal neighborhood size selection with relevant features. In: ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. II-3. pp. 181–188.
Xing, S., Li, P., Xu, Q., Wang, D., Li, P., Sep. 2017. Surface fitting filtering of LIDAR point cloud with waveform information. In: ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. IV-2. pp. 179–184.
Yang, 2017, A convolutional neural network-based 3D semantic labeling method for ALS point clouds, Remote Sens., 9
Yosinski, J., Clune, J., Nguyen, A., Fuchs, T., Lipson, H., 2015. Understanding neural networks through deep visualization. Available from: <1506.06579>.
Yousefhussien, 2016, Online tracking using saliency, 1
Yunfei, B., Guoping, L., Chunxiang, C., Hao, Z., Qisheng, H., Linyan, B., Chaoyi, C., 2008. Classification of lidar point cloud and generation of DTM from lidar height and intensity data in forested area. In: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XXVII-7. pp. 313–318.
