A multi-scale fully convolutional network for semantic labeling of 3D point clouds

ISPRS Journal of Photogrammetry and Remote Sensing - Volume 143 - Pages 191-204 - 2018

Mohammed Yousefhussien¹, David Kelbe², Emmett J. Ientilucci¹, Carl Salvaggio¹

¹Rochester Institute of Technology, Chester F. Carlson Center for Imaging Science, Rochester, NY, USA

²Oak Ridge National Laboratory, Geographic Information Science and Technology Group, Oak Ridge, TN, USA

References

Audebert, N., Saux, B.L., Lefèvre, S., 21–23 November 2016. Semantic segmentation of earth observation data using multimodal and multi-scale deep networks. In: Proceedings of the Asian Conference on Computer Vision (ACCV). Taipei, Taiwan.

Axelsson, P., 2000. DEM generation from laser scanner data using adaptive TIN models. In: ISPRS International Archives of Photogrammetry and Remote Sensing, vol. XXXIII-Part B4/1. pp. 111–118.

Badrinarayanan, 2017, Segnet: a deep convolutional encoder-decoder architecture for image segmentation, IEEE Trans. Pattern Anal. Mach. Intell., 39, 2481, 10.1109/TPAMI.2016.2644615

Bai, S., Bai, X., Zhou, Z., Zhang, Z., Jan Latecki, L., 2016. GIFT: a real-time and scalable 3D shape search engine. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 5023–5032.

Blomley, R., Jutzi, B., Weinmann, M., September 2016. 3D semantic labeling of ALS point clouds by exploiting multi-scale, multi-type neighborhoods for feature extraction. In: Proceedings of the International Conference on Geographic Object-Based Image Analysis (GEOBIA). Enschede, The Netherlands, pp. 1–8.

Boulch, A., Saux, B.L., Audebert, N., April 2017. Unstructured point cloud semantic labeling using deep segmentation networks. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval, vol. 2. pp. 17–24.

Boykov, 2001, Fast approximate energy minimization via graph cuts, IEEE Trans. Pattern Anal. Mach. Intell., 23, 1222, 10.1109/34.969114

Caltagirone, L., Scheidegger, S., Svensson, L., Wahde, M., June 2017. Fast LIDAR-based road detection using fully convolutional neural networks. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV). pp. 1019–1024.

Chehata, N., Guo, L., Mallet, C., 2009. Airborne lidar feature selection for urban classification using random forests. In: International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XXXVIII-Part 3.

Chollet, F., et al., 2015. Keras. <https://github.com/fchollet/keras>.

Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L., 2009. Imagenet: a large-scale hierarchical image database. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 248–255.

Geiger, 2013, Vision meets robotics: the KITTI dataset, Int. J. Robot. Res., 32, 1231, 10.1177/0278364913491297

Girshick, R., Donahue, J., Darrell, T., Malik, J., 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 580–587.

Glorot, X., Bengio, Y., 2010. Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, vol. 9. pp. 249–256.

Golovinskiy, A., Kim, V.G., Funkhouser, T., September 2009. Shape-based recognition of 3D point clouds in urban environments. In: Proceedings of the 12th International Conference on Computer Vision (ICCV). pp. 2154–2161.

Grilli, E., Menna, F., Remondino, F., 2017. A review of point clouds segmentation and classification algorithms. In: ISPRS International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XLII-2/W3. Nafplio, Greece, pp. 339–344.

Haala, N., Brenner, C., Anders, K.-H., 1998. 3D urban GIS from laser altimeter and 2D map data. In: ISPRS International Archives of Photogrammetry, Remote Sensing & Spatial Information Sciences, vol. 32. pp. 339–346.

Hackel, T., Savinov, N., Ladicky, L., Wegner, J.D., Schindler, K., Pollefeys, M., 2017. SEMANTIC3D.NET: a new large-scale point cloud classification benchmark. In: ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. IV-1-W1. pp. 91–98.

Hackel, T., Wegner, J.D., Schindler, K., 2016. Fast semantic segmentation of 3D point clouds with strongly varying density. In: ISPRS Annals of Photogrammetry, Remote Sensing & Spatial Information Sciences, vol. III-3. Prague, Czech Republic, pp. 177–184.

He, K., Zhang, X., Ren, S., Sun, J., December 2015. Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV). pp. 1026–1034.

He, K., Zhang, X., Ren, S., Sun, J., 2016. Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 770–778.

Huang, 2016, Point cloud labeling using 3D convolutional neural network, 2670

Hug, C., Wehr, A., 1997. Detecting and identifying topographic objects in imaging laser altimeter data. In: ISPRS International Archives of Photogrammetry and Remote Sensing, vol. 32, Part 3–4W2. pp. 19–26.

Ioffe, S., Szegedy, C., 2015. Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on Machine Learning, vol. 37. pp. 448–456.

Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K., 2015. Spatial transformer networks. In: Advances in Neural Information Processing Systems 28. pp. 2017–2025.

Jochem, A., Höfle, B., Hollaus, M., Rutzinger, M., September 2009. Object detection in airborne LIDAR data for improved solar radiation modeling in urban areas. In: International Archives of Photogrammetry, Remote Sensing, and Spatial Information Sciences, vol. 38, Part 3/W8. Paris, France, pp. 1–6.

Kingma, D.P., Ba, J., 2015. Adam: a method for stochastic optimization. In: International Conference on Learning Representations.

Krizhevsky, A., Sutskever, I., Hinton, G.E., 2012. Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, vol. 25. pp. 1097–1105.

LeCun, 1989, Backpropagation applied to handwritten zip code recognition, Neural Comput., 1, 541, 10.1162/neco.1989.1.4.541

Li, B., 2017. 3D fully convolutional network for vehicle detection in point cloud. In: Proceedings of the International Conference on Intelligent Robots and Systems (IROS).

Lin, 2014, Eigen-feature analysis of weighted covariance matrices for LiDAR point cloud classification, ISPRS J. Photogramm. Remote Sens., 94, 70, 10.1016/j.isprsjprs.2014.04.016

Lin, M., Chen, Q., Yan, S., 2014b. Network in network. In: International Conference on Learning Representations (ICLR).

Liu, Y., Piramanayagam, S., Monteiro, S.T., Saber, E., July 2017. Dense semantic labeling of very-high-resolution aerial imagery and lidar with fully-convolutional neural networks and higher-order CRFs. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). pp. 1561–1570.

Long, J., Shelhamer, E., Darrell, T., 2015. Fully convolutional networks for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, (CVPR). Boston, MA, pp. 3431–3440.

Mallet, C., 2010. Analysis of Full-waveform Lidar Data for Urban Area Mapping (Ph.D. thesis). Télécom ParisTech.

Maturana, 2015, VoxNet: a 3D convolutional neural network for real-time object recognition, 922

Moussa, A., El-Sheimy, N., September 2010. Automatic classification and 3D modeling of lidar data. In: Proceedings of the ISPRS Commission III symposium, vol. 38. ISPRS, Saint-Mandé, France, pp. 155–159.

Niemeyer, 2014, Contextual classification of lidar data and building object detection in urban areas, ISPRS J. Photogramm. Remote Sens., 87, 152, 10.1016/j.isprsjprs.2013.11.001

Niemeyer, J., Rottensteiner, F., Soergel, U., Heipke, C., July 2016. Hierarchical higher order CRF for the classification of airborne LIDAR point clouds in urban areas. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XLI-B3. Czech Republic, pp. 655–662.

Qi, C.R., Su, H., Mo, K., Guibas, L.J., 2017. PointNet: deep learning on point sets for 3D classification and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Qi, C.R., Su, H., Nießner, M., Dai, A., Yan, M., Guibas, L.J., 2016. Volumetric and multi-view CNNs for object classification on 3D data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 5648–5656.

Ramiya, A., Nidamanuri, R.R., Krishnan, R., December 2014. Semantic labelling of urban point cloud data. In: International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XL-8. Hyderabad, India, pp. 907–911.

Ryoo, M.S., Rothrock, B., Matthies, L.H., 2015. Pooled motion features for first-person videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 896–904.

Savva, M., Yu, F., Su, H., Aono, M., Chen, B., Cohen-Or, D., Deng, W., Su, H., Bai, S., Bai, X., et al., 2016. SHREC16 track: large-scale 3d shape retrieval from ShapeNet core55. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval. pp. 89–98.

Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E., 2015. Multi-view convolutional neural networks for 3D shape recognition. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 945–953.

Tapper, G., 2016. Extraction of DTM from Satellite Images Using Neural Networks (Ph.D. thesis). Linköping University.

Wang, N., Yeung, D.-Y., 2013. Learning a deep compact image representation for visual tracking. In: Advances in Neural Information Processing Systems, vol. 26. pp. 809–817.

Weinmann, M., Jutzi, B., Mallet, C., Aug. 2014. Semantic 3D scene interpretation: a framework combining optimal neighborhood size selection with relevant features. In: ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. II-3. pp. 181–188.

Xing, S., Li, P., Xu, Q., Wang, D., Li, P., Sep. 2017. Surface fitting filtering of LIDAR point cloud with waveform information. In: ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. IV-2. pp. 179–184.

Yang, 2017, A convolutional neural network-based 3D semantic labeling method for ALS point clouds, Remote Sens., 9

Yosinski, J., Clune, J., Nguyen, A., Fuchs, T., Lipson, H., 2015. Understanding neural networks through deep visualization. Available from: arXiv:1506.06579.

Yousefhussien, 2016, Online tracking using saliency, 1

Yunfei, B., Guoping, L., Chunxiang, C., Hao, Z., Qisheng, H., Linyan, B., Chaoyi, C., 2008. Classification of lidar point cloud and generation of DTM from lidar height and intensity data in forested area. In: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XXVII-7. pp. 313–318.
