A Convolutional Neural Network-Based 3D Semantic Labeling Method for ALS Point Clouds

Remote Sensing - Tập 9 Số 9 - Trang 936
Zhishuang Yang1, Wanshou Jiang1, Bo Xu1, Quansheng Zhu1, San Jiang1, Wei Huang1
1State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430000, Hubei, China

Tóm tắt

3D semantic labeling is a fundamental task in airborne laser scanning (ALS) point clouds processing. The complexity of observed scenes and the irregularity of point distributions make this task quite challenging. Existing methods rely on a large number of features for the LiDAR points and the interaction of neighboring points, but cannot exploit the potential of them. In this paper, a convolutional neural network (CNN) based method that extracts the high-level representation of features is used. A point-based feature image-generation method is proposed that transforms the 3D neighborhood features of a point into a 2D image. First, for each point in the ALS data, the local geometric features, global geometric features and full-waveform features of its neighboring points within a window are extracted and transformed into an image. Then, the feature images are treated as the input of a CNN model for a 3D semantic labeling task. Finally, to allow performance comparisons with existing approaches, we evaluate our framework on the publicly available datasets provided by the International Society for Photogrammetry and Remote Sensing Working Groups II/4 (ISPRS WG II/4) benchmark tests on 3D labeling. The experiment results achieve 82.3% overall accuracy, which is the best among all considered methods.

Từ khóa


Tài liệu tham khảo

Bishop, C.M. (2006). Pattern Recognition and Machine Learning (Information Science and Statistics), Springer.

Chan, 2008, Evaluation of random forest and adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery, Remote Sens. Environ., 112, 2999, 10.1016/j.rse.2008.02.011

Mountrakis, 2011, Support vector machines in remote sensing: A review, ISPRS J. Photogramm. Remote Sens., 66, 247, 10.1016/j.isprsjprs.2010.11.001

Breiman, 2001, Random forests, Mach. Learn., 45, 5, 10.1023/A:1010933404324

Gislason, 2003, Random forests for land cover classification, Pattern Recognit. Lett., 27, 294, 10.1016/j.patrec.2005.08.011

Chehata, 2009, Airborne LiDAR feature selection for urban classification using random forests, Geomat. Inf. Sci. Wuhan Univ., 38, 207

Mallet, C. (2010). Analysis of Full-Waveform LiDAR Data for Urban Area Mapping. [Ph.D Thesis, Télécom ParisTech].

Mallet, 2011, Relevance assessment of full-waveform LiDAR data for urban area classification, ISPRS J. Photogramm. Remote Sens., 66, S71, 10.1016/j.isprsjprs.2011.09.008

Weinmann, 2015, Semantic point cloud interpretation based on optimal neighborhoods, relevant features and efficient classifiers, ISPRS J. Photogramm. Remote Sens., 105, 286, 10.1016/j.isprsjprs.2015.01.016

Geman, 1984, Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images, IEEE Trans. Pattern Anal. Mach. Intell., 6, 721, 10.1109/TPAMI.1984.4767596

Kumar, 2006, Discriminative random fields, Int. J. Comput. Vis., 68, 179, 10.1007/s11263-006-7007-9

Anguelov, D., Taskarf, B., Chatalbashev, V., Koller, D., Gupta, D., Heitz, G., and Ng, A. (2005, January 20–25). In discriminative learning of Markov random fields for segmentation of 3D scan data. Proceedings of the Computer Vision and Pattern Recognition, San Diego, CA, USA.

Shapovalov, 2010, Non-associative Markov networks for 3D point cloud classification, Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci., XXXVIII, 103

Rusu, R.B., Holzbach, A., Blodow, N., and Beetz, M. (2009, January 10–15). Fast geometric point labeling using conditional random fields. Proceedings of the Intelligent Robots and Systems, St. Louis, MO, USA.

Niemeyer, 2014, Contextual classification of LiDAR data and building object detection in urban areas, ISPRS J. Photogramm. Remote Sens., 87, 152, 10.1016/j.isprsjprs.2013.11.001

LeCun, 1989, Backpropagation applied to handwritten zip code recognition, Neural Comput., 1, 541, 10.1162/neco.1989.1.4.541

Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3–8). Imagenet classification with deep convolutional neural networks. Proceedings of the Neural Information Processing Systems Conference, Lake Tahoe, NV, USA.

Simonyan, K., and Zisserman, A. (2017, September 08). Very Deep Convolutional Networks for Large-Scale Image Recognition. Available online: https://arxiv.org/abs/1409.1556.

Szegedy, C., Liu, W., Jia, Y., and Sermanet, P. (2015, January 8–10). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.

Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 24–27). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.

Boulch, A., Saux, B.L., and Audebert, N. (2017). Unstructured point cloud semantic labeling using deep segmentation networks. Eurographics Workshop on 3D Object Retrieval, The Eurographics Association.

Hackel, 2016, Fast semantic segmentation of 3D point clouds with strongly varying density, ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci., 3, 177, 10.5194/isprs-annals-III-3-177-2016

Hu, X., and Yuan, Y. (2016). Deep-learning-based classification for DTM extraction from ALS point cloud. Remote Sens., 8.

Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., and Darrell, T. (2014, January 3–7). Caffe: Convolutional architecture for fast feature embedding. Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, FL, USA.

Nair, V., and Hinton, G.E. (2010, January 21–24). Rectified linear units improve restricted boltzmann machines. Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel.

Ioffe, S., and Szegedy, C. (2015, January 6–11). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning, Lile, France.

Bishop, C.M. (1995). Neural Networks for Pattern Recognition, Oxford University Press.

Mallet, 2011, Dimensionality based scale selection in 3D LiDAR point clouds, ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci., 38, W12

Xu, G., and Zhang, Z. (1996). Epipolar geometry in stereo, motion and object recognition. Comput. Imaging Vis., 6.

Kraus, 1998, Determination of terrain models in wooded areas with airborne laser scanner data, ISPRS J. Photogramm. Remote Sens., 53, 193, 10.1016/S0924-2716(98)00009-4

Wagner, 2006, Gaussian decomposition and calibration of a novel small-footprint full-waveform digitising airborne laser scanner, ISPRS J. Photogramm. Remote Sens., 60, 100, 10.1016/j.isprsjprs.2005.12.001

International Society for Photogrammetry and Remote Sensing (2017, September 08). 3D Semantic Labeling Contest. Available online: http://www2.isprs.org/commissions/comm3/wg4/3d-semantic-labeling.html.

Cramer, 2010, The DGPF-test on digital airborne camera evaluation—Overview and test design, Photogramm. Fernerkund. Geoinf., 2010, 73, 10.1127/1432-8364/2010/0041

Chen, C., and Breiman, L. (2004). Using Random Forest to Learn Imbalanced Data, University of California, Berkeley. 666.

Ramiya, 2016, A supervoxel-based spectro-spatial approach for 3d urban point cloud labelling, Int. J. Remote Sens., 37, 4172, 10.1080/01431161.2016.1211348

Steinsiek, M., Polewski, P., Yao, W., and Krzystek, P. (2017, January 8–10). Semantische analyse von ALS- und MLS-daten in urbanen gebieten mittels conditional random fields. Proceedings of the 37. Wissenschaftlich-Technische Jahrestagung der DGPF, Würzburg, Germany.

Horvat, 2016, Context-dependent detection of non-linearly distributed points for vegetation classification in airborne lidar, ISPRS J. Photogramm. Remote Sens., 116, 1, 10.1016/j.isprsjprs.2016.02.011

Niemeyer, 2016, Hierarchical higher order crf for the classification of airborne lidar point clouds in urban areas, ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci., XLI-B3, 655

Guinard, S., and Landrieu, L. (2017, January 6–9). Weakly supervised segmentation-aided classification of urban scenes from 3D lidar point clouds. Proceedings of the ISPRS Workshop 2017, Hannover, Germany.