Classification of ALS Point Clouds Using End-to-End Deep Learning

Lukas Winiwarter (1, 2), Gottfried Mandlburger (1, 3), Stefan Schmohl (3), Norbert Pfeifer (1)
(1) Department of Geodesy and Geoinformation (E120), Technische Universität Wien, Vienna, Austria
(2) 3D Geospatial Data Processing Group (3DGeo), Institute of Geography, Heidelberg University, Heidelberg, Germany
(3) Institute for Photogrammetry, University of Stuttgart, Stuttgart, Germany

Abstract

Deep learning, referring to artificial neural networks with multiple layers, is widely used for classification tasks in many disciplines, including computer vision. The most popular type is the Convolutional Neural Network (CNN), commonly applied to 2D image data. However, CNNs are difficult to adapt to irregular data such as point clouds. PointNet, on the other hand, enables the derivation of features from the geometric distribution of a set of points in n-dimensional space using a neural network. We apply PointNet at multiple scales to automatically learn a representation of local neighbourhoods in an end-to-end fashion, optimised for semantic labelling of 3D point clouds acquired by Airborne Laser Scanning (ALS). The results are comparable to those obtained with manually crafted features, suggesting that these neighbourhoods are represented successfully. On the ISPRS 3D Semantic Labelling benchmark, we achieve an overall accuracy of 80.6%, a mid-field result. Experiments on a larger dataset, the 2011 ALS point cloud of the federal state of Vorarlberg, show overall accuracies of up to 95.8% over large-scale built-up areas. The separation of low vegetation and ground points is less accurate, presumably because of invalid assumptions about the distribution of classes in space, especially in high alpine regions. We conclude that the end-to-end approach, which allows training on a wide variety of classification problems without expert knowledge about neighbourhood features, can also be applied successfully to single-point-based classification of ALS point clouds.
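The idea of deriving a feature from an unordered set of points, evaluated over several neighbourhood sizes, can be illustrated with a minimal sketch. It is not the network used in this work: it assumes a single shared linear layer with ReLU followed by an element-wise maximum over the set, spherical neighbourhoods, and untrained random weights. The function names, the radii and the layer width are hypothetical and chosen only for illustration.

```python
import math
import random

def shared_layer(point, weights, biases):
    """Apply one linear layer + ReLU to a single point (same weights for every point)."""
    return [max(0.0, sum(w * x for w, x in zip(row, point)) + b)
            for row, b in zip(weights, biases)]

def pointnet_feature(points, weights, biases):
    """Order-independent set feature: per-point layer, then element-wise max over the set."""
    per_point = [shared_layer(p, weights, biases) for p in points]
    return [max(channel) for channel in zip(*per_point)]

def multi_scale_feature(cloud, centre, radii, weights, biases):
    """Concatenate set features of spherical neighbourhoods of increasing radius."""
    feature = []
    for r in radii:
        # Neighbour coordinates are expressed relative to the centre point.
        neighbours = [tuple(c - q for c, q in zip(p, centre))
                      for p in cloud if math.dist(p, centre) <= r]
        feature += pointnet_feature(neighbours, weights, biases)
    return feature

rng = random.Random(0)
# Synthetic stand-in for an ALS tile: 200 random (x, y, z) points.
cloud = [(rng.uniform(-5, 5), rng.uniform(-5, 5), rng.uniform(0, 3)) for _ in range(200)]
weights = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(4)]  # 4 channels, untrained
biases = [0.0] * 4
centre = cloud[0]          # the centre is part of the cloud, so no neighbourhood is empty
radii = [1.0, 2.0, 4.0]    # hypothetical scales

f1 = multi_scale_feature(cloud, centre, radii, weights, biases)

# Shuffling the input order leaves the feature unchanged, because max ignores order.
shuffled = cloud[:]
rng.shuffle(shuffled)
f2 = multi_scale_feature(shuffled, centre, radii, weights, biases)
```

Because the maximum is a symmetric function, `f1` and `f2` are identical regardless of the order in which the points are stored, which is the property that makes this kind of feature suitable for irregular point clouds. In an end-to-end setting the weights would be learned jointly with the classifier instead of being drawn at random.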
