Classification of ALS Point Clouds Using End-to-End Deep Learning

PFG – Journal of Photogrammetry, Remote Sensing and Geoinformation Science - Volume 87 - Pages 75-90 - 2019

Lukas Winiwarter¹,², Gottfried Mandlburger¹,³, Stefan Schmohl³, Norbert Pfeifer¹

¹Department of Geodesy and Geoinformation (E120), Technische Universität Wien, Vienna, Austria

²3D Geospatial Data Processing Group (3DGeo), Institute of Geography, Heidelberg University, Heidelberg, Germany

³Institute for Photogrammetry, University of Stuttgart, Stuttgart, Germany

Abstract

Deep learning, referring to artificial neural networks with multiple layers, is widely used for classification tasks in many disciplines, including computer vision. The most popular type is the Convolutional Neural Network (CNN), commonly applied to 2D image data. However, CNNs are difficult to adapt to irregular data such as point clouds. PointNet, on the other hand, uses a neural network to derive features from the geometric distribution of a set of points in nD space. We use PointNet on multiple scales to automatically learn a representation of local neighbourhoods in an end-to-end fashion, optimised for semantic labelling of 3D point clouds acquired by Airborne Laser Scanning (ALS). The results are comparable to those obtained with manually crafted features, suggesting that these neighbourhoods are represented successfully. On the ISPRS 3D Semantic Labelling benchmark, we achieve 80.6% overall accuracy, a mid-field result. Investigation of a larger dataset, the 2011 ALS point cloud of the federal state of Vorarlberg, shows overall accuracies of up to 95.8% over large-scale built-up areas. Lower accuracy is achieved for the separation of low vegetation and ground points, presumably because of invalid assumptions about the distribution of classes in space, especially in high alpine regions. We conclude that the end-to-end system, which allows training on a wide variety of classification problems without the need for expert knowledge about neighbourhood features, can also be applied successfully to single-point-based classification of ALS point clouds.
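
The core idea behind PointNet-style neighbourhood features can be illustrated with a minimal sketch: the same small network is applied to every point, and a symmetric function (here a channel-wise maximum) collapses the set into one descriptor that does not depend on point order. The sketch below is not the authors' architecture. The spherical neighbourhoods, the radii, the single linear layer with ReLU, the sharing of weights across scales, and all function names are assumptions chosen for illustration, and the weights are random rather than trained.

```python
import random

def shared_mlp(point, weights, biases):
    """One linear layer plus ReLU, applied to a single point.
    The same weights are reused for every point (assumed toy layer)."""
    return [
        max(0.0, sum(w * x for w, x in zip(row, point)) + b)
        for row, b in zip(weights, biases)
    ]

def pointnet_feature(points, weights, biases):
    """Order-invariant descriptor: per-point layer, then channel-wise max."""
    per_point = [shared_mlp(p, weights, biases) for p in points]
    return [max(channel) for channel in zip(*per_point)]

def multi_scale_feature(centre, cloud, radii, weights, biases):
    """Concatenate descriptors of spherical neighbourhoods at several radii.
    The radii are hypothetical; the centre point must be part of the cloud
    so that every neighbourhood contains at least one point."""
    feature = []
    for r in radii:
        # Neighbour coordinates are expressed relative to the centre point.
        nbrs = [
            tuple(c - q for c, q in zip(p, centre))
            for p in cloud
            if sum((c - q) ** 2 for c, q in zip(p, centre)) <= r * r
        ]
        feature.extend(pointnet_feature(nbrs, weights, biases))
    return feature

rng = random.Random(0)
# Synthetic stand-in for an ALS tile: 200 random (x, y, z) points.
cloud = [(rng.uniform(-5, 5), rng.uniform(-5, 5), rng.uniform(0, 3))
         for _ in range(200)]
weights = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(8)]  # untrained
biases = [0.0] * 8
radii = (1.0, 2.0, 4.0)
centre = cloud[0]

feat = multi_scale_feature(centre, cloud, radii, weights, biases)

# Shuffling the input points must not change the descriptor.
shuffled = cloud[:]
rng.shuffle(shuffled)
feat_shuffled = multi_scale_feature(centre, shuffled, radii, weights, biases)
```

In an end-to-end setting, a descriptor of this kind would feed a per-point classifier and the layer weights would be learned from labelled data; the sketch only demonstrates the permutation invariance that makes the approach suitable for unordered point sets.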

