🤖 AI Summary
Current deep learning methods for 3D point clouds face significant deployment challenges in urban and environmental applications: existing research overemphasizes architectural innovation while neglecting practical constraints—including ultra-large scale, non-uniform density, multimodality, and scene diversity. This paper pioneers a reverse survey paradigm (“application scenario → algorithmic requirement”) to systematically evaluate the adaptability and limitations of representative architectures—PointNet, PointCNN, and graph neural networks—across key tasks: point cloud completion, registration, semantic segmentation, and reconstruction. We benchmark their performance on mainstream datasets, identifying insufficient generalization as a root cause. Furthermore, we explicitly characterize the gap between algorithmic innovation and engineering deployment. Finally, we propose a co-design pathway that jointly optimizes model lightweighting, cross-density robustness, and multimodal fusion—thereby providing both theoretical foundations and practical guidelines for deployable point cloud intelligence.
📝 Abstract
Point cloud processing as a fundamental task in the field of geomatics and computer vision, has been supporting tasks and applications at different scales from air to ground, including mapping, environmental monitoring, urban/tree structure modeling, automated driving, robotics, disaster responses etc. Due to the rapid development of deep learning, point cloud processing algorithms have nowadays been almost explicitly dominated by learning-based approaches, most of which are yet transitioned into real-world practices. Existing surveys primarily focus on the ever-updating network architecture to accommodate unordered point clouds, largely ignoring their practical values in typical point cloud processing applications, in which extra-large volume of data, diverse scene contents, varying point density, data modality need to be considered. In this paper, we provide a meta review on deep learning approaches and datasets that cover a selection of critical tasks of point cloud processing in use such as scene completion, registration, semantic segmentation, and modeling. By reviewing a broad range of urban and environmental applications these tasks can support, we identify gaps to be closed as these methods transformed into applications and draw concluding remarks in both the algorithmic and practical aspects of the surveyed methods.