论文
arXiv
GeoAI
GIS
RemoteSensing
EarthObservation
中文标题
PCFootprint:面向航拍LiDAR点云的矢量化建筑轮廓提取的大规模数据集与基准
English Title
PCFootprint: A Large-Scale Dataset and Benchmark for Vectorized Building Footprint Extraction from Aerial LiDAR Point Clouds
Haoyuan Shen, Kuihao Wang, Ruisheng Wang, Yujun Liu
发布时间
2026/6/19 00:38:35
来源类型
preprint
语言
en
摘要
中文对照

建筑轮廓提取是摄影测量学、遥感与计算机视觉中的基础任务。近期基于图像的方法在从高分辨率光学影像中提取矢量化轮廓方面取得了显著进展。然而,光学影像本身易受遮挡、透视畸变及残余高程位移影响,导致轮廓提取不完整或错位。此外,缺乏显式高程信息限制了其在建筑细节层次(Level of Detail)建模中的直接应用。本文提出PCFootprint,这是首个面向机载激光扫描(airborne laser scanning)点云的建筑轮廓提取大规模公开数据集。PCFootprint包含来自爱沙尼亚土地与空间发展局的33,000个瓦片,覆盖多样化的城乡地理景观;每个瓦片尺寸为128 m × 128 m,并配有与点云系统对齐的矢量化轮廓标注。该数据集另含一个3,000瓦片的跨域测试集,用于评估模型在不同地理区域间的泛化能力。我们通过评估主流方法建立了全面的基准。实验结果揭示了复杂地理空间环境中存在的若干显著挑战,包括类内差异大、数据不平衡以及噪声干扰严重。我们相信PCFootprint将推动建筑建模、城市场景理解与地理空间分析等方向的未来研究。PCFootprint数据集已公开发布于\url{https://huggingface.co/datasets/Haoyuan-Shen/PCFootprint}。

English Original

Building footprint extraction is a fundamental task in photogrammetry, remote sensing, and computer vision. Recent image-based methods have achieved remarkable progress in extracting vectorized footprints from high-resolution optical imagery. However, optical imagery inherently susceptible to occlusions, perspective distortions, and residual relief displacement, yielding incomplete or misaligned footprint extraction. Furthermore, the lack of explicit elevation information limits its direct applicability to Level of Detail building modeling. In this paper, we present PCFootprint, the first large-scale public dataset for footprint extraction from airborne laser scanning point clouds. PCFootprint comprises \num{33000} tiles derived from the Estonian Land and Spatial Development Board, covering diverse urban and rural landscapes. Each tile spans \qtyproduct{128 x 128}{\m} with systematically aligned vectorized footprints aligned to point clouds. The dataset includes a \num{3000} tiles cross-domain test set for evaluating generalization across geographic regions. We establish comprehensive benchmarks by evaluating mainstream methods. Experimental results reveal significant challenges including high intra-class variance, data imbalance, and noise across complex geospatial environments. We believe PCFootprint will advance future research in building modeling, urban scene understanding, and geospatial analysis. The PCFootprint dataset is publicly available at \url{https://huggingface.co/datasets/Haoyuan-Shen/PCFootprint}.

元数据
arXiv2606.20455v1
来源arXiv
类型论文
抽取状态raw
关键词
GeoAI
GIS
RemoteSensing
EarthObservation
cs.CV