UrbanComp Lab | 学习资料库

返回论文库

论文

arXiv

SpatialIntelligence

Trajectory

Mobility

SpatioTemporalKG

LLM

中文标题

基于多模态时空知识的通用下一位置推荐方法

English Title

Learning Multi-Modal Mobility Dynamics for Generalized Next Location Recommendation

Junshu Dai, Yu Wang, Tongya Zheng, Wei Ji, Qinghong Guo, Ji Cao, Jie Song, Canghong Jin, Mingli Song

发布时间

2025/12/27 22:23:04

来源类型

preprint

语言

摘要

中文对照

人类移动行为的精准预测已产生显著的社会经济影响，例如位置推荐与疏散建议。然而，现有方法普遍存在泛化能力不足的问题：单模态方法受限于数据稀疏性与固有偏差，而多模态方法难以有效捕捉由静态多模态表征与时空动态之间的语义鸿沟所导致的移动动态特征。为此，我们引入多模态时空知识以刻画移动动态，用于位置推荐任务，提出\textbf{M}ulti-\textbf{M}odal \textbf{Mob}ility（\textbf{M}$^3$\textbf{ob}）模型。首先，通过利用大语言模型增强的时空知识图谱（STKG）所捕获的功能语义与时空知识，构建统一的时空关系图（STRG）以实现多模态表征。其次，设计一种门控机制融合不同模态的时空图表示，并提出一种基于STKG的跨模态对齐方法，将时空动态知识注入静态图像模态。在六个公开数据集上的大量实验表明，所提方法不仅在常规场景下持续提升性能，且在异常场景下展现出显著的泛化能力。

English Original

The precise prediction of human mobility has produced significant socioeconomic impacts, such as location recommendations and evacuation suggestions. However, existing methods suffer from limited generalization capability: unimodal approaches are constrained by data sparsity and inherent biases, while multi-modal methods struggle to effectively capture mobility dynamics caused by the semantic gap between static multi-modal representation and spatial-temporal dynamics. Therefore, we leverage multi-modal spatial-temporal knowledge to characterize mobility dynamics for the location recommendation task, dubbed as \textbf{M}ulti-\textbf{M}odal \textbf{Mob}ility (\textbf{M}$^3$\textbf{ob}). First, we construct a unified spatial-temporal relational graph (STRG) for multi-modal representation, by leveraging the functional semantics and spatial-temporal knowledge captured by the large language models (LLMs)-enhanced spatial-temporal knowledge graph (STKG). Second, we design a gating mechanism to fuse spatial-temporal graph representations of different modalities, and propose an STKG-guided cross-modal alignment to inject spatial-temporal dynamic knowledge into the static image modality. Extensive experiments on six public datasets show that our proposed method not only achieves consistent improvements in normal scenarios but also exhibits significant generalization ability in abnormal scenarios.

资源链接

论文 PDFarxiv.org/pdf/2512.22605v1 论文 PDFarxiv.org/pdf/2512.22605v1 原始来源页面arxiv.org/abs/2512.22605v1

元数据

arXiv2512.22605v1

来源arXiv

类型论文

抽取状态raw

关键词

SpatialIntelligence

Trajectory

Mobility

SpatioTemporalKG

LLM

cs.AI

cs.CV