论文
arXiv
LLM
Multimodal
Agent
中文标题
分析多模态大语言模型代理在城市感知任务中生成解释时的虚拟角色效应
English Title
Analyzing Persona Effects in Generated Explanations from Multimodal LLM Agents in Urban Perception
Neemias da Silva, Myriam Delgado, Rodrigo Minetto, Daniel Silver, Thiago H Silva
发布时间
2026/5/28 04:11:42
来源类型
preprint
语言
en
摘要
中文对照

本研究探讨虚拟角色提示(persona prompting)如何影响多模态大语言模型(multimodal LLM)在城市感知场景下所生成的语言。我们基于1,200个带虚拟角色条件的代理与两个无虚拟角色对照组所产生的59,808条标注,分析了不同虚拟角色下的图像描述(captions)、合理性说明(justifications)及感知标签(perception tags)。结果表明:不同虚拟角色在图像描述上呈现高度一致性;合理性说明则随社会经济属性与政治属性呈现系统性差异;感知标签未显示出统计显著的虚拟角色相关差异,但可观察到微弱的效应趋势。主题分析进一步揭示,不同虚拟角色在解释相同场景时侧重不同的评价性主题。

English Original

We study how persona prompting shapes language generated by multimodal large language models in an urban perception setting. Using 59,808 annotations from 1,200 persona-conditioned agents and two no-persona settings, we analyze captions, justifications, and perception tags across personas. Results indicate strong convergence in captions for different personas, whereas justifications display systematic variation associated with socioeconomic and political attributes, while perception tags show no statistically significant persona-related differences, though effect trends are observed. Topic analysis further reveals that personas emphasize different evaluative themes when interpreting the same scenes.

元数据
arXiv2605.29064v1
来源arXiv
类型论文
抽取状态raw
关键词
LLM
Multimodal
Agent
cs.CL
cs.CV
cs.HC
cs.MA