News
Google Research Blog
AI
Industry
Agent
Platform
Title
ReasoningBank: Enabling agents to learn from experience
Published
2026/4/22 00:42:22
Source type
blog
Language
en
Summary

ReasoningBank is a novel agent memory framework that distills generalizable reasoning strategies from both successful and failed experiences, enabling an agent to keep learning from experience after deployment. Agents are becoming increasingly crucial in tackling complex real-world tasks, ranging from general web navigation to assisting with extensive software engineering codebases.

Body

ReasoningBank is a novel agent memory framework that distills generalizable reasoning strategies from both successful and failed experiences, enabling an agent to keep learning from experience after deployment. Agents are becoming increasingly crucial in tackling complex real-world tasks, ranging from general web navigation to assisting with extensive software engineering codebases. However, as these agents transition into persistent, long-running roles in the real world, they face a critical limitation: they struggle to analyze and learn from their successful and failed experiences after deployment.

Test-time scaling (TTS), which scales compute at inference time, has proven highly effective in reasoning domains like math and competitive programming. In agentic environments, however, existing TTS methods often discard the exploration trajectory and treat the final answer as the only useful outcome. This overlooked exploration is actually a rich data source that could accelerate an agent's ability to learn from experience over time. ReasoningBank provides a framework that enables LLMs to learn from experience and evolve into continuous learners at test time. We believe memory-driven experience scaling represents a crucial new frontier for agent scaling.
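
The idea of keeping the whole exploration trajectory, rather than only the final answer, can be illustrated with a small sketch. This is not the ReasoningBank implementation: the class names (`Trajectory`, `MemoryItem`, `ExperienceMemory`), the `distill` stub, and the word-overlap retrieval are all hypothetical choices made for illustration. A real system would presumably use an LLM to distill strategies and a stronger retriever.

```python
from dataclasses import dataclass, field


@dataclass
class Trajectory:
    """One attempt at a task, including the intermediate steps."""
    task: str
    steps: list[str]
    succeeded: bool


@dataclass
class MemoryItem:
    """A strategy distilled from one trajectory (hypothetical schema)."""
    task: str
    strategy: str
    from_success: bool


def distill(trajectory: Trajectory) -> MemoryItem:
    """Placeholder for distillation (assumption: a real system would call an LLM).

    Here the last step is simply tagged as something to reuse or to avoid.
    """
    last_step = trajectory.steps[-1] if trajectory.steps else "(no steps recorded)"
    prefix = "Worked" if trajectory.succeeded else "Avoid"
    return MemoryItem(trajectory.task, f"{prefix}: {last_step}", trajectory.succeeded)


def _tokens(text: str) -> set[str]:
    return set(text.lower().split())


@dataclass
class ExperienceMemory:
    items: list[MemoryItem] = field(default_factory=list)

    def add(self, trajectory: Trajectory) -> None:
        # Failures are stored alongside successes instead of being thrown away.
        self.items.append(distill(trajectory))

    def retrieve(self, task: str, k: int = 2) -> list[MemoryItem]:
        # Rank stored items by Jaccard word overlap with the new task
        # (an illustrative stand-in for a real similarity search).
        query = _tokens(task)

        def score(item: MemoryItem) -> float:
            other = _tokens(item.task)
            union = query | other
            return len(query & other) / len(union) if union else 0.0

        return sorted(self.items, key=score, reverse=True)[:k]


memory = ExperienceMemory()
memory.add(Trajectory("find cheapest laptop on shop site",
                      ["open search", "sort by price ascending"], True))
memory.add(Trajectory("find cheapest phone on shop site",
                      ["open search", "click first result"], False))
memory.add(Trajectory("fix failing unit test in repo",
                      ["read traceback", "patch off-by-one"], True))

hits = memory.retrieve("find cheapest tablet on shop site", k=2)
for item in hits:
    print(item.strategy)
```

In this toy run the two shopping-related memories are retrieved for the new shopping task, one carrying a strategy that worked and one carrying a pitfall to avoid, which is the kind of signal a final-answer-only approach would lose.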

Resource links
ReAct: arxiv.org/abs/2210.03629
LLM-as-a-judge: arxiv.org/abs/2306.05685
Synapse: arxiv.org/abs/2306.07863
WebArena: arxiv.org/abs/2307.13854
Test-time scaling: arxiv.org/abs/2408.03314
Agent Workflow Memory: arxiv.org/abs/2409.07429
Math: arxiv.org/abs/2501.19393
Competitive programming: arxiv.org/abs/2502.14382
Paper: arxiv.org/abs/2509.25140
Gemini-2.5-Flash: docs.cloud.google.com...i/generative-ai/docs/models/gemini/2-5-flash
ReasoningBank code: github.com/google-research/reasoning-bank
SWE-Bench-Verified: openai.com/index/introducing-swe-bench-verified
Original source page: research.google...k-enabling-agents-to-learn-from-experience
Metadata
Source: Google Research Blog
Type: News
Extraction status: raw
Keywords
Generative AI
Machine Intelligence
Natural Language Processing
AI
Industry
Agent
Platform