项目
聚合研究平台、项目发布、系统演示与机构级研发动态,适合跟踪持续迭代的研究工程与平台建设。
这些条目在刷新流程中完成识别、评分与分类,并以持久化结果直接提供给页面使用。
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Project Silica introduces new techniques for encoding data in borosilicate glass, as described in the journal Nature. These advances lower media cost and simplify writing and reading systems while supporting 10,000-year data preservation. The post Project Silica’s advances in glass storage technology appeared first on Microsoft Research .
Mellea 0.4.0 is the latest release of an open-source research project initiated and developed by IBM Research. Building on 0.3.0 foundational libraries and workflow primitives, 0.4.0 expands the library's integration surface and introduces new architectural patterns for structuring generative workflows. Simply put, a Granite Library is a collection of specialized model adapters designed to perform well-defined operations on portions of an input chain or conversation.
The ecosystem remains highly concentrated. Approximately half of the models on Hugging Face have less than 200 total downloads, and the top 200 most downloaded models, or 0.01% of models, comprise 49.6% of all downloads. Specialized communities form around particular domains, languages, or problem areas, and often show sustained engagement and reuse even when their overall download counts are modest. Open source AI is best understood as a collection of overlapping sub-ecosystems rather than a single uniform market.
Bringing VLA models to embedded platforms is not a matter of model compression, but a complex systems engineering problem requiring architectural decomposition, latency-aware scheduling, and hardware-aligned execution. Addressing these challenges is essential to translate recent advances in multimodal foundation models into practical and deployable embedded robotic systems.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Diamond photonics has enabled efficient interfaces for quantum memories and is predicted to be a critical component of quantum networks. However, scalable network architectures require spatial, temporal, and spectral control of photons, which relies on nonlinear and electro-optic functionalities that diamond alone cannot provide. Here, we demonstrate heterogeneous integration of a thin-film lithium niobate (TFLN) platform, which has strong chi-2 nonlinearity and electro-optic effects, with thin diamond films. We demonstrate high-Q diamond photonic crystal cavities (Q factors exceeding 5x10^4 at 735 nm) that are lithographically aligned with TFLN photonic backbone and critically coupled to it. This allows us to realize low-loss diamond-TFLN "escalators" (loss ~1 dB/coupler) that support efficient light transfer between them. At cryogenic temperatures (5K), we can collect photons emitted from silicon vacancies (SiVs) embedded within the diamond structure via the TFLN photonic circuit. This approach establishes a scalable route toward integrated photonic circuits for practical quantum networking and other technologies.
Volume 63, Issue 1, December 2026 .
In this paper, we propose a multi-view visualization technology for spatio-temporal knowledge graph(STKG), which utilizes three distinct perspectives: knowledge tree, knowledge net, and knowledge map, to facilitate a comprehensive analysis of the STKG. The knowledge tree enables the visualization of hierarchical interrelation within the STKG, while the knowledge net elucidates semantic relationships among knowledge entities. Additionally, the knowledge map displays spatial and temporal distributions via spatial maps and time axes, respectively. Our visualization technology addresses the limitations inherent in single-view approaches and the deficiency of interaction in spatio-temporal perspectives evident in existing visualization methods. Moreover, we have encapsulated this technology within an integrated, open-source platform named GeoViz. A demo video of GeoViz can be accessed at https://github.com/JeremyChou28/GeoViz.
As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-step API workflows, a new challenge has emerged: transparency. When a human makes a mistake, we can usually trace the logic. But when an AI agent fails, perhaps by hallucinating a tool output or […] The post Systematic debugging for AI agents: Introducing the AgentRx framework appeared first on Microsoft Research .
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Microsoft Research unveils Paza, a human-centered speech pipeline, and PazaBench, the first leaderboard for low-resource languages. It covers 39 African languages and 52 models and is tested with communities in real settings. The post Paza: Introducing automatic speech recognition benchmarks and models for low resource languages appeared first on Microsoft Research .