搜索结果：DeepMind

共找到 20 条结果

高级筛选 ▾

Lessons from External Review of DeepMind's Scheming Inability Safety Case

arXiv2026-04-23作者：Stephen Barrett, Francisco Javier Campos Zabala, Sean P. Fillingham

Safety cases for frontier AI systems should provide a convincing argument, supported by evidence, that the risk of harm is within an acceptable bound. When developers author their own safety cases, confirmation bias and conflicted incentives can affect the quality of argument. External review can help to address this. In this paper, we apply the Assurance 2.0 framework to perform an external review of Google DeepMind's public scheming inability safety case. We surface substantive new concerns that materially affect the scope of the safety case and its applicability for decision-making. Based on this experience, we provide concrete recommendations for how external review should be conducted and what information AI developers should provide to support it.

Meta-Learning and Meta-Reinforcement Learning -- Tracing the Path towards DeepMind's Adaptive Agent

arXiv2026-02-23作者：Björn Hoppmann, Christoph Scholz

Humans are highly effective at utilizing prior knowledge to adapt to novel tasks, a capability that standard machine learning models struggle to replicate due to their reliance on task-specific training. Meta-learning overcomes this limitation by allowing models to acquire transferable knowledge from various tasks, enabling rapid adaptation to new challenges with minimal data. This survey provides a rigorous, task-based formalization of meta-learning and meta-reinforcement learning and uses that paradigm to chronicle the landmark algorithms that paved the way for DeepMind's Adaptive Agent, consolidating the essential concepts needed to understand the Adaptive Agent and other generalist approaches.

搜索结果：DeepMind

Lessons from External Review of DeepMind's Scheming Inability Safety Case

Meta-Learning and Meta-Reinforcement Learning -- Tracing the Path towards DeepMind's Adaptive Agent

Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations

Designing Reliable Experiments with Generative Agent-Based Modeling: A Comprehensive Guide Using Concordia by Google DeepMind

DeepMind Lab

DeepMind Lab2D

DeepMind Control Suite

Agentic AI in the Software Development Lifecycle: Architecture, Empirical Evidence, and the Reshaping of Software Engineering

Aletheia tackles FirstProof autonomously

Approximated Behavioral Metric-based State Projection for Federated Reinforcement Learning

UT-GraphCast Hindcast Dataset: A Global AI Forecast Archive from UT Austin for Weather and Climate Applications

Putnam-like dataset summary: LLMs as mathematical competition contestants

Generalization capabilities of MeshGraphNets to unseen geometries for fluid dynamics

DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors

DreamerV3-XP: Optimizing exploration through uncertainty estimation

Sums and differences of sets (improvement over AlphaEvolve)

Sketch-to-Layout: Sketch-Guided Multimodal Layout Generation

SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications

VideoPrism: A Foundational Visual Encoder for Video Understanding

iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning