搜索 — ResearchTracker

Safety has become the central value around which dominant AI governance efforts are being shaped. Recently, this culminated in the publication of the International AI Safety Report, written by 96 experts of which 30 nominated by the Organisation for Economic Co-operation and Development (OECD), the European Union (EU), and the United Nations (UN). The report focuses on the safety risks of general-purpose AI and available technical mitigation approaches. In this response, informed by a system safety perspective, I refl ect on the key conclusions of the report, identifying fundamental issues in the currently dominant technical framing of AI safety and how this frustrates meaningful discourse and policy efforts to address safety comprehensively. The system safety discipline has dealt with the safety risks of software-based systems for many decades, and understands safety risks in AI systems as sociotechnical and requiring consideration of technical and non-technical factors and their interactions. The International AI Safety report does identify the need for system safety approaches. Lessons, concepts and methods from system safety indeed provide an important blueprint for overcoming

Annotating and Auditing the Safety Properties of Unsafe Rust

arXiv2025-04-30作者：Zihao Rao, Jiping Zhou, Hongliang Tian

In Rust, unsafe code is the sole source of potential undefined behaviors. To avoid misuse, Rust developers should clarify the safety properties for each unsafe API. However, the community currently lacks a key standard for safety documentation: existing safety comments in the source code and safety documentation can be ad hoc and incomplete. This paper presents a tag-centric methodology for auditing the consistency and completeness of safety documentation. We first derive a taxonomy of Safety Tags to formalize natural-language requirements. Second, because API soundness frequently relies on struct invariants, we propose a set of empirical rules to systematically audit the structural consistency of safety documentation. We implemented this methodology in safety-tool, a static linter that automatically enforces structural consistency between local safety annotations and callee requirements. Our approach was applied to the Rust standard library, fixing documentation issues on 27 APIs with 61 safety tags and identifying safety tags that are applicable to 96.1% of the public unsafe APIs in libstd. Furthermore, we have formalized the tagging idea through a Rust RFC to the wider community

搜索结果：safety

AI Safety is Stuck in Technical Terms -- A System Safety Response to the International AI Safety Report

Annotating and Auditing the Safety Properties of Unsafe Rust

All Languages Matter: On the Multilingual Safety of Large Language Models

RISC-V Functional Safety for Autonomous Automotive Systems: An Analytical Framework and Research Roadmap for ML-Assisted Certification

Reconciling Safety Measurement and Dynamic Assurance

Safety Under Scaffolding: How Evaluation Conditions Shape Measured Safety

BarrierSteer: LLM Safety via Learning Barrier Steering

Conformal Safety Monitoring for Flight Testing: A Case Study in Data-Driven Safety Learning

The Hidden Dimensions of LLM Alignment: A Multi-Dimensional Analysis of Orthogonal Safety Directions

The Open Autonomy Safety Case Framework

Behavioral Safety Assessment towards Large-scale Deployment of Autonomous Vehicles

MTMCS-Bench: Evaluating Contextual Safety of Multimodal Large Language Models in Multi-Turn Dialogues

Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

Safety Factories - a Manifesto

Backup-Based Safety Filters: A Comparative Review of Backup CBF, Model Predictive Shielding, and gatekeeper

Relating System Safety and Machine Learnt Model Performance

Empathy Is Not What Changed: Clinical Assessment of Psychological Safety Across GPT Model Generations

Speculative Safety-Aware Decoding

Saffron-1: Safety Inference Scaling

Aviation Safety Enhancement via NLP &amp; Deep Learning: Classifying Flight Phases in ATSB Safety Reports

Aviation Safety Enhancement via NLP & Deep Learning: Classifying Flight Phases in ATSB Safety Reports