搜索结果：intervention

共找到 20 条结果

高级筛选 ▾

Robust Intervention Learning from Emergency Stop Interventions

arXiv

Human interventions are a common source of data in autonomous systems during testing. These interventions provide an important signal about where the current policy needs improvement, but are often noisy and incomplete. We define Robust Intervention Learning (RIL) as the problem of learning from intervention data while remaining robust to the quality and informativeness of the intervention signal. In the best case, interventions are precise and avoiding them is sufficient to solve the task, but in many realistic settings avoiding interventions is necessary but not sufficient for achieving good performance. We study robust intervention learning in the context of emergency stop interventions and propose Residual Intervention Fine-Tuning (RIFT), a residual fine-tuning algorithm that treats intervention feedback as an incomplete learning signal and explicitly combines it with a prior policy. By framing intervention learning as a fine-tuning problem, our approach leverages structure encoded in the prior policy to resolve ambiguity when intervention signals under-specify the task. We provide theoretical analysis characterizing conditions under which this formulation yields principled pol

Orthogonal factorial designs for trials of therapist-delivered interventions: Randomising intervention-therapist combinations to patients

arXiv2026-01-22作者：Rebecca EA Walwyn, Rosemary A Bailey, Arpan Singh

It is recognised that treatment-related clustering should be allowed for in the sample size and analyses of individually-randomised parallel-group trials that evaluate therapist-delivered interventions such as psychotherapy. Here, interventions are a treatment factor, but therapists are not. If the aim of a trial is to separate effects of therapists from those of interventions, we propose that interventions and therapists should be regarded as two potentially interacting treatment factors (one fixed, one random) with a factorial structure. We consider the specific design where each therapist delivers each intervention (crossed therapist-intervention design), and the resulting therapist-intervention combinations are randomised to patients. We adopt a classical Design of Experiments (DoE) approach to propose a family of orthogonal factorial designs and their associated data analyses, which allow for therapist learning and centre too. We set out the associated data analyses using ANOVA and regression and report the results of a small simulation study conducted to explore the performance of the proposed randomisation methods in estimating the intervention effect and its standard error,

搜索结果：intervention

Robust Intervention Learning from Emergency Stop Interventions

Orthogonal factorial designs for trials of therapist-delivered interventions: Randomising intervention-therapist combinations to patients

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Leveraging AI for Direct Bystander Intervention Against Cyberbullying

Multi-Adapter Representation Interventions via Energy Calibration

Time-varying confounding in epidemic intervention evaluations

On a two-season faecal-oral model with impulsive intervention

SCOPE: Sequential Causal Optimization of Process Interventions

Debiased inference for stochastic treatment interventions with survival outcomes

Interpreting Transformers Through Attention Head Intervention

Non-linear Interventions on Large Language Models

A structural causal framework for interventions on evolutionary accumulation models

How to design research-aligned DEI interventions in physics

Task-driven Layerwise Additive Activation Intervention

Linking Model Intervention to Causal Interpretation in Model Explanation

MILE: Model-based Intervention Learning

Quickest Causal Change Point Detection by Adaptive Intervention

The Saturation Trap and the Subjectivity of Intervention Timing: Why Affect-Based Triggers and LLM Judges Fail to Time Interventions on Autonomous Agents

Stochastic Intervention

Characterising Interventions in Causal Games