搜索 — ResearchTracker

Influence functions are commonly used to attribute model behavior to training documents. We explore the reverse: crafting training data that induces model behavior. Our framework, Infusion, uses scalable influence-function approximations to compute small perturbations to training documents that induce targeted changes in model behavior through parameter shifts. We evaluate Infusion on data poisoning tasks across vision and language domains. On CIFAR-10, we show that making subtle edits via Infusion to just 0.2% (100/45,000) of the training documents can be competitive with the baseline of inserting a small number of explicit behavior examples. We also find that Infusion transfers across architectures (ResNet $\leftrightarrow$ CNN), suggesting a single poisoned corpus can affect multiple independently trained models. In preliminary language experiments, we characterize when our approach increases the probability of target behaviors and when it fails, finding it most effective at amplifying behaviors the model has already learned. Taken together, these results show that small, subtle edits to training data can systematically shape model behavior, underscoring the importance of traini

How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models

arXiv2025-09-19作者：Kangtao Lv, Haibin Chen, Yujin Yuan

Large language models (LLMs) have attracted significant attention due to their impressive general capabilities across diverse downstream tasks. However, without domain-specific optimization, they often underperform on specialized knowledge benchmarks and even produce hallucination. Recent studies show that strategically infusing domain knowledge during pretraining can substantially improve downstream performance. A critical challenge lies in balancing this infusion trade-off: injecting too little domain-specific data yields insufficient specialization, whereas excessive infusion triggers catastrophic forgetting of previously acquired knowledge. In this work, we focus on the phenomenon of memory collapse induced by over-infusion. Through systematic experiments, we make two key observations, i.e. 1) Critical collapse point: each model exhibits a threshold beyond which its knowledge retention capabilities sharply degrade. 2) Scale correlation: these collapse points scale consistently with the model's size. Building on these insights, we propose a knowledge infusion scaling law that predicts the optimal amount of domain knowledge to inject into large LLMs by analyzing their smaller cou

搜索结果：infusion

Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions

How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models

Where Should Knowledge Enter? A Layered Framework for Knowledge Infusion in Multimodal Iterative Generative Model

Threats and Security Strategies for IoMT Infusion Pumps

Reinforcement Learning for Synchronised Flow Control in a Dual-Gate Resin Infusion System

Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation

Sounding the metabolic orchestra: A delay dynamical systems perspective on the glucose-insulin regulatory response to on-off glucose infusion

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

Multimodal Infusion Tuning for Large Models

Characterization of differences in immune responses during bolus and continuous infusion endotoxin challenges using mathematical modeling

Collaborative Knowledge Infusion for Low-resource Stance Detection

Efficient Knowledge Infusion via KG-LLM Alignment

Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes

A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil

Deep Learning-Based Computer Vision for Real Time Intravenous Drip Infusion Monitoring

The Paradox of Noise: An Empirical Study of Noise-Infusion Mechanisms to Improve Generalization, Stability, and Privacy in Federated Learning

There is No Big Brother or Small Brother: Knowledge Infusion in Language Models for Link Prediction and Question Answering

Audience-Centric Natural Language Generation via Style Infusion

SpatialMath: Spatial Comprehension-Infused Symbolic Reasoning for Mathematical Problem-Solving