Search - ResearchTracker

Flow Matching (FM) has achieved remarkable generative performance, yet it suffers from exposure bias due to discrepancies between training and inference. Existing mitigation strategies typically rely on static constraints or external heuristics. In this work, we propose that exposure bias itself inherently contains dynamic signals that can guide its own rectification. To leverage this, we introduce DEFAR (DirEctional-Frequency Adaptive Rectification). This framework simulates the single-step inference process during training to identify exposure bias. It utilizes directional and frequency-adaptive feedback signals from the bias itself to enhance the model's bias tolerance. It consists of two key components: (1) Anti-Drift Rectification (ADR). ADR treats inference-time drift as a signal to learn the direction to steer deviated states back toward the target. ADR endows the model with intrinsic active self-rectification capabilities; (2) Frequency Compensation (FC). Empirically, we observe that accumulated bias often stems from a lack of low-frequency components in high-noise stages, and exposure bias carries the missing frequency. FC leverages the bias itself as a self-feedback weigh

Bilevel Autoresearch: Meta-Autoresearching Itself

arXiv, 2026-03-24. Authors: Yaonan Qu, Meng Lu

If autoresearch is itself a form of research, then autoresearch can be applied to research itself. We present Bilevel Autoresearch, a bilevel framework in which an outer autoresearch loop improves an inner autoresearch loop by reading its code and traces, identifying bottlenecks, and generating injectable Python search mechanisms at runtime. The inner loop optimizes task performance; the outer loop optimizes how the inner loop searches. Both loops use the same LLM, so improvements come from the bilevel architecture rather than a stronger meta-level model, although the outer loop consumes additional inference and wall-clock budget. On Karpathy's GPT pretraining benchmark, the meta-autoresearch outer loop achieves a 5x improvement over the standard inner loop alone (-0.045 vs. -0.009 val_bpb), while parameter-level adjustment without mechanism change yields no reliable gain. The outer loop instantiates mechanisms from adjacent search domains, including combinatorial optimization, multi-armed bandits, and design of experiments, without human specification of the final mechanism design. Trace analysis suggests that these mechanisms break deterministic search patterns and force explorat

Search results: itself

Exposure Bias Can Alleviate Itself via Directional and Frequency Rectification in Flow Matching

Bilevel Autoresearch: Meta-Autoresearching Itself

Detoxification for LLM: From Dataset Itself

Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token

KL-regularization Itself is Differentially Private in Bandits and RLHF

LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning

A Model Can Help Itself: Reward-Free Self-Training for LLM Reasoning

Could society itself spiral into a Lorenz-like chaos when facing an epidemic threat?

When AI Eats Itself: On the Caveats of AI Autophagy

Guiding a Diffusion Model with a Bad Version of Itself

Measurement of reactor thermal neutron fluence of NTD-Ge by activation High-Purity Ge itself

This Game Is Not Going To Analyze Itself

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

An LLM can Fool Itself: A Prompt-Based Adversarial Attack

EvolveMT: an Ensemble MT Engine Improving Itself with Usage Only

A Modern Self-Referential Weight Matrix That Learns to Modify Itself

From actions of an abelian group on itself to left braces

How a leak can stop itself

Self-Supervised Implicit Attention: Guided Attention by The Model Itself

Visually Evaluating Generative Adversarial Networks Using Itself under Multivariate Time Series