搜索 — ResearchTracker

Scientific research proceeds through iterative cycles of hypothesis generation, experiment design, execution, and revision. AI agents can automate parts of this process, but existing approaches typically follow a single research trajectory or coordinate through a central planner with fixed objectives. As a result, they struggle to sustain parallel exploration, adapt as experimental evidence changes, or preserve knowledge of failed directions over long-running experiments. We introduce AutoScientists, a decentralized team of AI agents for long-running computational scientific experimentation. Agents interpret a shared experimental state, self-organize into teams around promising hypotheses, critique proposals before using experimental compute, and share successes and failures to reduce redundant exploration. Under matched experimental budgets, AutoScientists improves over prior AI agents across biomedical machine learning, language-model training optimization, and protein fitness prediction. On BioML-Bench, spanning biomedical imaging, protein engineering, single-cell omics, and drug discovery, AutoScientists achieves a mean leaderboard percentile of 74.4% across 24 tasks, improving

AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems

arXiv2026-02-22作者：Emmanuel Bamidele

Long-running LLM agents require persistent memory to preserve state across interactions, yet most deployed systems manage memory with age-based retention (e.g., TTL). While TTL bounds item lifetime, it does not bound the computational footprint of memory on the request path: as retained items accumulate, retrieval candidate sets and vector similarity scans can grow unpredictably, yielding heavy-tailed latency and unstable throughput. We present AMV-L (Adaptive Memory Value Lifecycle), a memory-management framework that treats agent memory as a managed systems resource. AMV-L assigns each memory item a continuously updated utility score and uses value-driven promotion, demotion, and eviction to maintain lifecycle tiers; retrieval is restricted to a bounded, tier-aware candidate set that decouples the request-path working set from total retained memory. We implement AMV-L in a full-stack LLM serving system and evaluate it under identical long-running workloads against two baselines: TTL and an LRU working-set policy, with fixed prompt-injection caps. Relative to TTL, AMV-L improves throughput by 3.1x and reduces latency by 4.2x (median), 4.7x (p95), and 4.4x (p99), while reducing the

搜索结果：long-running

AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation

AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems

Neither Layer Alone: Epistemic Integrity Requires Hierarchical Joint Design for Long-Running AI Agents

MEMTIER: Tiered Memory Architecture and Retrieval Bottleneck Analysis for Long-Running Autonomous AI Agents

SentinelBench: A Benchmark for Long-Running Monitoring Agents

Developing Adaptive Context Compression Techniques for Large Language Models (LLMs) in Long-Running Interactions

Memory Depth, Not Memory Access: Selective Parametric Consolidation for Long-Running Language Agents

RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents

Understanding Persuasion in Long-Running Agents

Memory Management and Contextual Consistency for Long-Running Low-Code Agents

Oze: Decentralized Graph-based Concurrency Control for Long-running Update Transactions (Extended Version)

Spot-on: A Checkpointing Framework for Fault-Tolerant Long-running Workloads on Cloud Spot Instances

Affinity-Aware Resource Provisioning for Long-Running Applications in Shared Clusters

Optimizing System Quality of Service through Rejuvenation for Long-Running Applications with Real-Time Constraints

Cherry-Picking of Code Commits in Long-Running, Multi-release Software

Proactive Service Migration for Long-Running Byzantine Fault Tolerant Systems

Policy Design in Long-Run Welfare Dynamics

Analysis of Multiple Long-Run Relations in Panel Data Models

Persuasion in the Long Run: When history matters

Markov control of continuous time Markov processes with long run functionals by time discretization