搜索 — ResearchTracker

Manipulating dynamic objects remains an open challenge for Vision-Language-Action (VLA) models, which, despite strong generalization in static manipulation, struggle in dynamic scenarios requiring rapid perception, temporal anticipation, and continuous control. We present DynamicVLA, a framework for dynamic object manipulation that integrates temporal reasoning and closed-loop adaptation through three key designs: 1) a compact 0.4B VLA using a convolutional vision encoder for spatially efficient, structurally faithful encoding, enabling fast multimodal inference; 2) Continuous Inference, enabling overlapping reasoning and execution for lower latency and timely adaptation to object motion; and 3) Latent-aware Action Streaming, which bridges the perception-execution gap by enforcing temporally aligned action execution. To fill the missing foundation of dynamic manipulation data, we introduce the Dynamic Object Manipulation (DOM) benchmark, built from scratch with an auto data collection pipeline that efficiently gathers 200K synthetic episodes across 2.8K scenes and 206 objects, and enables fast collection of 2K real-world episodes without teleoperation. Extensive evaluations demonst

Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation

arXiv2025-02-23作者：Simin Chen, Yiming Chen, Zexin Li

Data contamination has received increasing attention in the era of large language models (LLMs) due to their reliance on vast Internet-derived training corpora. To mitigate the risk of potential data contamination, LLM benchmarking has undergone a transformation from static to dynamic benchmarking. In this work, we conduct an in-depth analysis of existing static to dynamic benchmarking methods aimed at reducing data contamination risks. We first examine methods that enhance static benchmarks and identify their inherent limitations. We then highlight a critical gap-the lack of standardized criteria for evaluating dynamic benchmarks. Based on this observation, we propose a series of optimal design principles for dynamic benchmarking and analyze the limitations of existing dynamic benchmarks. This survey provides a concise yet comprehensive overview of recent advancements in data contamination research, offering valuable insights and a clear guide for future research efforts. We maintain a GitHub repository to continuously collect both static and dynamic benchmarking methods for LLMs. The repository can be found at this link.

搜索结果：dynamic

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation

Mechanisms for a dynamic many-to-many school choice problem

Generalized pair-wise logit dynamic and its connection to a mean field game: theoretical and computational investigations focusing on resource management

3D Spectrum Awareness for Radio Dynamic Zones Using Kriging and Matrix Completion

Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction

Leveraging 2D Priors and SDF Guidance for Dynamic Urban Scene Rendering

Zero-Shot Dynamic Concept Personalization with Grid-Based LoRA

The Predicted-Updates Dynamic Model: Offline, Incremental, and Decremental to Fully Dynamic Transformations

Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning

Dynamic On-Palm Manipulation via Controlled Sliding

Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic Scenes

DynamicStereo: Consistent Dynamic Depth from Stereo Videos

Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory

Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism

ASTER: Adaptive Spatio-Temporal Early Decision Model for Dynamic Resource Allocation

Intrinsic noise in structured replicator dynamics modelling time delays

Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback

On the Computational Efficiency of Adaptive and Dynamic Regret Minimization

Large Language Models for Explainable Decisions in Dynamic Digital Twins