搜索 — ResearchTracker

Active vision, also known as active perception, refers to the process of actively selecting where and how to look in order to gather task-relevant information. It is a critical component of efficient perception and decision-making in humans and advanced embodied agents. Recently, the use of Multimodal Large Language Models (MLLMs) as central planning and decision-making modules in robotic systems has gained extensive attention. However, despite the importance of active perception in embodied intelligence, there is little to no exploration of how MLLMs can be equipped with or learn active perception capabilities. In this paper, we first provide a systematic definition of MLLM-based active perception tasks. We point out that the recently proposed GPT-o3 model's zoom-in search strategy can be regarded as a special case of active perception; however, it still suffers from low search efficiency and inaccurate region selection. To address these issues, we propose ACTIVE-O3, a purely reinforcement learning based training framework built on top of GRPO, designed to equip MLLMs with active perception capabilities. We further establish a comprehensive benchmark suite to evaluate ACTIVE-O3 ac

Hydrodynamics of Dense Active Fluids: Turbulence-Like States and the Role of Advected Activity

arXiv2026-02-25作者：Sandip Sahoo, Siddhartha Mukherjee, Samriddhi Sankar Ray

Dense suspensions of self-propelled bacteria and related active fluids exhibit spontaneous flow generation, vortex formation, and spatiotemporally chaotic dynamics despite operating at vanishingly small Reynolds numbers. These phenomena, commonly referred to as active turbulence, display striking visual and statistical similarities to classical inertial turbulence while arising from fundamentally different nonequilibrium mechanisms. In this article, we present a combined review and theoretical study of hydrodynamic models for dense active fluids, with particular emphasis on bacterial suspensions described by the Toner--Tu--Swift--Hohenberg (TTSH) framework. We review key experimental and theoretical developments underlying the analogy between active and inertial turbulence, highlighting the emergence of multiple dynamical regimes and the conditions under which universal spectral and intermittent behavior arises in homogeneous systems. Moving beyond the conventional assumption of spatially uniform activity, we introduce a minimal model in which the activity field is heterogeneous and dynamically advected by the flow it generates. Thus treating activity as a spatiotemporally evolving

搜索结果：active

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Hydrodynamics of Dense Active Fluids: Turbulence-Like States and the Role of Advected Activity

Self-propulsive active nematics

Subsurface Flows Associated with Formation and Flaring Activity of Solar Active Regions

Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection

Active Probing with Multimodal Predictions for Motion Planning

Compute-Efficient Active Learning

Learning in Hybrid Active Inference Models

Active Inference for an Intelligent Agent in Autonomous Reconnaissance Missions

Active Stereo in the Wild through Virtual Pattern Projection

A Cross-Domain Benchmark for Active Learning

A Markovian Formalism for Active Querying

The passive and active periods for the intermittent use of an active sensor to detect an evasive target

A Framework for Transmission Design for Active RIS-Aided Communication with Partial CSI

Phase field models of active matter

Active Neural Mapping

Deriving time-averaged active inference from control principles

Active transport in complex environments

Models of Animal Behavior as Active Particle Systems with Nonreciprocal Interactions

Class Balanced Dynamic Acquisition for Domain Adaptive Semantic Segmentation using Active Learning