搜索结果：Long-Shot

共找到 20 条结果

高级筛选 ▾

ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation

arXiv2025-12-18作者：Zixuan Chen, Chongkai Gao, Lin Shao

One-shot imitation learning (OSIL) offers a promising way to teach robots new skills without large-scale data collection. However, current OSIL methods are primarily limited to short-horizon tasks, thus limiting their applicability to complex, long-horizon manipulations. To address this limitation, we propose ManiLong-Shot, a novel framework that enables effective OSIL for long-horizon prehensile manipulation tasks. ManiLong-Shot structures long-horizon tasks around physical interaction events, reframing the problem as sequencing interaction-aware primitives instead of directly imitating continuous trajectories. This primitive decomposition can be driven by high-level reasoning from a vision-language model (VLM) or by rule-based heuristics derived from robot state changes. For each primitive, ManiLong-Shot predicts invariant regions critical to the interaction, establishes correspondences between the demonstration and the current observation, and computes the target end-effector pose, enabling effective task execution. Extensive simulation experiments show that ManiLong-Shot, trained on only 10 short-horizon tasks, generalizes to 20 unseen long-horizon tasks across three difficulty

搜索结果：Long-Shot

ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation

Setting a Baseline for long-shot real-time Player and Ball detection in Soccer Videos

Handling Supervision Scarcity in Chest X-ray Classification: Long-Tailed and Zero-Shot Learning

What Really Matters in Many-Shot Attacks? An Empirical Study of Long-Context Vulnerabilities in LLMs

CXR-CML: Improved zero-shot classification of long-tailed multi-label diseases in Chest X-Rays

Efficient Zero-Shot Long Document Classification by Reducing Context Through Sentence Ranking

DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation

CoS: Chain-of-Shot Prompting for Long Video Understanding

StoryMem: Multi-shot Long Video Storytelling with Memory

Long Context Tuning for Video Generation

CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray

Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation

Train Short, Infer Long: Speech-LLM Enables Zero-Shot Streamable Joint ASR and Diarization on Long Audio

Leveraging State Space Models in Long Range Genomics

VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning

Zero-Shot Complex Question-Answering on Long Scientific Documents

Point to Span: Zero-Shot Moment Retrieval for Navigating Unseen Hour-Long Videos

True Zero-Shot Inference of Dynamical Systems Preserving Long-Term Statistics

VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs

TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning

搜索结果：Long-Shot

ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation

Setting a Baseline for long-shot real-time Player and Ball detection in Soccer Videos

Handling Supervision Scarcity in Chest X-ray Classification: Long-Tailed and Zero-Shot Learning

What Really Matters in Many-Shot Attacks? An Empirical Study of Long-Context Vulnerabilities in LLMs

CXR-CML: Improved zero-shot classification of long-tailed multi-label diseases in Chest X-Rays

Efficient Zero-Shot Long Document Classification by Reducing Context Through Sentence Ranking

DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation

CoS: Chain-of-Shot Prompting for Long Video Understanding

StoryMem: Multi-shot Long Video Storytelling with Memory

Long Context Tuning for Video Generation

CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray

Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation

Train Short, Infer Long: Speech-LLM Enables Zero-Shot Streamable Joint ASR and Diarization on Long Audio

Leveraging State Space Models in Long Range Genomics

VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning

Zero-Shot Complex Question-Answering on Long Scientific Documents

Point to Span: Zero-Shot Moment Retrieval for Navigating Unseen Hour-Long Videos

True Zero-Shot Inference of Dynamical Systems Preserving Long-Term Statistics

VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs

TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning