搜索 — ResearchTracker

Locating specific segments within an instructional video is an efficient way to acquire guiding knowledge. Generally, the task of obtaining video segments for both verbal explanations and visual demonstrations is known as visual answer localization (VAL). However, users often need multiple interactions to obtain answers that align with their expectations when using the system. During these interactions, humans deepen their understanding of the video content by asking themselves questions, thereby accurately identifying the location. Therefore, we propose a new task, named In-VAL, to simulate the multiple interactions between humans and videos in the procedure of obtaining visual answers. The In-VAL task requires interactively addressing several semantic gap issues, including 1) the ambiguity of user intent in the input questions, 2) the incompleteness of language in video subtitles, and 3) the fragmentation of content in video segments. To address these issues, we propose Ask2Loc, a framework for resolving In-VAL by asking questions. It includes three key modules: 1) a chatting module to refine initial questions and uncover clear intentions, 2) a rewriting module to generate fluent

Task Matters: Investigating Human Questioning Behavior in Different Household Service for Learning by Asking Robots

arXiv2025-04-11作者：Yuanda Hu, Hou Jiani, Zhang Junyu

Learning by Asking (LBA) enables robots to identify knowledge gaps during task execution and acquire the missing information by asking targeted questions. However, different tasks often require different types of questions, and how to adapt questioning strategies accordingly remains underexplored. This paper investigates human questioning behavior in two representative household service tasks: a Goal-Oriented task (refrigerator organization) and a Process-Oriented task (cocktail mixing). Through a human-human study involving 28 participants, we analyze the questions asked using a structured framework that encodes each question along three dimensions: acquired knowledge, cognitive process, and question form. Our results reveal that participants adapt both question types and their temporal ordering based on task structure. Goal-Oriented tasks elicited early inquiries about user preferences, while Process-Oriented tasks led to ongoing, parallel questioning of procedural steps and preferences. These findings offer actionable insights for developing task-sensitive questioning strategies in LBA-enabled robots for more effective and personalized human-robot collaboration.

搜索结果：asking

Ask2Loc: Learning to Locate Instructional Visual Answers by Asking Questions

Task Matters: Investigating Human Questioning Behavior in Different Household Service for Learning by Asking Robots

Applying Text Mining to Analyze Human Question Asking in Creativity Research

Asking Clarifying Questions for Preference Elicitation With Large Language Models

LOVA3: Learning to Visual Question Answering, Asking and Assessment

Socratic Students: Teaching Language Models to Learn by Asking Questions

Asking the Right Question at the Right Time: Human and Model Uncertainty Guidance to Ask Clarification Questions

Asking a Language Model for Diverse Responses

Asking More Informative Questions for Grounded Retrieval

A Survey on Asking Clarification Questions Datasets in Conversational Systems

Pedagogical Agents for Fostering Question-Asking Skills in Children

ELBA: Learning by Asking for Embodied Visual Navigation and Task Completion

Crafting Interpretable Embeddings by Asking LLMs Questions

A New Dialogue Response Generation Agent for Large Language Models by Asking Questions to Detect User's Intentions

Asking Clarifying Questions in Open-Domain Information-Seeking Conversations

Asking Complex Questions with Multi-hop Answer-focused Reasoning

Learning through Dialogue Interactions by Asking Questions

Ask don't tell: Reducing sycophancy in large language models

Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents

Bid--Ask Martingale Optimal Transport