搜索 — ResearchTracker

Eye-hand coordinated interaction is becoming a mainstream interaction modality in Virtual Reality (VR) user interfaces.Current paradigms for this multimodal interaction require users to learn predefined gestures and memorize multiple gesture-task associations, which can be summarized as an ``Operation-to-Intent" paradigm. This paradigm increases users' learning costs and has low interaction error tolerance. In this paper, we propose SIAgent, a novel "Intent-to-Operation" framework allowing users to express interaction intents through natural eye-hand motions based on common sense and habits. Our system features two main components: (1) intent recognition that translates spatial interaction data into natural language and infers user intent, and (2) agent-based execution that generates an agent to execute corresponding tasks. This eliminates the need for gesture memorization and accommodates individual motion preferences with high error tolerance. We conduct two user studies across over 60 interaction tasks, comparing our method with two "Operation-to-Intent" techniques. Results show our method achieves higher intent recognition accuracy than gaze + pinch interaction (97.2% vs 93.1%)

Inclusive AI for Group Interactions: Predicting Gaze-Direction Behaviors in People with Intellectual and Developmental Disabilities

arXiv2026-03-15作者：Giulia Huang, Maristella Matera, Micol Spitale

Artificial agents that support human group interactions hold great promise, especially in sensitive contexts such as well-being promotion and therapeutic interventions. However, current systems struggle to mediate group interactions involving people who are not neurotypical. This limitation arises because most AI detection models (e.g., for turn-taking) are trained on data from neurotypical populations. This work takes a step toward inclusive AI by addressing the challenge of eye contact detection, a core component of non-verbal communication, with and for people with Intellectual and Developmental Disabilities. First, we introduce a new dataset, Multi-party Interaction with Intellectual and Developmental Disabilities (MIDD), capturing atypical gaze and engagement patterns. Second, we present the results of a comparative analysis with neurotypical datasets, highlighting differences in class imbalance, speaking activity, gaze distribution, and interaction dynamics. Then, we evaluate classifiers ranging from SVMs to FSFNet, showing that fine-tuning on MIDD improves performance, though notable limitations remain. Finally, we present the insights gathered through a focus group with six

搜索结果：Interaction

SIAgent: Spatial Interaction Agent via LLM-powered Eye-Hand Motion Intent Understanding in VR

Inclusive AI for Group Interactions: Predicting Gaze-Direction Behaviors in People with Intellectual and Developmental Disabilities

Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions

DeBiasMe: De-biasing Human-AI Interactions with Metacognitive AIED (AI in Education) Interventions

Disambiguating Anthropomorphism and Anthropomimesis in Human-Robot Interaction

The Role of Generative AI in Facilitating Social Interactions: A Scoping Review

From Interaction to Attitude: Exploring the Impact of Human-AI Cooperation on Mental Illness Stigma

Core Elements of Social Interaction for Constructive Human-Robot Interaction

Enhancing Accessibility in Soft Robotics: Exploring Magnet-Embedded Paper-Based Interactions

MERCI: Multimodal Emotional and peRsonal Conversational Interactions Dataset

A Longitudinal Study of Child Wellbeing Assessment via Online Interactions with a Social Robot

Intelligent Interaction Strategies for Context-Aware Cognitive Augmentation

Human-like Nonverbal Behavior with MetaHumans in Real-World Interaction Studies: An Architecture Using Generative Methods and Motion Capture

Internet of Tangible Things (IoTT): Challenges and Opportunities for Tangible Interaction with IoT

Large Language Models Will Change The Way Children Think About Technology And Impact Every Interaction Paradigm

Emotional Interaction Qualities: Vocabulary, Modalities, Actions, And Mapping

The Communal Loom: Integrating Tangible Interaction and Participatory Data Collection for Assessing Well-Being

Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving

Behind the Smile: Mental Health Implications of Mother-Infant Interactions Revealed Through Smile Analysis

What Can You Say to a Robot? Capability Communication Leads to More Natural Conversations