Search - ResearchTracker

Current LLM assistants are powerful at answering questions, but they have limited access to the behavioral context that reveals when and where a user is struggling. We present a gaze-grounded multimodal LLM assistant that uses egocentric video with gaze overlays to identify likely points of difficulty and target follow-up retrospective assistance. We instantiate this vision in a controlled study (n=36) comparing the gaze-aware AI assistant to a text-only LLM assistant. Compared to a conventional LLM assistant, the gaze-aware assistant was rated as significantly more accurate and personalized in its assessments of users' reading behavior and significantly improved people's ability to recall information. Users spoke significantly fewer words with the gaze-aware assistant, indicating more efficient interactions. Qualitative results underscored both perceived benefits in comprehension and challenges when interpretations of gaze behaviors were inaccurate. Our findings suggest that gaze-aware LLM assistants can reason about cognitive needs to improve cognitive outcomes of users.

Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU

arXiv, 2026-02-17. Authors: Rehana Mahfuz, Yinyi Guo, Erik Visser

Real-time conversational assistants for procedural tasks often depend on video input, which can be computationally expensive and compromise user privacy. For the first time, we propose a real-time conversational assistant that provides comprehensive guidance for a procedural task using only lightweight privacy-preserving modalities such as audio and IMU inputs from a user's wearable device to understand the context. This assistant proactively communicates step-by-step instructions to a user performing a furniture assembly task, and answers user questions. We construct a dataset containing conversations where the assistant guides the user in performing the task. On observing that an off-the-shelf language model is a very talkative assistant, we design a novel User Whim Agnostic (UWA) LoRA finetuning method which improves the model's ability to suppress less informative dialogues, while maintaining its tendency to communicate important instructions. This leads to >30% improvement in the F-score. Finetuning the model also results in a 16x speedup by eliminating the need to provide in-context examples in the prompt. We further describe how such an assistant is implemented on edge de

Search results: assistant

From Gaze to Guidance: Interpreting and Adapting to Users' Cognitive Needs with Multimodal Gaze-Aware AI Assistants

Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU

ProPerSim: Developing Proactive and Personalized AI Assistants through User-Assistant Simulation

AMiD: Knowledge Distillation for LLMs with $\alpha$-mixture Assistant Distribution

Virtual Mouse And Assistant: A Technological Revolution Of Artificial Intelligence

Can AI Assistants Know What They Don't Know?

Need Help? Designing Proactive AI Assistants for Programming

Role of energy-invariant assistants in energy extraction from quantum batteries

Gender Biases in Error Mitigation by Voice Assistants

ScheduleMe: Multi-Agent Calendar Assistant

LLAMAPIE: Proactive In-Ear Conversation Assistants

VS-Assistant: Versatile Surgery Assistant on the Demand of Surgeons

Digital assistant in a point of sales

Embodiment perception of a smart home assistant

From a Natural to a Formal Language with DSL Assistant

PEAK: Explainable Privacy Assistant through Automated Knowledge Extraction

A Mixed-Methods Approach to Understanding User Trust after Voice Assistant Failures

TalkPhoto: A Versatile Training-Free Conversational Assistant for Intelligent Image Editing

Evaluation and Continual Improvement for an Enterprise AI Assistant

A Wizard of Oz Study Simulating API Usage Dialogues with a Virtual Assistant