搜索结果：AJPM focus

共找到 20 条结果

高级筛选 ▾

Focus Agent: LLM-Powered Virtual Focus Group

arXiv2024-09-03作者：Taiyu Zhang, Xuesong Zhang, Robbe Cools

In the domain of Human-Computer Interaction, focus groups represent a widely utilised yet resource-intensive methodology, often demanding the expertise of skilled moderators and meticulous preparatory efforts. This study introduces the ``Focus Agent,'' a Large Language Model (LLM) powered framework that simulates both the focus group (for data collection) and acts as a moderator in a focus group setting with human participants. To assess the data quality derived from the Focus Agent, we ran five focus group sessions with a total of 23 human participants as well as deploying the Focus Agent to simulate these discussions with AI participants. Quantitative analysis indicates that Focus Agent can generate opinions similar to those of human participants. Furthermore, the research exposes some improvements associated with LLMs acting as moderators in focus group discussions that include human participants.

FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering

arXiv2025-06-26作者：Liangyu Zhong, Fabio Rosenthal, Joachim Sicking

While Multimodal Large Language Models (MLLMs) offer strong perception and reasoning capabilities for image-text input, Visual Question Answering (VQA) focusing on small image details still remains a challenge. Although visual cropping techniques seem promising, recent approaches have several limitations: the need for task-specific fine-tuning, low efficiency due to uninformed exhaustive search, or incompatibility with efficient attention implementations. We address these shortcomings by proposing a training-free visual cropping method, dubbed FOCUS, that leverages MLLM-internal representations to guide the search for the most relevant image region. This is accomplished in four steps: first, we identify the target object(s) in the VQA prompt; second, we compute an object relevance map using the key-value (KV) cache; third, we propose and rank relevant image regions based on the map; and finally, we perform the fine-grained VQA task using the top-ranked region. As a result of this informed search strategy, FOCUS achieves strong performance across four fine-grained VQA datasets and three types of MLLMs. It outperforms three popular visual cropping methods in both accuracy and efficie

搜索结果：AJPM focus

Focus Agent: LLM-Powered Virtual Focus Group

FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering

FOCUS: Optimal Control for Multi-Entity World Modeling in Text-to-Image Generation

Adaptive Chain-of-Focus Reasoning via Dynamic Visual Search and Zooming for Efficient VLMs

FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models

Results on Charmed Meson Spectroscopy from Focus

Focus on topological quantum computation

Focus Group on Artificial Intelligence for Health

Focus on Shortcuts to Adiabaticity

Emergent phenomena in multicomponent superconductivity: an introduction to the focus issue

Focus on Quantum Memories

Focus on the Physics of Cancer

The Target Silicon Detector for the FOCUS Spectrometer

D0-D0bar Mixing in FOCUS

Cerenkov Particle Identification in FOCUS

Analysis of the Kpi hadronic state interaction using D into K-pi+mu+nu semileptonic decays from the FOCUS experiment

Focus on Imaging Methods in Granular Physics

Reconstruction of Vees, Kinks, $Ξ^-$'s, and $Ω^-$'s in the Focus Spectrometer

Dalitz plot analysis of the D+ to K-pi+pi+ decay in the FOCUS experiment

Measurement of the Ratio of the Vector to Pseudoscalar Charm Semileptonic Decay Rate Γ(D+ &gt; ANTI-K*0 mu+ nu)/Γ(D+ &gt; ANTI-K0 mu+ nu)

Measurement of the Ratio of the Vector to Pseudoscalar Charm Semileptonic Decay Rate Γ(D+ > ANTI-K*0 mu+ nu)/Γ(D+ > ANTI-K0 mu+ nu)