20 results found
In the domain of Human-Computer Interaction, focus groups are a widely used yet resource-intensive methodology, often demanding skilled moderators and meticulous preparation. This study introduces the "Focus Agent," a Large Language Model (LLM) powered framework that both simulates focus groups (for data collection) and acts as a moderator in focus groups with human participants. To assess the quality of the data derived from the Focus Agent, we ran five focus group sessions with a total of 23 human participants and deployed the Focus Agent to simulate these discussions with AI participants. Quantitative analysis indicates that the Focus Agent can generate opinions similar to those of human participants. Furthermore, the research exposes some improvements associated with LLMs acting as moderators in focus group discussions that include human participants.
While Multimodal Large Language Models (MLLMs) offer strong perception and reasoning capabilities for image-text input, Visual Question Answering (VQA) focusing on small image details still remains a challenge. Although visual cropping techniques seem promising, recent approaches have several limitations: the need for task-specific fine-tuning, low efficiency due to uninformed exhaustive search, or incompatibility with efficient attention implementations. We address these shortcomings by proposing a training-free visual cropping method, dubbed FOCUS, that leverages MLLM-internal representations to guide the search for the most relevant image region. This is accomplished in four steps: first, we identify the target object(s) in the VQA prompt; second, we compute an object relevance map using the key-value (KV) cache; third, we propose and rank relevant image regions based on the map; and finally, we perform the fine-grained VQA task using the top-ranked region. As a result of this informed search strategy, FOCUS achieves strong performance across four fine-grained VQA datasets and three types of MLLMs. It outperforms three popular visual cropping methods in both accuracy and efficie
Text-to-image (T2I) models excel on single-entity prompts but struggle with multi-entity scenes, often exhibiting attribute leakage, identity entanglement, and subject omissions. We present a principled theoretical framework that steers sampling toward multi-subject fidelity by casting flow matching (FM) as stochastic optimal control (SOC), yielding a trade-off, controlled by a single hyperparameter, between fidelity and object-centric state separation / binding consistency. Within this framework, we derive two architecture-agnostic algorithms: (i) a training-free test-time controller that perturbs the base velocity with a single-pass update, and (ii) Adjoint Matching, a lightweight fine-tuning rule that regresses a control network to a backward adjoint signal. The same formulation unifies prior attention heuristics, extends to diffusion models via a flow-diffusion correspondence, and provides the first fine-tuning route explicitly designed for multi-subject fidelity. We also introduce FOCUS (Flow Optimal Control for Unentangled Subjects), a probabilistic attention-binding objective compatible with both algorithms. Empirically, on Stable Diffusion 3.5 and FLUX.1, both algori
Vision language models (VLMs) have achieved impressive performance across a variety of computer vision tasks. However, the multimodal reasoning capability has not been fully explored in existing models. In this paper, we propose a Chain-of-Focus (CoF) method that allows VLMs to perform adaptive focusing and zooming in on key image regions based on obtained visual cues and the given questions, achieving efficient multimodal reasoning. To enable this CoF capability, we present a two-stage training pipeline, including supervised fine-tuning (SFT) and reinforcement learning (RL). In the SFT stage, we construct the MM-CoF dataset, comprising 3K samples derived from a visual agent designed to adaptively identify key regions to solve visual tasks with different image resolutions and questions. We use MM-CoF to fine-tune the Qwen2.5-VL model for cold start. In the RL stage, we use outcome accuracy and output format as rewards to update the Qwen2.5-VL model, which further refines the model's search and reasoning strategy without human priors. Our model achieves significant improvements on multiple benchmarks. On the V* benchmark that requires strong visual reasoning capability, o
Using model weights pretrained on a high-resource language as a warm start can reduce the need for data and compute to obtain high-quality language models for other, especially low-resource, languages. However, if we want to use a new tokenizer specialized for the target language, we cannot transfer the source model's embedding matrix. In this paper, we propose FOCUS - Fast Overlapping Token Combinations Using Sparsemax, a novel embedding initialization method that initializes the embedding matrix effectively for a new tokenizer based on information in the source model's embedding matrix. FOCUS represents newly added tokens as combinations of tokens in the overlap of the source and target vocabularies. The overlapping tokens are selected based on semantic similarity in an auxiliary static token embedding space. We focus our study on using the multilingual XLM-R as a source model and empirically show that FOCUS outperforms random initialization and previous work in language modeling and on a range of downstream tasks (NLI, QA, and NER).
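The combination step described in this abstract can be illustrated with a minimal sketch. It assumes that similarity scores between a new token and the overlapping tokens are already available (how they are computed in the auxiliary embedding space is not shown), and the function names `sparsemax` and `init_embedding` are hypothetical, not taken from the paper's code. Sparsemax itself is the standard projection onto the probability simplex, which sets low-scoring entries to exactly zero.

```python
def sparsemax(scores):
    """Project a list of scores onto the probability simplex (sparsemax)."""
    z = sorted(scores, reverse=True)
    cumsum = 0.0
    tau = 0.0
    for k, zk in enumerate(z, start=1):
        cumsum += zk
        # The support condition holds for a prefix of the sorted scores;
        # the last k that satisfies it determines the threshold tau.
        if 1 + k * zk > cumsum:
            tau = (cumsum - 1) / k
    return [max(s - tau, 0.0) for s in scores]


def init_embedding(similarities, overlap_embeddings):
    """Hypothetical helper: build a new token's embedding as a sparsemax-weighted
    combination of the source-model embeddings of overlapping tokens.

    similarities[i] is the (assumed precomputed) similarity between the new
    token and overlapping token i; overlap_embeddings[i] is that token's
    embedding in the source model.
    """
    weights = sparsemax(similarities)
    dim = len(overlap_embeddings[0])
    new_vec = [0.0] * dim
    for w, emb in zip(weights, overlap_embeddings):
        if w == 0.0:
            continue  # sparsemax drops dissimilar tokens entirely
        for d in range(dim):
            new_vec[d] += w * emb[d]
    return new_vec


if __name__ == "__main__":
    sims = [1.0, 0.5, -1.0]
    embs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(sparsemax(sims))             # weights sum to 1, last one is zero
    print(init_embedding(sims, embs))  # mixture of the first two embeddings
```

Unlike softmax, sparsemax gives dissimilar overlapping tokens a weight of exactly zero, so each new embedding depends on only a few source embeddings.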
We report the preliminary measurement by the FOCUS Collaboration (E831 at Fermilab) of masses and widths of the L=1 charm mesons D_2^{*0} and D_2^{*+}.
Topological quantum computation started as a niche area of research aimed at employing particles with exotic statistics, called anyons, for performing quantum computation. Soon it evolved to include a wide variety of disciplines. Advances in the understanding of anyon properties inspired new quantum algorithms and helped in the characterisation of topological phases of matter and their experimental realisation. The conceptual appeal of topological systems as well as their promise for building fault-tolerant quantum technologies fuelled the fascination with this field. This 'focus on' brings together several of the latest developments in the field and facilitates the synergy between different approaches.
Artificial Intelligence (AI) - the phenomenon of machines being able to solve problems that require human intelligence - has in the past decade seen an enormous rise of interest due to significant advances in effectiveness and use. The health sector, one of the most important sectors for societies and economies worldwide, is particularly interesting for AI applications, given the ongoing digitalisation of all types of health information. The potential for AI assistance in the health domain is immense, because AI can support medical decision making at reduced costs, everywhere. However, due to the complexity of AI algorithms, it is difficult to distinguish good from bad AI-based solutions and to understand their strengths and weaknesses, which is crucial for clarifying responsibilities and for building trust. For this reason, the International Telecommunication Union (ITU) has established a new Focus Group on "Artificial Intelligence for Health" (FG-AI4H) in partnership with the World Health Organization (WHO). Health and care services are usually the responsibility of a government - even when provided through private insurance systems - and thus under the responsibility of WHO/ITU
Shortcuts to Adiabaticity (STA) constitute driving schemes that provide an alternative to adiabatic protocols to control and guide the dynamics of classical and quantum systems without the requirement of slow driving. Research on STA advances swiftly with theoretical progress being accompanied by experiments on a wide variety of platforms. We summarize recent developments emphasizing advances reported in this focus issue while providing an outlook with open problems and prospects for future research.
Multicomponent superconductivity is a novel quantum phenomenon in many different superconducting materials, such as multiband ones in which different superconducting gaps open in different Fermi surfaces, films engineered at the atomic scale to enter the quantum confined regime, multilayers, two-dimensional electron gases at the oxide interfaces, and complex materials in which different electronic orbitals or different carriers participate in the formation of the superconducting condensate. In all these systems the increased number of degrees of freedom of the multicomponent superconducting wave-function allows for emergent quantum effects that are otherwise unattainable in single-component superconductors. In this editorial paper we introduce the present focus issue, exploring the complex but fascinating physics of multicomponent superconductivity.
Quantum memories are essential for quantum information processing and long-distance quantum communication. The field has recently seen a lot of progress, and the present focus issue offers a glimpse of these developments, showing both experimental and theoretical results from many of the leading groups around the world. On the experimental side, it shows work on cold gases, warm vapors, rare-earth ion doped crystals and single atoms. On the theoretical side there are in-depth studies of existing memory protocols, proposals for new protocols including approaches based on quantum error correction, and proposals for new applications of quantum storage. Looking forward, we anticipate many more exciting results in this area.
Despite the spectacular achievements of molecular biology in the second half of the twentieth century and the crucial advances it permitted in cancer research, the fight against cancer has brought some disappointments. It is now increasingly apparent that getting a global picture of the very diverse and interlinked aspects of cancer development requires, in synergy with these achievements, other perspectives and investigative tools. In this undertaking, multidisciplinary approaches that include quantitative sciences in general and physics in particular play a crucial role. This 'focus on' collection contains 19 articles representative of the diversity and state of the art of the contributions that physics can bring to the field of cancer research.
We describe a silicon microstrip detector interleaved with segments of a beryllium oxide target which was used in the FOCUS photoproduction experiment at Fermilab. The detector was designed to improve the vertex resolution and to enhance the reconstruction efficiency of short-lived charm particles.
We report on a direct measurement of the mixing parameter y=(3.42+-1.39+-0.74)% in the D0-D0bar system by measuring the lifetime difference between the CP mixed final state K^+pi^- and the CP even state K^+K^-. We also present a study of the wrong-sign decay D0->K^+pi^- based on a sample of 149+-31 observed events compared to 36760+-195 events observed in the Cabibbo favored channel D0->K^-pi^+. The observed branching ratio R=(0.404+-0.085+-0.025)% is used to obtain limits on the mixing parameters x' and y' and the doubly Cabibbo suppressed branching ratio, R_DCS. These studies are based on a large sample of photoproduced charm mesons from the FOCUS experiment at Fermilab (FNAL-E831).
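As a rough reading aid for the numbers quoted in this abstract, the short worked arithmetic below makes three assumptions that the abstract does not state: that y is obtained from the lifetime ratio in the form shown on the first line, that the statistical and systematic uncertainties are uncorrelated and can be combined in quadrature, and that the raw event-yield ratio is a fair proxy for the quoted branching ratio (any efficiency correction is ignored).

```latex
\begin{align*}
y &= \frac{\tau(D^0 \to K\pi)}{\tau(D^0 \to K^+K^-)} - 1
  && \text{(assumed relation, not given in the abstract)} \\
\sigma_y &= \sqrt{1.39^2 + 0.74^2}\,\% = \sqrt{2.4797}\,\% \approx 1.57\,\%
  && \text{(assumes uncorrelated errors)} \\
\frac{149}{36760} &\approx 0.405\,\%, \qquad \frac{31}{149} \approx 21\,\%
  && \text{(raw yield ratio and its relative statistical error)}
\end{align*}
```

Under these assumptions the raw yield ratio is close to the quoted R = 0.404%, and its relative statistical error matches 0.085/0.404, which is also about 21%.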
We describe the algorithm used to identify charged tracks in the fixed-target charm-photoproduction experiment FOCUS.
We present a four-body semileptonic charm decay D^+ into K^-pi^+mu^+nu analysis in the range of 0.65 GeV/c^2 < mkpi < 1.5 GeV/c^2. We observe a low mass scalar contribution of 5.30 +- 0.74 + 0.99 - 0.96 % with respect to the total D^+ into K^-pi^+mu^+nu decay, compatible with the phase shift found by the LASS elastic scattering experiment. For the K*(892) resonance, we obtain a mass of 895.41 +- 0.32 + 0.35 - 0.43 MeV/c^2, a width of 47.79 +- 0.86 + 1.32 - 1.06 MeV/c^2, and a Blatt-Weisskopf damping factor parameter of 3.96 +- 0.54 + 1.31 - 0.90 GeV^(-1). We also report 90% CL upper limits of 4% and 0.64% for the branching ratios Gamma(D^+ into K*(1680)mu^+nu)/Gamma(D^+ into K^-pi^+mu^+nu) and Gamma(D^+ into K*_0(1430)mu^+nu)/Gamma(D^+ into K^-pi^+mu^+nu), respectively.
Granular materials are complex multi-particle ensembles in which macroscopic properties are largely determined by inter-particle interactions between their numerous constituents. In order to understand and to predict their macroscopic physical behavior, it is necessary to analyze the composition and interactions at the level of individual contacts and grains. To do so requires the ability to image individual particles and their local configurations to high precision. A variety of competing and complementary imaging techniques have been developed for that task. In this introductory paper accompanying the Focus Issue, we provide an overview of these imaging methods and discuss their advantages and drawbacks, as well as their limits of application.
We describe the various techniques developed in the Fermilab Wideband Experiments, E687 and FOCUS, to reconstruct long-lived states. The techniques all involve modifications to standard tracking techniques and are useful to report for future experiments.
Using data collected by the high energy photoproduction experiment FOCUS at Fermilab we performed a Dalitz plot analysis of the Cabibbo favored decay D+ to K-pi+ pi+. This study uses 53653 Dalitz-plot events with a signal fraction of ~ 97%, and represents the highest statistics, most complete Dalitz plot analysis for this channel. Results are presented and discussed using two different formalisms. The first is a simple sum of Breit-Wigner functions with freely fitted masses and widths. It is the model traditionally adopted and serves as a comparison with the already published analyses. The second uses a K-matrix approach for the dominant S-wave, in which the parameters are fixed by first fitting Kpi scattering data and continued to threshold by Chiral Perturbation Theory. We show that the Dalitz plot distribution for this decay is consistent with the assumption of two-body dominance of the final state interactions and that the description of these interactions is in agreement with other data on the Kpi final state.
Using a high statistics sample of photo-produced charm particles from the FOCUS experiment at Fermilab, we report on the measurement of the ratio of semileptonic rates Γ(D+ > ANTI-K pi mu+ nu)/Γ(D+ > ANTI-K0 mu+ nu)= 0.625 +/- 0.045 +/- 0.034. Allowing for the K pi S-wave interference measured previously by FOCUS, we extract the vector to pseudoscalar ratio Γ(D+ > ANTI-K*0 mu+ nu)/Γ(D+ > ANTI-K0 mu+ nu)= 0.594 +/- 0.043 +/- 0.033 and the ratio Γ(D+ > ANTI-K0 mu+ nu)/Γ(D+ > K- pi+ pi+)= 1.019 +/- 0.076 +/- 0.065. Our results show a lower ratio for Γ(D > K* l nu)/Γ(D > K l nu) than has been reported recently and indicate the current world average branching fractions for the decays D+ > ANTI-K0(mu+, e+) nu are low. Using the PDG world average for B(D+ > K- pi+ pi+) we extract B(D+ > ANTI-K0 mu+ nu)=(9.27 +/- 0.69 +/- 0.59 +/- 0.61)%.
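The last step of this abstract is a simple rescaling, which the worked arithmetic below makes explicit. It uses only numbers quoted above. The PDG input value is not stated in the abstract, so the value shown is merely the one implied by the quoted results, and attributing the third uncertainty to that input is an assumption.

```latex
\begin{align*}
\mathcal{B}(D^+ \to \bar{K}^0 \mu^+ \nu)
  &= \frac{\Gamma(D^+ \to \bar{K}^0 \mu^+ \nu)}{\Gamma(D^+ \to K^-\pi^+\pi^+)}
     \times \mathcal{B}(D^+ \to K^-\pi^+\pi^+) \\
\text{implied input:}\quad
  \mathcal{B}(D^+ \to K^-\pi^+\pi^+) &\approx \frac{9.27\,\%}{1.019} \approx 9.10\,\% \\
\text{statistical:}\quad
  \frac{0.076}{1.019} \times 9.27\,\% &\approx 0.69\,\% \\
\text{systematic:}\quad
  \frac{0.065}{1.019} \times 9.27\,\% &\approx 0.59\,\%
\end{align*}
```

The first two quoted uncertainties are thus the statistical and systematic errors of the measured ratio propagated as relative errors. The third, 0.61%, would then come from the uncertainty on the world-average normalisation.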