Search - ResearchTracker

Strabismus is a common ocular disorder that requires fine-grained subtype diagnosis for individualized treatment planning. However, existing deep learning methods mainly provide diagnostic predictions without transparent reasoning, while recent large vision-language models (LVLMs), although promising for joint image understanding and report generation, remain highly prone to hallucination in this evidence-sensitive and rule-driven medical task. To address these challenges, we propose MAGIS, an evidence-based Multi-AGent reasoning for Interpretable Strabismus diagnosis framework. MAGIS transforms black-box end-to-end generation into a structured diagnostic process consisting of candidate hypothesis generation, dual-evidence constrained context, evidence-based corrective verification, and report generation. Specifically, we introduce a Dual-Evidence Constrained Context (DECC) mechanism that jointly organizes visual evidence from the photograph of the nine cardinal positions of gaze and evidence-based clinical diagnostic rules into a constrained context for reliable diagnostic reasoning. We further develop an Evidence-Based Corrective Verification (EBCV) mechanism that verifies whethe

Dialogue to Question Generation for Evidence-based Medical Guideline Agent Development

arXiv | 2026-03-25 | Authors: Zongliang Ji, Ziyang Zhang, Xincheng Tan

Evidence-based medicine (EBM) is central to high-quality care, but remains difficult to implement in fast-paced primary care settings. Physicians face short consultations, increasing patient loads, and lengthy guideline documents that are impractical to consult in real time. To address this gap, we investigate the feasibility of using large language models (LLMs) as ambient assistants that surface targeted, evidence-based questions during physician-patient encounters. Our study focuses on question generation rather than question answering, with the aim of scaffolding physician reasoning and integrating guideline-based practice into brief consultations. We implemented two prompting strategies, a zero-shot baseline and a multi-stage reasoning variant, using Gemini 2.5 as the backbone model. We evaluated on a benchmark of 80 de-identified transcripts from real clinical encounters, with six experienced physicians contributing over 90 hours of structured review. Results indicate that while general-purpose LLMs are not yet fully reliable, they can produce clinically meaningful and guideline-relevant questions, suggesting significant potential to reduce cognitive burden and make EBM more

Search results: evidence-based

MAGIS: Evidence-Based Multi-Agent Reasoning for Interpretable Strabismus Clinical Decision-Making

Dialogue to Question Generation for Evidence-based Medical Guideline Agent Development

Historian: Reducing Manual Validation in APR Benchmarking via Evidence-Based Assessment

From Documents to Spans: Scalable Supervision for Evidence-Based ICD Coding with LLMs

Attribution, Citation, and Quotation: A Survey of Evidence-based Text Generation with Large Language Models

Expert-Annotated Embryo Image Dataset with Natural Language Descriptions for Evidence-Based Patient Communication in IVF

READER: Robust Evidence-based Authorship Decoding via Extracted Representations

EviRank: Evidence-Based Confidence Estimation for LLM-Based Ranking

DeepER-Med: Advancing Deep Evidence-Based Research in Medicine Through Agentic AI

Evidence-Based Education and Beyond: The Critical Role of Theory in Science Education Research and Practice

Experimental Evidence-Based Sub-Rayleigh Source Discrimination

Pitfalls of Evidence-Based AI Policy

Towards an Evidence-Based Approach to Climate Policy

NeoQA: Evidence-based Question Answering with Generated News Events

Leveraging Language Models to Discover Evidence-Based Actions for OSS Sustainability

Evaluating Large Language Models for Evidence-Based Clinical Question Answering

Towards Evidence-Based Tech Hiring Pipelines

Exploring the Evidence-Based SE Beliefs of Generative AI Tools

Natural Language Processing in Support of Evidence-based Medicine: A Scoping Review

Enhancing LLM Generation with Knowledge Hypergraph for Evidence-Based Medicine