搜索 — ResearchTracker

Multimodal large language models (MLLMs) have recently achieved remarkable progress in radiology by integrating visual perception with natural language understanding. However, they often generate clinically unsupported descriptions, known as medical hallucinations, which pose serious risks in medical applications that demand accuracy and image-grounded outputs. Through empirical analysis, we find that prompt-induced hallucinations remain prevalent in radiology MLLMs, largely due to over-sensitivity to clinical sections. To address this, we introduce Clinical Contrastive Decoding (CCD), a training-free and retrieval-free inference framework that integrates structured clinical signals from task-specific radiology expert models. CCD introduces a dual-stage contrastive mechanism to refine token-level logits during generation, thereby enhancing clinical fidelity without modifying the base MLLM. Experiments on three datasets and multiple models demonstrate that CCD consistently improves overall performance on radiology report generation (RRG). On the MIMIC-CXR dataset, it yields up to a 17% improvement in RadGraph-F1 when applied to state-of-the-art RRG models. Our approach provides a li

CCS: Clinical Consensus Selection for Radiology Report Generation

arXiv2026-05-28作者：Xi Zhang, Yingshu Li, Zaiqiao Meng

Radiology report generation (RRG) is commonly formulated as a single-path generation task, where a multimodal large language model (MLLM) produces one decoded report as the final output. While recent progress has largely been driven by scaling training data, model capacity, and retrieval mechanisms, improving report quality at inference time remains underexplored. In this work, we observe that fixed radiology MLLMs often generate clinically stronger reports elsewhere in their candidate pool than the one selected by default decoding, suggesting that inference-time decision making remains an overlooked bottleneck. To address this, we propose Clinical Consensus Selection (CCS), a decoder-agnostic inference-time selection framework that samples multiple candidate reports and selects the one with the highest clinical consensus across the rollout pool. CCS unifies text-based utilities with a radiology-adapted utility computed by an image--report-trained multimodal embedder, which measures candidate agreement beyond surface-level textual similarity. Across three datasets and multiple radiology MLLMs, CCS consistently improves inference-time performance over single-path decoding and generi

搜索结果：Clinical radiology

CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive Decoding

CCS: Clinical Consensus Selection for Radiology Report Generation

MARCH: Multi-Agent Radiology Clinical Hierarchy for CT Report Generation

Automated Structured Radiology Report Generation with Rich Clinical Context

Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

Clinically Grounded Agent-based Report Evaluation: An Interpretable Metric for Radiology Report Generation

RadSEM: A Finding-by-Finding Metric for Clinical Consistency in Radiology Reports

CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation

Towards Virtual Clinical Trials of Radiology AI with Conditional Generative Modeling

CRG Score: A Distribution-Aware Clinical Metric for Radiology Report Generation

BTReport: A Framework for Brain Tumor Radiology Report Generation with Clinically Relevant Features

Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology

Improving VTE Identification through Adaptive NLP Model Selection and Clinical Expert Rule-based Classifier from Radiology Reports

ReXErr: Synthesizing Clinically Meaningful Errors in Diagnostic Radiology Reports

Dual-Modal Lung Cancer AI: Interpretable Radiology and Microscopy with Clinical Risk Integration

Modeling Clinical Uncertainty in Radiology Reports: from Explicit Uncertainty Markers to Implicit Reasoning Pathways

Evidence-Linked Radiology Reporting: A Human-Supervised Reference Architecture for Structured Imaging Intelligence

Grounding Clinical AI Competency in Human Cognition Through the Clinical World Model and Skill-Mix Framework

From Clinical Intent to Clinical Model: Autonomous Coding-Agents for Clinician-driven AI Development

Before the Labels: How Dataset Construction Shapes Suicidality Detection in Clinical Text