Search - ResearchTracker

The paucity of medical data severely limits the generalizability of diagnostic ML models, as the full spectrum of disease variability cannot be represented by a small clinical dataset. To address this, diffusion models (DMs) have been considered a promising avenue for synthetic image generation and augmentation. However, they frequently produce medically inaccurate images, which degrades model performance. Expert domain knowledge is critical for synthesizing images that correctly encode clinical information, especially when data is scarce and quality outweighs quantity. Existing approaches for incorporating human feedback, such as reinforcement learning (RL) and Direct Preference Optimization (DPO), rely on robust reward functions or demand labor-intensive expert evaluations. Recent progress in Multimodal Large Language Models (MLLMs) reveals their strong visual reasoning capabilities, making them well-suited candidates as evaluators. In this work, we propose a novel framework, coined MAGIC (Medically Accurate Generation of Images through AI-Expert Collaboration), that synthesizes clinically accurate skin disease images for data augmentation. Our method creatively translates expert-de

A Scoping Review of Natural Language Processing in Addressing Medically Inaccurate Information: Errors, Misinformation, and Hallucination

arXiv, 2025-04-16. Authors: Zhaoyi Sun, Wen-Wai Yim, Ozlem Uzuner

Objective: This review aims to explore the potential and challenges of using Natural Language Processing (NLP) to detect, correct, and mitigate medically inaccurate information, including errors, misinformation, and hallucination. By unifying these concepts, the review emphasizes their shared methodological foundations and their distinct implications for healthcare. Our goal is to advance patient safety, improve public health communication, and support the development of more reliable and transparent NLP applications in healthcare. Methods: A scoping review was conducted following PRISMA guidelines, analyzing studies from 2020 to 2024 across five databases. Studies were selected based on their use of NLP to address medically inaccurate information and were categorized by topic, tasks, document types, datasets, models, and evaluation metrics. Results: NLP has shown potential in addressing medically inaccurate information in the following tasks: (1) error detection, (2) error correction, (3) misinformation detection, (4) misinformation correction, (5) hallucination detection, and (6) hallucination mitigation. However, challenges remain with data privacy, context dependency, and evaluation sta

Search results: Medically

Doctor Approved: Generating Medically Accurate Skin Disease Images through AI-Expert Feedback

A Scoping Review of Natural Language Processing in Addressing Medically Inaccurate Information: Errors, Misinformation, and Hallucination

On the notion of missingness for path attribution explainability methods in medical settings: Guiding the selection of medically meaningful baselines

The Need for Medically Aware Video Compression in Gastroenterology

Generating medically-accurate summaries of patient-provider dialogue: A multi-stage approach using large language models

CARE: A Conformal Safety Layer for Medical Summarization

On the Cone Effect and Modality Gap in Medical Vision-Language Embeddings

TopoCL: Topological Contrastive Learning for Medical Imaging

MedicalBench: Evaluating Large Language Models Toward Improved Medical Concept Extraction

MIRA: A Bilingual Benchmark for Medical Information Response Audit

Improving Medical Visual Reinforcement Fine-Tuning via Perception and Reasoning Augmentation

Beyond Idealized Patients: Evaluating LLMs under Challenging Patient Behaviors in Medical Consultations

Medical Triage as Pairwise Ranking: A Benchmark for Urgency in Patient Portal Messages

ProtoTopic: Prototypical Network for Few-Shot Medical Topic Modeling

Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation

Enhancing Medical Dialogue Generation through Knowledge Refinement and Dynamic Prompt Adjustment

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

Structure Causal Models and LLMs Integration in Medical Visual Question Answering

HIVMedQA: Benchmarking large language models for HIV medical decision support

Medical Knowledge Intervention Prompt Tuning for Medical Image Classification