搜索 — ResearchTracker

Background: Artificial intelligence (AI) has emerged as a disruptive innovation in medicine, yet its adoption within gastroenterology remains limited and poorly characterized. We aimed to examine knowledge, practical applications, perceived barriers, and expectations regarding AI among gastroenterology specialists in Spain. Methods: We conducted a cross-sectional observational study using a structured online survey distributed by the Spanish Society of Digestive Pathology (SEPD) in 2025. The questionnaire collected sociodemographic data, patterns of AI use, perceptions, and educational needs. Descriptive statistics and multivariable models were applied. Results: Among 283 respondents (mean age 44.6 +/- 9.7 years), 87.5% acknowledged AI as a transformative tool, but only 60.2% (95% CI: 54.3-66.1%) reported using it, mostly outside institutional frameworks. Notably, 80.2% of users initiated AI use within the past year. Independent predictors of frequent use included previous training (OR=2.44), employment in university hospitals (OR=2.14), and younger age (OR=1.36 per 5-year decrease). Main barriers were lack of training (61%), absence of institutional strategies (46%), and ethical c

Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models

arXiv2025-03-24作者：Nariman Naderi, Seyed Amir Ahmad Safavi-Naini, Thomas Savage

This study evaluated self-reported response certainty across several large language models (GPT, Claude, Llama, Phi, Mistral, Gemini, Gemma, and Qwen) using 300 gastroenterology board-style questions. The highest-performing models (GPT-o1 preview, GPT-4o, and Claude-3.5-Sonnet) achieved Brier scores of 0.15-0.2 and AUROC of 0.6. Although newer models demonstrated improved performance, all exhibited a consistent tendency towards overconfidence. Uncertainty estimation presents a significant challenge to the safe use of LLMs in healthcare. Keywords: Large Language Models; Confidence Elicitation; Artificial Intelligence; Gastroenterology; Uncertainty Quantification

搜索结果：Gastroenterology

Artificial Intelligence in Spanish Gastroenterology: high expectations, limited integration. A national survey

Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models

Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models

The Need for Medically Aware Video Compression in Gastroenterology

Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA

A Two-Stage Deep Learning Framework for Segmentation of Ten Gastrointestinal Organs from Coronal MR Enterography

GI-Bench: A Panoramic Benchmark Revealing the Knowledge-Experience Dissociation of Multimodal Large Language Models in Gastrointestinal Endoscopy Against Clinical Standards

PrefPaint: Enhancing Medical Image Inpainting through Expert Human Feedback

Prompt Triage: Structured Optimization Enhances Vision-Language Model Performance on Medical Imaging Benchmarks

47B Mixture-of-Experts Beats 671B Dense Models on Chinese Medical Examinations

Exploring Efficiency Frontiers of Thinking Budget in Medical Reasoning: Scaling Laws between Computational Resources and Reasoning Quality

One VLM, Two Roles: Stage-Wise Routing and Specialty-Level Deployment for Clinical Workflows

A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models

Integrating Deep Feature Extraction and Hybrid ResNet-DenseNet Model for Multi-Class Abnormality Detection in Endoscopic Images

The Application of ChatGPT in Responding to Questions Related to the Boston Bowel Preparation Scale

AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities

Hard-Attention Gates with Gradient Routing for Endoscopic Image Computing

ERCPMP: An Endoscopic Image and Video Dataset for Colorectal Polyps Morphology and Pathology

Precise localization within the GI tract by combining classification of CNNs and time-series analysis of HMMs

Evaluating clinical diversity and plausibility of synthetic capsule endoscopic images