搜索 — ResearchTracker

With the rapid development of artificial intelligence (AI), large language models (LLMs) have shown strong capabilities in natural language understanding, reasoning, and generation, attracting amounts of research interest in applying LLMs to health and medicine. Critical care medicine (CCM) provides diagnosis and treatment for critically ill patients who often require intensive monitoring and interventions in intensive care units (ICUs). Can LLMs be applied to CCM? Are LLMs just like stochastic parrots or ICU experts in assisting clinical decision-making? This scoping review aims to provide a panoramic portrait of the application of LLMs in CCM. Literature in seven databases, including PubMed, Embase, Scopus, Web of Science, CINAHL, IEEE Xplore, and ACM Digital Library, were searched from January 1, 2019, to June 10, 2024. Peer-reviewed journal and conference articles that discussed the application of LLMs in critical care settings were included. From an initial 619 articles, 24 were selected for final review. This review grouped applications of LLMs in CCM into three categories: clinical decision support, medical documentation and reporting, and medical education and doctor-patien

Performance of Large Language Models in Answering Critical Care Medicine Questions

arXiv2025-09-16作者：Mahmoud Alwakeel, Aditya Nagori, An-Kwok Ian Wong

Large Language Models have been tested on medical student-level questions, but their performance in specialized fields like Critical Care Medicine (CCM) is less explored. This study evaluated Meta-Llama 3.1 models (8B and 70B parameters) on 871 CCM questions. Llama3.1:70B outperformed 8B by 30%, with 60% average accuracy. Performance varied across domains, highest in Research (68.4%) and lowest in Renal (47.9%), highlighting the need for broader future work to improve models across various subspecialty domains.

搜索结果：Critical care medicine

Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review

Performance of Large Language Models in Answering Critical Care Medicine Questions

Towards Self-Supervised Foundation Models for Critical Care Time Series

Equitable Optimization of Patient Re-allocation and Temporary Facility Placement to Maximize Critical Care System Resilience in Disasters

Differentiating hype from practical applications of large language models in medicine -- a primer for healthcare professionals

High hopes for "Deep Medicine"? AI, economics, and the future of care

Benchmarking Offline Multi-Objective Reinforcement Learning in Critical Care

Failure Modes of Time Series Interpretability Algorithms for Critical Care Applications and Potential Solutions

Towards Foundation Models for Critical Care Time Series

Redistributing Voice and Responsibility: AI in Relationship-Centred Care

Final Report for the Workshop on Robotics &amp; AI in Medicine

Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care

The promise and perils of AI in medicine

Benchmarking machine learning models on multi-centre eICU critical care dataset

Advancing clinical trial outcomes using deep learning and predictive modelling: bridging precision medicine and patient-centered care

CARE: Controlling LLM-Generated Policies through Auditable Review of Evidence in Scientific Experimentation

AI-CARE: Carbon-Aware Reporting Evaluation Metric for AI Models

Multimodal Clinical Benchmark for Emergency Care (MC-BEC): A Comprehensive Benchmark for Evaluating Foundation Models in Emergency Medicine

Model Medicine: A Clinical Framework for Understanding, Diagnosing, and Treating AI Models

Imagining Better AI-Enabled Healthcare Futures: The Case for Care by Design

Final Report for the Workshop on Robotics & AI in Medicine