Search - ResearchTracker

Artificial intelligence has advanced rapidly in biomedicine through large-scale multimodal data integration, enabling increasingly accurate prediction of clinical outcomes and patient stratification. These systems, however, remain fundamentally observational: they learn statistical associations from historical data and operate within previously observed biological and clinical states, limiting their ability to generalize to novel therapies or unobserved interventions. We argue that AI in biomedicine is undergoing a structural transition. As biomedical decision-making increasingly depends on reasoning about intervention rather than extrapolation from past observations, predictive architectures become structurally insufficient. Systems that learn from historical data cannot, by construction, represent how biological systems evolve under perturbation, and therefore cannot reliably support decision-making in the presence of novel interventions. We introduce a conceptual framework distinguishing observational and interventional intelligence and define disease-level models as systems that explicitly represent the state, dynamics, and intervention response of biological processes. These m

CRAB: A Benchmark for Evaluating Curation of Retrieval-Augmented LLMs in Biomedicine

arXiv, 2025-04-15. Authors: Hanmeng Zhong, Linqing Chen, Wentao Wu

Recent developments in Retrieval-Augmented Large Language Models (LLMs) have shown great promise in biomedical applications. However, a critical gap persists in reliably evaluating their curation ability, the process by which models select and integrate relevant references while filtering out noise. To address this, we introduce the benchmark for Curation of Retrieval-Augmented LLMs in Biomedicine (CRAB), the first multilingual benchmark tailored for evaluating the biomedical curation of retrieval-augmented LLMs, available in English, French, German and Chinese. By incorporating a novel citation-based evaluation metric, CRAB quantifies the curation performance of retrieval-augmented LLMs in biomedicine. Experimental results reveal significant discrepancies in the curation performance of mainstream LLMs, underscoring the urgent need to improve it in the domain of biomedicine. Our dataset is available at https://huggingface.co/datasets/zhm0/CRAB.

Search results: Biomedicine

From Prediction to Intervention: The Evolution of AI in Biomedicine

CRAB: A Benchmark for Evaluating Curation of Retrieval-Augmented LLMs in Biomedicine

The role of neuromorphic principles in the future of biomedicine and healthcare

Magnetosomes in Nature, Biomedicine and Physics

Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications

Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine

An Interpretable AI framework Quantifying Traditional Chinese Medicine Principles Towards Enhancing and Integrating with Modern Biomedicine

Foundation Model in Biomedicine

A Survey for Large Language Models in Biomedicine

RRD-Bio: Building An Integrated Research Resource Database for Biomedicine

Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health

AI for Biomedicine in the Era of Large Language Models

UltraMedical: Building Specialized Generalists in Biomedicine

BioMedGPT: Open Multimodal Generative Pre-trained Transformer for BioMedicine

Advancing High Resolution Vision-Language Models in Biomedicine

A survey of recent methods for addressing AI fairness and bias in biomedicine

A Refer-and-Ground Multimodal Large Language Model for Biomedicine

BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine

Recipes for calibration and validation of agent-based models in cancer biomedicine

Advancing Biomedicine with Graph Representation Learning: Recent Progress, Challenges, and Future Directions