Search - ResearchTracker

Autonomous medical robots hold promise to improve patient outcomes, reduce provider workload, democratize access to care, and enable superhuman precision. However, autonomous medical robotics has been limited by a fundamental data problem: existing medical robotic datasets are small, single-embodiment, and rarely shared openly, restricting the development of the foundation models that the field needs to advance. We introduce Open-H-Embodiment, the largest open dataset of medical robotic video with synchronized kinematics to date, spanning more than 50 institutions and multiple robotic platforms including the CMR Versius, Intuitive Surgical's da Vinci, da Vinci Research Kit (dVRK), Rob Surgical BiTrack, Virtual Incision's MIRA, Moon Surgical Maestro, and a variety of custom systems, and covering surgical manipulation, robotic ultrasound, and endoscopy procedures. We demonstrate the research enabled by this dataset through two foundation models. GR00T-H is the first open foundation vision-language-action model for medical robotics and the only evaluated model to achieve full end-to-end task completion on a structured suturing benchmark (25% of trials vs. 0% for all others) and achieves

OpenMedQ: Broad Open Pretraining for Medical Vision-Language Models

arXiv, 2026-06-11. Authors: Ibrahim Gulluk, Max Van Puyvelde, Olivier Gevaert

We present OpenMedQ, a medical vision-language model pretrained on the broadest fully-open medical mix to date: 14 datasets totaling ~3.35M pretraining samples spanning pathology, radiology, microscopy, and text-only clinical QA. OpenMedQ reaches state-of-the-art BLEU-1 on PathVQA (75.9), beating Med-PaLM M variants up to 562B parameters (~80x larger), and matches the best reported VQA-MED BLEU-1 (64.5). Its vision encoder, transferred to 8 unseen medical classification benchmarks under an identical downstream recipe, obtains the highest average macro-F1 (0.757) among BiomedCLIP (0.745), PMC-CLIP (0.745), PubMedCLIP (0.746), and a from-scratch baseline (0.616). We release our code and a publicly available interactive demo as a reproducible baseline for the community.

Search results: SAGE open medical case reports

Open-H-Embodiment: A Large-Scale Dataset for Enabling Foundation Models in Medical Robotics

OpenMedQ: Broad Open Pretraining for Medical Vision-Language Models

Named Entities in Medical Case Reports: Corpus and Experiments

Open Challenges on Fairness of Artificial Intelligence in Medical Imaging Applications

MIRAGE: Retrieval and Generation of Multimodal Images and Texts for Medical Education

Medical Knowledge Intervention Prompt Tuning for Medical Image Classification

Longitudinal assessment of demographic representativeness in the Medical Imaging and Data Resource Center Open Data Commons

SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

SAGE: Shape-Adapting Gated Experts for Adaptive Histopathology Image Segmentation

Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation

SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

Empirical Analysis of Zipf's Law, Power Law, and Lognormal Distributions in Medical Discharge Reports

ADP-FL-MedSeg: Adaptive Differential Privacy for Federated Medical Segmentation Across Diverse Modalities

GAN-GA: A Generative Model based on Genetic Algorithm for Medical Image Generation

Medical Report Generation Is A Multi-label Classification Problem

Introduction of Medical Imaging Modalities

Fréchet Radiomic Distance (FRD): A Versatile Metric for Comparing Medical Imaging Datasets

Test-time generative augmentation for medical image segmentation

Synthetic Medical Imaging Generation with Generative Adversarial Networks For Plain Radiographs

Caveats in Generating Medical Imaging Labels from Radiology Reports