Search - ResearchTracker

The Congress of Neurological Surgeons Self-Assessment for Neurological Surgeons (CNS-SANS) questions are widely used by neurosurgical residents to prepare for written board examinations. Recently, these questions have also served as benchmarks for evaluating large language models' (LLMs) neurosurgical knowledge. This study aims to assess the performance of state-of-the-art LLMs on neurosurgery board-like questions and to evaluate their robustness to the inclusion of distractor statements. A comprehensive evaluation was conducted using 28 large language models. These models were tested on 2,904 neurosurgery board examination questions derived from the CNS-SANS. Additionally, the study introduced a distraction framework to assess the fragility of these models. The framework incorporated simple, irrelevant distractor statements containing polysemous words with clinical meanings used in non-clinical contexts to determine the extent to which such distractions degrade model performance on standard medical benchmarks. 6 of the 28 tested LLMs achieved board-passing outcomes, with the top-performing models scoring over 15.7% above the passing threshold. When exposed to distractions, accurac

Surgeons vs. Computer Vision: A comparative analysis on surgical phase recognition capabilities

arXiv, 2025-04-26. Authors: Marco Mezzina, Pieter De Backer, Tom Vercauteren

Purpose: Automated Surgical Phase Recognition (SPR) uses Artificial Intelligence (AI) to segment the surgical workflow into its key events, functioning as a building block for efficient video review, surgical education as well as skill assessment. Previous research has focused on short and linear surgical procedures and has not explored if temporal context influences experts' ability to better classify surgical phases. This research addresses these gaps, focusing on Robot-Assisted Partial Nephrectomy (RAPN) as a highly non-linear procedure. Methods: Urologists of varying expertise were grouped and tasked to indicate the surgical phase for RAPN on both single frames and video snippets using a custom-made web platform. Participants reported their confidence levels and the visual landmarks used in their decision-making. AI architectures without and with temporal context as trained and benchmarked on the Cholec80 dataset were subsequently trained on this RAPN dataset. Results: Video snippets and presence of specific visual landmarks improved phase classification accuracy across all groups. Surgeons displayed high confidence in their classifications and outperformed novices, who struggl

Search results: Surgeons

Evaluating the performance and fragility of large language models on the self-assessment for neurological surgeons

Surgeons vs. Computer Vision: A comparative analysis on surgical phase recognition capabilities

Endoshare: A Publicly Available, Surgeons-Friendly Solution to De-Identify and Manage Surgical Videos

How Far Are Surgeons from Surgical World Models? A Pilot Study on Zero-shot Surgical Video Generation with Expert Assessment

Surgeons Awareness, Expectations, and Involvement with Artificial Intelligence: a Survey Pre and Post the GPT Era

Looking Together $\neq$ Seeing the Same Thing: Understanding Surgeons' Visual Needs During Intra-operative Coordination and Instruction

VS-Assistant: Versatile Surgery Assistant on the Demand of Surgeons

Surgeons Are Indian Males and Speech Therapists Are White Females: Auditing Biases in Vision-Language Models for Healthcare Professionals

States of confusion: Eye and Head tracking reveal surgeons' confusion during arthroscopic surgery

Expert Consensus on Criteria for the Automated Assessment of Laparoscopic Camera Navigation

Impact of Extended Reality on Robot-Assisted Surgery Training

NAVIUS: Navigated Augmented Reality Visualization for Ureteroscopic Surgery

A vision-language model and platform for temporally mapping surgery from video

A Bilevel Approach to Integrated Surgeon Scheduling and Surgery Planning solved via Branch-and-Price

Disturbance-Free Surgical Video Generation from Multi-Camera Shadowless Lamps for Open Surgery

Unraveling the Connection: How Cognitive Workload Shapes Intent Recognition in Robot-Assisted Surgery

Video-Based Detection and Analysis of Errors in Robotic Surgical Training

SVD-Surgeon: Optimal Singular-Value Surgery for Large Language Model Compression

Multi-User Mobile Augmented Reality for Cardiovascular Surgical Planning

Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning