搜索 — ResearchTracker

Background We aim to use Natural Language Processing (NLP) to automate the extraction and classification of thyroid cancer risk factors from pathology reports. Methods We analyzed 1,410 surgical pathology reports from adult papillary thyroid cancer patients at Mayo Clinic, Rochester, MN, from 2010 to 2019. Structured and non-structured reports were used to create a consensus-based ground truth dictionary and categorized them into modified recurrence risk levels. Non-structured reports were narrative, while structured reports followed standardized formats. We then developed ThyroPath, a rule-based NLP pipeline, to extract and classify thyroid cancer features into risk categories. Training involved 225 reports (150 structured, 75 unstructured), with testing on 170 reports (120 structured, 50 unstructured) for evaluation. The pipeline's performance was assessed using both strict and lenient criteria for accuracy, precision, recall, and F1-score. Results In extraction tasks, ThyroPath achieved overall strict F-1 scores of 93% for structured reports and 90 for unstructured reports, covering 18 thyroid cancer pathology features. In classification tasks, ThyroPath-extracted information de

SurgPub-Video: A Comprehensive Surgical Video Dataset for Enhanced Surgical Intelligence in Vision-Language Model

arXiv2025-08-12作者：Yaoqian Li, Xikai Yang, Dunyuan Xu

Vision-Language Models (VLMs) have shown significant potential in surgical scene analysis, yet existing models are limited by frame-level datasets and lack high-quality video data with procedural surgical knowledge. To address these challenges, we make the following contributions: (i) SurgPub-Video, a comprehensive dataset of over 3,000 surgical videos and 25 million annotated frames across 11 specialties, sourced from peer-reviewed clinical journals, (ii) SurgLLaVA-Video, a specialized VLM for surgical video understanding, built upon the TinyLLaVA-Video architecture that supports both video-level and frame-level inputs, and (iii) a video-level surgical Visual Question Answering (VQA) benchmark, covering diverse 11 surgical specialities, such as vascular, cardiology, and thoracic. Extensive experiments, conducted on the proposed benchmark and three additional surgical downstream tasks (action recognition, skill assessment, and triplet recognition), show that SurgLLaVA-Video significantly outperforms both general-purpose and surgical-specific VLMs with only three billion parameters. The dataset, model, and benchmark will be released to enable further advancements in surgical video u

搜索结果：Surgical case reports

Use of natural language processing to extract and classify papillary thyroid cancer features from surgical pathology reports

SurgPub-Video: A Comprehensive Surgical Video Dataset for Enhanced Surgical Intelligence in Vision-Language Model

SurgWound-Bench: A Benchmark for Surgical Wound Diagnosis

Cosmos-H-Surgical: Learning Surgical Robot Policies from Videos via World Modeling

Towards Holistic Surgical Scene Graph

ORBIT-Surgical: An Open-Simulation Framework for Learning Surgical Augmented Dexterity

From Phase Grounding to Intelligent Surgical Narratives

SURGIVID: Annotation-Efficient Surgical Video Object Discovery

Advancing Surgical VQA with Scene Graph Knowledge

Technical Report: Automated Optical Inspection of Surgical Instruments

Surgical Vision World Model

Surgical-LLaVA: Toward Surgical Scenario Understanding via Large Language and Vision Models

Surg-SegFormer: A Dual Transformer-Based Model for Holistic Surgical Scene Segmentation

Can We Revitalize Interventional Healthcare with AI-XR Surgical Metaverses?

SurgiPose: Estimating Surgical Tool Kinematics from Monocular Video for Surgical Robot Learning

Surgical Text-to-Image Generation

Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery

Dynamic Scene Graph Representation for Surgical Video

A Formal Approach For Modelling And Analysing Surgical Procedures (Extended Version)

LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learning