Minimally invasive surgery has dramatically improved patient operative outcomes, yet identifying safe operative zones remains challenging in critical phases, requiring surgeons to integrate visual cues, procedural phase, and anatomical context under high cognitive load. Existing AI systems offer binary safety verification or static detection, ignoring the phase-dependent nature of intraoperative reasoning. We introduce ResGo, a benchmark of laparoscopic frames annotated with Go Zone bounding boxes and clinician-authored rationales covering phase, exposure quality reasoning, next action and risk reminder. We introduce evaluation metrics that treat correct grounding under incorrect phase as failures, revealing that most vision-language models cannot handle such tasks and perform poorly. We then present SurGo-R1, a model optimized via RLHF with a multi-turn phase-then-go architecture where the model first identifies the surgical phase, then generates reasoning and Go Zone coordinates conditioned on that context. On unseen procedures, SurGo-R1 achieves 76.6% phase accuracy, 32.7 mIoU, and 54.8% hardcore accuracy, a 6.6$\times$ improvement over the mainstream generalist VLMs. Code, mode
The convergence of robotics and virtual reality (VR) has enabled safer and more efficient workflows in high-risk laboratory settings, particularly virology labs. As biohazard complexity increases, minimizing direct human exposure while maintaining precision becomes essential. We propose GAMORA (Gesture Articulated Meta Operative Robotic Arm), a novel VR-guided robotic system that enables remote execution of hazardous tasks using natural hand gestures. Unlike existing scripted automation or traditional teleoperation, GAMORA integrates the Oculus Quest 2, NVIDIA Jetson Nano, and Robot Operating System (ROS) to provide real-time immersive control, digital twin simulation, and inverse kinematics-based articulation. The system supports VR-based training and simulation while executing precision tasks in physical environments via a 3D-printed robotic arm. Inverse kinematics ensure accurate manipulation for delicate operations such as specimen handling and pipetting. The pipeline includes Unity-based 3D environment construction, real-time motion planning, and hardware-in-the-loop testing. GAMORA achieved a mean positional discrepancy of 2.2 mm (improved from 4 mm), pipetting accuracy withi
Purpose: Laparoscopic cholecystectomy (LC) operative difficulty (LCOD) is highly variable and influences outcomes. Despite extensive LC studies in surgical workflow analysis, limited efforts explore LCOD using intraoperative video data. Early recognition of LCOD could allow prompt review by expert surgeons, enhance operating room (OR) planning, and improve surgical outcomes. Methods: We propose the clinical task of early LCOD assessment using limited video observations. We design SurgPrOD, a deep learning model to assess LCOD by analyzing features from global and local temporal resolutions (snapshots) of the observed LC video. Also, we propose a novel snapshot-centric attention (SCA) module, acting across snapshots, to enhance LCOD prediction. We introduce the CholeScore dataset, featuring video-level LCOD labels to validate our method. Results: We evaluate SurgPrOD on 3 LCOD assessment scales in the CholeScore dataset. On our new metric assessing early and stable correct predictions, SurgPrOD surpasses baselines by at least 0.22 points. SurgPrOD improves over baselines by at least 9 and 5 percentage points in F1 score and top1-accuracy, respectively, demonstrating its effectivenes
For several cancer patients, operative resection with curative intent can end up in early recurrence of the cancer. Current limitations in peri-operative cancer staging and especially intra-operative misidentification of visible metastases is likely the main reason leading to unnecessary operative interventions in the affected individuals. Here, we evaluate whether an artificial intelligence (AI) system can improve recognition of peritoneal surface metastases on routine staging laparoscopy images from patients with gastrointestinal malignancies. In a simulated setting evaluating biopsied peritoneal lesions, a prototype deep learning surgical guidance system outperformed oncologic surgeons in identifying peritoneal surface metastases. In this environment the developed AI model would have improved the identification of metastases by 5% while reducing the number of unnecessary biopsies by 28% compared to current standard practice. Evaluating non-biopsied peritoneal lesions, the findings support the possibility that the AI system could identify peritoneal surface metastases that were falsely deemed benign in clinical practice. Our findings demonstrate the technical feasibility of an AI
Objectives: To assess the effectiveness of digital scanning techniques for self-assessment and of preparations and restorations in preclinical dental education when compared to traditional faculty grading. Methods: Forty-four separate Class I (#30-O), Class II (#30-MO) preparations, and class II amalgam restorations (#31-MO) were generated respectively under preclinical assessment setting. Calibrated faculty evaluated the preparations and restorations using a standard rubric from preclinical operative class. The same teeth were scanned using Planmeca PlanScan intraoral scanner and graded using the Romexis E4D Compare Software. Each tooth was compared against a corresponding gold standard tooth with tolerance intervals ranging from 100μm to 500μm. These scores were compared to traditional faculty grades using a linear mixed model to estimate the mean differences at 95% confidence interval for each tolerance level. Results: The average Compare Software grade of Class I preparation at 300μm tolerance had the smallest mean difference of 1.64 points on a 100 points scale compared to the average faculty grade. Class II preparation at 400μm tolerance had the smallest mean difference of 0.
Such human-assisting systems as robots need to correctly understand the surrounding situation based on observations and output the required support actions for humans. Language is one of the important channels to communicate with humans, and the robots are required to have the ability to express their understanding and action planning results. In this study, we propose a new task of operative action captioning that estimates and verbalizes the actions to be taken by the system in a human-assisting domain. We constructed a system that outputs a verbal description of a possible operative action that changes the current state to the given target state. We collected a dataset consisting of two images as observations, which express the current state and the state changed by actions, and a caption that describes the actions that change the current state to the target state, by crowdsourcing in daily life situations. Then we constructed a system that estimates operative action by a caption. Since the operative action's caption is expected to contain some state-changing actions, we use scene-graph prediction as an auxiliary task because the events written in the scene graphs correspond to
This work is concerned with devising a robust Parkinson's (PD) disease detector from speech in real-world operating conditions using (i) foundational models, and (ii) speech enhancement (SE) methods. To this end, we first fine-tune several foundational-based models on the standard PC-GITA (s-PC-GITA) clean data. Our results demonstrate superior performance to previously proposed models. Second, we assess the generalization capability of the PD models on the extended PC-GITA (e-PC-GITA) recordings, collected in real-world operative conditions, and observe a severe drop in performance moving from ideal to real-world conditions. Third, we align training and testing conditions applaying off-the-shelf SE techniques on e-PC-GITA, and a significant boost in performance is observed only for the foundational-based models. Finally, combining the two best foundational-based models trained on s-PC-GITA, namely WavLM Base and Hubert Base, yielded top performance on the enhanced e-PC-GITA.
Total knee arthroplasty (TKA) is a common orthopaedic surgery to replace a damaged knee joint with artificial implants. The inaccuracy of achieving the planned implant position can result in the risk of implant component aseptic loosening, wear out, and even a joint revision, and those failures most of the time occur on the tibial side in the conventional jig-based TKA (CON-TKA). This study aims to precisely evaluate the accuracy of the proximal tibial resection plane intra-operatively in real-time such that the evaluation processing changes very little on the CON-TKA operative procedure. Two X-ray radiographs captured during the proximal tibial resection phase together with a pre-operative patient-specific tibia 3D mesh model segmented from computed tomography (CT) scans and a trocar pin 3D mesh model are used in the proposed simultaneous localisation and mapping (SLAM) system to estimate the proximal tibial resection plane. Validations using both simulation and in-vivo datasets are performed to demonstrate the robustness and the potential clinical value of the proposed algorithm.
Objectives Computer vision (CV) is a field of artificial intelligence that enables machines to interpret and understand images and videos. CV has the potential to be of assistance in the operating room (OR) to track surgical instruments. We built a CV algorithm for identifying surgical instruments in the neurosurgical operating room as a potential solution for surgical instrument tracking and management to decrease surgical waste and opening of unnecessary tools. Methods We collected 1660 images of 27 commonly used neurosurgical instruments. Images were labeled using the VGG Image Annotator and split into 80% training and 20% testing sets in order to train a U-Net Convolutional Neural Network using 5-fold cross validation. Results Our U-Net achieved a tool identification accuracy of 80-100% when distinguishing 25 classes of instruments, with 19/25 classes having accuracy over 90%. The model performance was not adequate for sub classifying Adson, Gerald, and Debakey forceps, which had accuracies of 60-80%. Conclusions We demonstrated the viability of using machine learning to accurately identify surgical instruments. Instrument identification could help optimize surgical tray packin
This article faces the problem of operative and procedural cooperative training in marine ports with particular attention to harbour pilots and port traffic controller. The design and development of an advanced system, equipped with dedicated hardware in the loop, for cooperative training of operators involved in the last mile of navigation is presented. Indeed, the article describes the software and hardware development of a distributed and interoperable system composed by two simulators (the bridge ship simulator and control tower simulator). Multiple problems are faced and solved including (i) the motion of the ship at sea that is based on a 6 Degree Of Freedom (DOF) model for surge, sway and yaw and closed form expressions for pitch, roll and heave and its validation; (ii) the development of the 3D geometric models and related virtual environments of a real marine port and vessel (to provide the trainees with the sensation to experience a real port and ship environment); (iii) the design of a bridge ship replica, the bridge hardware integration and the design of the visualization system; (iv) the design and development of the control tower simulator; (v) the integration of the
Providing an accurate and efficient assessment of operative difficulty is important for designing robot-assisted teleoperation interfaces that are easy and natural for human operators to use. In this paper, we aim to develop a data-driven approach to numerically characterize the operative difficulty demand of complex teleoperation. In effort to provide an entirely task-independent assessment, we consider using only data collected from the human user including: (1) physiological response, and (2) movement kinematics. By leveraging an unsupervised domain adaptation technique, our approach learns the user information that defines task difficulty in a well-known source, namely, a Fitt's target reaching task, and generalizes that knowledge to a more complex human motor control scenario, namely, the teleoperation of a robotic system. Our approach consists of two main parts: (1) The first part accounts for the inherent variances of user physiological and kinematic response between these cross-domain motor control scenarios that are vastly different. (2) A stacked two-layer learner is designed to improve the overall modeling performance, yielding a 96.6% accuracy in predicting the known di
Network slicing is considered a key mechanism to serve the multitude of tenants (e.g. vertical industries) targeted by forthcoming 5G systems in a flexible and cost-efficient manner. In this paper, we present a SDN/NFV architecture with multi-tenancy support. This architecture enables a network slice provider to deploy network slice instances for multiple tenants on-the-fly, and simultaneously provision them with isolation guarantees. Following the Network Slice as-a-Service delivery model, a tenant may access a Service Catalog, selecting the slice that best fits its needs and ordering its deployment. This work provides a detailed view on the stages that a network slice provider must follow to deploy the ordered network slice instance, accommodating it into a multi-domain infrastructure, and putting it operative for tenant's consumption. These stages address critical issues identified in the literature, including (i) the mapping from high-level service requirements to network functions and infrastructure requirements, (ii) the admission control, and (iii) the specific information a network slice descriptor should have. With the proposed architecture and the recommended set of stage
La Fuenfría Hospital (LFH) operative parameters such as: hospitalised patients; daily admissions and discharges were studies for the hospital as a whole, and per each Hospital's service unit (just called "service" here). Data were used to build operative parameter value series and their variation. Conventional statistical analyses and fractal dimension analyses were performed on the series. Statistical analyses indicated that the data did not follow a Gauss (i.e. "normal") distribution, thus nonparametric statistical analyses were chosen to describe data. The sequence of admitted daily admissions and patients staying on each service were found to be a kind of random series of a kind called random walks (Rw). Rw are sequences where what happens next ($ y_{t+Δt}$), depends on what happens now ($ y_{t}$) plus a random variable ($ ε$), $ y_{t+Δt}= y_t + ε$. Rw analysed with parametric or non parametric statistics may simulate cycles and drifts which resemble seasonal variations or fake trends. Globally, admitted patients Rws in LFFH, were found to be determined by the time elapsed between daily discharges and admissions. The factor determining LFH Rw were found to be the difference bet
Arguments are given that time must be defined in an operative manner,i.e., by constructing devices which can serve as clocks.The investigation of such devices leads to the conclusion that there is a principal uncertainity of time if one considers periods which are not large compared with the Planck time. Thus,according to the old (classical) concept,time cannot be well-defined at this scale.The uncertainity of time leads to a breakdown of Special and General relativity in the Planck regime;the same happens with causality. We present arguments that the classical concept of time,which treats t simply as a real parameter,must be replaced by a new one.
In this paper we deal with the macroscopic electromagnetic response of a finite size dispersive dielectric object, in unbounded space, in the framework of quantum electrodynamics using the Heisenberg picture. We apply a Hopfield type scheme to account for the dispersion and dissipation of the matter. We provide a general expression of the polarization density field operator as functions of the initial conditions of the matter field operators and of the electromagnetic field operators. It is a linear functional whose kernel is a linear expression of the impulse response of the dielectric object that we obtain within the framework of classical electrodynamics. The electric field operator is expressed as a function of the polarization density field operator by means of the dyadic Green's function for the free space. The statistical functions of these operators are classical functionals of the statistics of the initial conditions of the matter field operators and of the electromagnetic field operators, whose kernels are linear or multilinear expressions of the impulse response of the dielectric object. We keep the polarization and the electromagnetic field distinct to enable the treatm
Neural operators approximate mappings between function spaces, but often generalize poorly to other operators and usually require fine-tuning or retraining. In-Context Operator Networks (ICON) addresses this issue by prompting the model with numerical context so that the model learns specific operators from prompts and adapt to different operators without fine-tuning. However, ICON may still fail to generalize to out-of-distribution (OOD) operator tasks. Inpired by the success of harness engineering of Large Language models (LLMs), we introduce Chain of Operators (CHOP), a framework that harness a frozen ICON to OOD operator tasks without updating its parameters. Specifically, CHOP constructs a chain of operators consisting of explicit elementary transformations and the frozen ICON. Experiments on a scalar conservation law and a mean-field control problem show that CHOP reduces relative inference error over direct ICON evaluation, while each operator in the chain remains interpretable and in closed form. A chain constructed on one PDE family further generalizes to a different family, indicating shared mechanisms across harness systems.
In view of future applications of plasma-based particle accelerators, within the fields of high-energy physics and new light sources, the capability of plasma sources to operate at high repetition rates is crucial. In particular for gas-filled plasma discharge capillaries, which allow direct control over plasma properties, a key aspect is the longevity of the material, subject to erosion due to the heat flux delivered by high voltage plasma discharges. In this regard, we present an innovative design of discharge capillaries based on the use of different ceramic materials, which can sustain high voltage plasma discharges at high repetition rate and, moreover, be easily machined for the complex geometries required for plasma-based accelerators. Experimental campaigns are carried out at 10-150 Hz, assessing the longevity of ceramic capillaries by means of different diagnostic techniques. In addition, numerical simulations are performed to analyze the heat transfer within the whole plasma source. Results from experimental and numerical analysis highlight the capability of ceramic capillaries to preserve plasma properties and the integrity of the source during long-term plasma discharge
In this paper we attempt to lay the foundations for a theory encompassing some natural extensions of the class of subnormal operators, namely the $n$--subnormal operators and the sub-$n$--normal operators. We discuss inclusion relations among the above mentioned classes and other related classes, e.g., $n$--quasinormal and quasi-$n$--normal operators. We show that sub-$n$--normality is stronger than $n$--subnormality, and produce a concrete example of a $3$--subnormal operator which is not sub-$2$--normal. In \cite{CU1}, R.E. Curto, S.H. Lee and J. Yoon proved that if an operator $T$ is subnormal, left-invertible, and such that $T^n$ is quasinormal for some $n \le 2$, then $T$ is quasinormal. in \cite{JS}, P.Pietrzycki and J. Stochel improved this result by removing the assumption of left invertibility. In this paper we consider suitable analogs of this result for the case of operators in the above-mentioned classes. In particular, we prove that the weight sequence of an $n$--quasinormal unilateral weighted shift must be periodic with period at most $n$.
This article introduces operator on operator regression in quantum probability. Here in the regression model, the response and the independent variables are certain operator valued observables, and they are linearly associated with unknown scalar coefficient (denoted by $β$), and the error is a random operator. In the course of this study, we propose a quantum version of a class of estimators (denoted by $M$ estimator) of $β$, and the large sample behaviour of those quantum version of the estimators are derived, given the fact that the true model is also linear and the samples are observed eigenvalue pairs of the operator valued observables.
M. Lin defined a binary operation for two positive semi-definite matrices in studying certain determinantal inequalities that arise from diffusion tensor imaging. This operation enjoys some interesting properties similar to the operator geometric mean. We study this operation further and present numerous properties emphasizing the relationship with the operator geometric mean. In the end, we present an application toward Tsallis relative operator entropy.