Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The field of computer vision applied to videos of minimally invasive surgery is ever-growing. Workflow recognition pertains to the automated recognition of various aspects of a surgery, including: which surgical steps are performed; and which surgical instruments are used. This information can later be used to assist clinicians when learning the surgery or during live surgery. The Pituitary Vision (PitVis) 2023 Challenge tasks the community to step and instrument recognition in videos of endoscopic pituitary surgery. This is a particularly challenging task when compared to other minimally invasive surgeries due to: the smaller working space, which limits and distorts vision; and higher frequency of instrument and step switching, which requires more precise model predictions. Participants were provided with 25-videos, with results presented at the MICCAI-2023 conference as part of the Endoscopic Vision 2023 Challenge in Vancouver, Canada, on 08-Oct-2023. There were 18-submissions from 9-teams across 6-countries, using a variety of deep learning models. The top performing model for step recognition utilised a transformer based architecture, uniquely using an autoregressive decoder with a positional encoding input. The top performing model for instrument recognition utilised a spatial encoder followed by a temporal encoder, which uniquely used a 2-layer temporal architecture. In both cases, these models outperformed purely spatial based models, illustrating the importance of sequential and temporal information. This PitVis-2023 therefore demonstrates state-of-the-art computer vision models in minimally invasive surgery are transferable to a new dataset. Benchmark results are provided in the paper, and the dataset is publicly available at: https://doi.org/10.5522/04/26531686.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.media.2025.103716DOI Listing

Publication Analysis

Top Keywords

minimally invasive
12
workflow recognition
8
recognition videos
8
videos endoscopic
8
endoscopic pituitary
8
pituitary surgery
8
computer vision
8
invasive surgery
8
2023 challenge
8
instrument recognition
8

Similar Publications

Minimally invasive and standardized thoracoscopic surgery for stage III empyema using a variable-view rigid endoscope.

Gen Thorac Cardiovasc Surg

September 2025

Department of General Thoracic Surgery, Seirei Hamamatsu General Hospital, 2-12-12, Hamamatsu, Shizuoka, 430-8558, Japan.

Thoracoscopic surgery for stage III acute empyema is often limited by poor visualization and anatomical complexity. We developed a standardized, minimally invasive approach using a variable-view rigid endoscope and fixed port placement, regardless of disease extent or patient physique. The variable-view endoscope enabled a wide, adjustable field of view without moving the camera shaft, allowing safe access even in the confined thoracic space.

View Article and Find Full Text PDF

Non-invasive prediction of invasive lung adenocarcinoma and high-risk histopathological characteristics in resectable early-stage adenocarcinoma by [18F]FDG PET/CT radiomics-based machine learning models: a prospective cohort Study.

Int J Surg

September 2025

Department of Respiratory and Critical Care Medicine, Hubei Province Clinical Research Center for Major Respiratory Diseases, Key Laboratory of Pulmonary Diseases of National Health Commission, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China

Background: Precise preoperative discrimination of invasive lung adenocarcinoma (IA) from preinvasive lesions (adenocarcinoma in situ [AIS]/minimally invasive adenocarcinoma [MIA]) and prediction of high-risk histopathological features are critical for optimizing resection strategies in early-stage lung adenocarcinoma (LUAD).

Methods: In this multicenter study, 813 LUAD patients (tumors ≤3 cm) formed the training cohort. A total of 1,709 radiomic features were extracted from the PET/CT images.

View Article and Find Full Text PDF

Background: Phrenic nerve injury during mediastinal tumor resection can lead to significant postoperative diaphragmatic dysfunction. Current intraoperative protection techniques are imprecise and lack real-time feedback. We aimed to develop and validate a quantifiable, multimodal neuroprotective strategy.

View Article and Find Full Text PDF

Diabetes mellitus (DM) is a chronic metabolic disorder characterized by persistent hyperglycemia with multiple clinical manifestations and complications, such as cardiovascular disease, kidney dysfunction, retinal impairment, and peripheral neuropathy. Continuous and minimally invasive glucose monitoring is essential for effective DM management. Microneedles (MNs)-based sensing platforms offer a promising solution; however, conventional polymeric MNs suffer from limited electrochemical sensitivity due to their insufficient electroactive surface area and inefficient loading of catalytic and enzymatic components.

View Article and Find Full Text PDF