Long-term performance assessment of fully automatic biomedical glottis segmentation at the point of care.

René Groh , Stephan Dürr , Anne Schützenberger , Marion Semmler , Andreas M Kist

PLoS One

Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Bavaria, Germany.

Published: September 2022

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Deep Learning has a large impact on medical image analysis and lately has been adopted for clinical use at the point of care. However, there is only a small number of reports of long-term studies that show the performance of deep neural networks (DNNs) in such an environment. In this study, we measured the long-term performance of a clinically optimized DNN for laryngeal glottis segmentation. We have collected the video footage for two years from an AI-powered laryngeal high-speed videoendoscopy imaging system and found that the footage image quality is stable across time. Next, we determined the DNN segmentation performance on lossy and lossless compressed data revealing that only 9% of recordings contain segmentation artifacts. We found that lossy and lossless compression is on par for glottis segmentation, however, lossless compression provides significantly superior image quality. Lastly, we employed continual learning strategies to continuously incorporate new data into the DNN to remove the aforementioned segmentation artifacts. With modest manual intervention, we were able to largely alleviate these segmentation artifacts by up to 81%. We believe that our suggested deep learning-enhanced laryngeal imaging platform consistently provides clinically sound results, and together with our proposed continual learning scheme will have a long-lasting impact on the future of laryngeal imaging.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9491538	PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0266989	PLOS

Publication Analysis

Top Keywords

glottis segmentation

segmentation artifacts

long-term performance

point care

image quality

lossy lossless

lossless compression

continual learning

laryngeal imaging

segmentation

Similar Publications

High-Resolution Three-Dimensional Hybrid MRI + Low Dose CT Vocal Tract Modeling: A Cadaveric Pilot Study.

J Voice

July 2025

Departments of Radiology, Medicine and Roy J Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa.

David Meyer , Rushdi Zahid Rusho , Wahidul Alam , Gary E Christensen , David M Howard

Objectives: MRI based vocal tract models have many applications in voice research and education. These models do not adequately capture bony structures (e.g.

View Article and Find Full Text PDF

Similar Publications

An automatic laryngoscopic image segmentation system based on SAM prompt engineering: from glottis annotation to vocal fold segmentation.

Front Mol Biosci

July 2025

School of Computer Science, Wuhan University, Wuhan, China.

Yucong Zhang , Yuchen Song , Juan Liu , Ming Li

Introduction: Laryngeal high-speed video (HSV) is a widely used technique for diagnosing laryngeal diseases. Among various analytical approaches, segmentation of glottis regions has proven effective in evaluating vocal fold vibration patterns and detecting related disorders. However, the specific task of vocal fold segmentation remains underexplored in the literature.

View Article and Find Full Text PDF

Similar Publications

Improved YOLOv8-seg for laryngeal structure recognition in medical images.

Am J Transl Res

May 2025

Department of Anesthesiology, The First Hospital of Putian City Putian 351100, Fujian, China.

Haipo Cui , Jinjing Wu , Tianying Li , Zui Zou , Wenhui Guo

Objectives: Tracheal intubation is a routine procedure in clinical surgeries and emergency situations, essential for maintaining respiration and ensuring airway patency. Due to the complexity of laryngeal structures and the need for rapid airway management in critically ill patients, real-time, accurate identification of key laryngeal structures is crucial for successful intubation. This study presents a real-time laryngeal structure recognition method based on an improved YOLOv8-seg model.

View Article and Find Full Text PDF

Similar Publications

The Laryngovibrogram as a normalized spatiotemporal representation of vocal fold dynamics.

Sci Rep

May 2025

Department of Computer Science, Trier University of Applied Sciences, Schneidershof, 54293, Trier, Germany.

Mona Kirstin Fehling , Maria Schuster , Maximilian Linxweiler , Jörg Lohscheller

Laryngeal high-speed video (HSV)-endoscopy allows for fast, non-invasive diagnosis of voice disorders and forms the basis for a comprehensive quantitative analysis of the vocal folds' (VFs') spatiotemporal vibrational behavior. Previous approaches, such as the Phonovibrogram (PVG), describe the vibrational behavior of vocal folds (VFs) based exclusively on the time-varying glottal opening. However, focusing solely on the glottal area overlooks the full extent and dynamic behavior of the VF tissue, factors that are crucial for the voice production process.

View Article and Find Full Text PDF

Similar Publications

Effects of the head-elevated position on cervical spine motion during videolaryngoscopic intubation with manual in-line stabilization: a randomized controlled trial.

Can J Anaesth

May 2025

Department of Anesthesiology and Pain Medicine, Seoul National University Hospital, Seoul National University College of Medicine, 101, Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea.

Woo-Young Jo , Chan-Ho Hong , Kyung Won Shin , Hyongmin Oh , Hee-Pyoung Park

Purpose: The head-elevated position during videolaryngoscopic intubation enables better visualization of the glottis than the head-flat position. We hypothesized that the head-elevated position would result in less cervical spine motion during videolaryngoscopic intubation under manual in-line stabilization.

Methods: We conducted a randomized controlled trial in which we assigned patients undergoing coil embolization for unruptured cerebral aneurysms into the head-elevated (N = 55) or head-flat (N = 54) groups.

View Article and Find Full Text PDF

Similar Publications