98%
921
2 minutes
20
Deep Learning has a large impact on medical image analysis and lately has been adopted for clinical use at the point of care. However, there is only a small number of reports of long-term studies that show the performance of deep neural networks (DNNs) in such an environment. In this study, we measured the long-term performance of a clinically optimized DNN for laryngeal glottis segmentation. We have collected the video footage for two years from an AI-powered laryngeal high-speed videoendoscopy imaging system and found that the footage image quality is stable across time. Next, we determined the DNN segmentation performance on lossy and lossless compressed data revealing that only 9% of recordings contain segmentation artifacts. We found that lossy and lossless compression is on par for glottis segmentation, however, lossless compression provides significantly superior image quality. Lastly, we employed continual learning strategies to continuously incorporate new data into the DNN to remove the aforementioned segmentation artifacts. With modest manual intervention, we were able to largely alleviate these segmentation artifacts by up to 81%. We believe that our suggested deep learning-enhanced laryngeal imaging platform consistently provides clinically sound results, and together with our proposed continual learning scheme will have a long-lasting impact on the future of laryngeal imaging.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9491538 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0266989 | PLOS |
J Voice
July 2025
Departments of Radiology, Medicine and Roy J Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa.
Objectives: MRI based vocal tract models have many applications in voice research and education. These models do not adequately capture bony structures (e.g.
View Article and Find Full Text PDFFront Mol Biosci
July 2025
School of Computer Science, Wuhan University, Wuhan, China.
Introduction: Laryngeal high-speed video (HSV) is a widely used technique for diagnosing laryngeal diseases. Among various analytical approaches, segmentation of glottis regions has proven effective in evaluating vocal fold vibration patterns and detecting related disorders. However, the specific task of vocal fold segmentation remains underexplored in the literature.
View Article and Find Full Text PDFAm J Transl Res
May 2025
Department of Anesthesiology, The First Hospital of Putian City Putian 351100, Fujian, China.
Objectives: Tracheal intubation is a routine procedure in clinical surgeries and emergency situations, essential for maintaining respiration and ensuring airway patency. Due to the complexity of laryngeal structures and the need for rapid airway management in critically ill patients, real-time, accurate identification of key laryngeal structures is crucial for successful intubation. This study presents a real-time laryngeal structure recognition method based on an improved YOLOv8-seg model.
View Article and Find Full Text PDFSci Rep
May 2025
Department of Computer Science, Trier University of Applied Sciences, Schneidershof, 54293, Trier, Germany.
Laryngeal high-speed video (HSV)-endoscopy allows for fast, non-invasive diagnosis of voice disorders and forms the basis for a comprehensive quantitative analysis of the vocal folds' (VFs') spatiotemporal vibrational behavior. Previous approaches, such as the Phonovibrogram (PVG), describe the vibrational behavior of vocal folds (VFs) based exclusively on the time-varying glottal opening. However, focusing solely on the glottal area overlooks the full extent and dynamic behavior of the VF tissue, factors that are crucial for the voice production process.
View Article and Find Full Text PDFCan J Anaesth
May 2025
Department of Anesthesiology and Pain Medicine, Seoul National University Hospital, Seoul National University College of Medicine, 101, Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea.
Purpose: The head-elevated position during videolaryngoscopic intubation enables better visualization of the glottis than the head-flat position. We hypothesized that the head-elevated position would result in less cervical spine motion during videolaryngoscopic intubation under manual in-line stabilization.
Methods: We conducted a randomized controlled trial in which we assigned patients undergoing coil embolization for unruptured cerebral aneurysms into the head-elevated (N = 55) or head-flat (N = 54) groups.