Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Deep Learning has a large impact on medical image analysis and lately has been adopted for clinical use at the point of care. However, there is only a small number of reports of long-term studies that show the performance of deep neural networks (DNNs) in such an environment. In this study, we measured the long-term performance of a clinically optimized DNN for laryngeal glottis segmentation. We have collected the video footage for two years from an AI-powered laryngeal high-speed videoendoscopy imaging system and found that the footage image quality is stable across time. Next, we determined the DNN segmentation performance on lossy and lossless compressed data revealing that only 9% of recordings contain segmentation artifacts. We found that lossy and lossless compression is on par for glottis segmentation, however, lossless compression provides significantly superior image quality. Lastly, we employed continual learning strategies to continuously incorporate new data into the DNN to remove the aforementioned segmentation artifacts. With modest manual intervention, we were able to largely alleviate these segmentation artifacts by up to 81%. We believe that our suggested deep learning-enhanced laryngeal imaging platform consistently provides clinically sound results, and together with our proposed continual learning scheme will have a long-lasting impact on the future of laryngeal imaging.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9491538PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0266989PLOS

Publication Analysis

Top Keywords

glottis segmentation
12
segmentation artifacts
12
long-term performance
8
point care
8
image quality
8
lossy lossless
8
lossless compression
8
continual learning
8
laryngeal imaging
8
segmentation
7

Similar Publications

Objectives: MRI based vocal tract models have many applications in voice research and education. These models do not adequately capture bony structures (e.g.

View Article and Find Full Text PDF

Introduction: Laryngeal high-speed video (HSV) is a widely used technique for diagnosing laryngeal diseases. Among various analytical approaches, segmentation of glottis regions has proven effective in evaluating vocal fold vibration patterns and detecting related disorders. However, the specific task of vocal fold segmentation remains underexplored in the literature.

View Article and Find Full Text PDF

Objectives: Tracheal intubation is a routine procedure in clinical surgeries and emergency situations, essential for maintaining respiration and ensuring airway patency. Due to the complexity of laryngeal structures and the need for rapid airway management in critically ill patients, real-time, accurate identification of key laryngeal structures is crucial for successful intubation. This study presents a real-time laryngeal structure recognition method based on an improved YOLOv8-seg model.

View Article and Find Full Text PDF

Laryngeal high-speed video (HSV)-endoscopy allows for fast, non-invasive diagnosis of voice disorders and forms the basis for a comprehensive quantitative analysis of the vocal folds' (VFs') spatiotemporal vibrational behavior. Previous approaches, such as the Phonovibrogram (PVG), describe the vibrational behavior of vocal folds (VFs) based exclusively on the time-varying glottal opening. However, focusing solely on the glottal area overlooks the full extent and dynamic behavior of the VF tissue, factors that are crucial for the voice production process.

View Article and Find Full Text PDF

Purpose: The head-elevated position during videolaryngoscopic intubation enables better visualization of the glottis than the head-flat position. We hypothesized that the head-elevated position would result in less cervical spine motion during videolaryngoscopic intubation under manual in-line stabilization.

Methods: We conducted a randomized controlled trial in which we assigned patients undergoing coil embolization for unruptured cerebral aneurysms into the head-elevated (N = 55) or head-flat (N = 54) groups.

View Article and Find Full Text PDF