Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Accurate vocal cord leukoplakia classification is critical for the individualized treatment and early detection of laryngeal cancer. Numerous deep learning techniques have been proposed, but it is unclear how to select one to apply in the laryngeal tasks. This article introduces and reliably evaluates existing deep learning models for vocal cord leukoplakia classification.

Methods: We created white light and narrow band imaging (NBI) image datasets of vocal cord leukoplakia which were classified into six classes: normal tissues (NT), inflammatory keratosis (IK), mild dysplasia (MiD), moderate dysplasia (MoD), severe dysplasia (SD), and squamous cell carcinoma (SCC). Vocal cord leukoplakia classification was performed using six classical deep learning models, AlexNet, VGG, Google Inception, ResNet, DenseNet, and Vision Transformer.

Results: GoogLeNet (i.e., Google Inception V1), DenseNet-121, and ResNet-152 perform excellent classification. The highest overall accuracy of white light image classification is 0.9583, while the highest overall accuracy of NBI image classification is 0.9478. These three neural networks all provide very high sensitivity, specificity, and precision values.

Conclusion: GoogLeNet, ResNet, and DenseNet can provide accurate pathological classification of vocal cord leukoplakia. It facilitates early diagnosis, providing judgment on conservative treatment or surgical treatment of different degrees, and reducing the burden on endoscopists.

Download full-text PDF

Source
http://dx.doi.org/10.1002/hed.27543DOI Listing

Publication Analysis

Top Keywords

vocal cord
24
cord leukoplakia
24
deep learning
16
leukoplakia classification
12
learning models
12
white light
12
light narrow
8
narrow band
8
band imaging
8
nbi image
8

Similar Publications

A 48-year-old man with a superior labral tear and medical history including hemidiaphragmatic paresis, obstructive sleep apnea, vocal cord paresis, and glottic narrowing, underwent arthroscopic biceps tenodesis. Reduction in respiratory function presented anesthetic management challenges with general anesthesia or an interscalene brachial plexus block. Instead, ultrasound guidance was used to deliver a selective upper-trunk block with 1 % lidocaine and an axillary nerve block with 0.

View Article and Find Full Text PDF

Utilizing biomaterials for laryngeal respiratory mucosal tissue repair in an animal model.

Biomater Biosyst

September 2025

ENT and Head and Neck Research Center and Department, The Five Senses Health Institute, School of Medicine, Iran University of Medical Sciences, Tehran, Iran.

Introduction: The airway mucosa plays a crucial role in protection and various physiological functions. Current methods for restoring airway mucosa, such as myocutaneous flaps or split skin grafts, create a stratified squamous layer that lacks the cilia and mucus-secreting glands of the native columnar-lined airway. This study examines the application of various injectable biopolymers as active molecules for a potential approach to regenerating laryngeal epithelial tissue.

View Article and Find Full Text PDF

Bi-layered microflap surgery for the treatment of anterior glottic web.

Eur Arch Otorhinolaryngol

September 2025

Department of Otolaryngology Head And Neck Surgery, Far Eastern Memorial Hospital, No. 21, Section 2, Nan-Ya South Road, New Taipei City, Taiwan.

Introduction: Anterior glottic webs are epithelium-covered fibrous tissue formations at the anterior commissure, leading to synechiae between the bilateral vocal folds. They manifest with symptoms ranging from hoarseness to airway obstruction. However, treating anterior glottic webs are challenging due to their high recurrence rates.

View Article and Find Full Text PDF

Vocal tract contribution to vocal intensity: Interaction between vocal fold adduction, formant tuning, and fundamental frequency.

J Acoust Soc Am

September 2025

Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehab Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA.

The goal of this study was to understand the interaction between the voice source spectral shape, formant tuning, and fundamental frequency in determining the vocal tract contribution to vocal intensity. Computational voice simulations were performed with parametric variations in both vocal fold and vocal tract configurations. The vocal tract contribution to vocal intensity was quantified as the difference in the A-weighted sound pressure level between the radiated sound pressure and the sound pressure at the glottis.

View Article and Find Full Text PDF

Background: The benefits of intraoperative nerve monitoring for identifying recurrent laryngeal nerves during esophageal cancer surgery have recently been reported. However, no standardized procedures have been established for the use of this system. This study aimed to identify factors affecting the diagnostic accuracy of intraoperative nerve monitoring for recurrent laryngeal nerve palsy and explore approaches to improve the precision and efficiency of intraoperative nerve monitoring in esophageal cancer surgery.

View Article and Find Full Text PDF