An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection.

Sensors (Basel)

Department of Computer Science and Statistics, Institute of Biosciences, Letters and Exact Sciences, São Paulo State University, São José do Rio Preto 15054-000, SP, Brazil.

Published: May 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one's own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256083PMC
http://dx.doi.org/10.3390/s23115196DOI Listing

Publication Analysis

Top Keywords

dysphonia detection
8
voice signal
8
voice
7
experimental analysis
4
analysis multicepstral
4
multicepstral projection
4
projection representation
4
representation strategies
4
dysphonia
4
strategies dysphonia
4

Similar Publications

Unraveling Sulcus Vergeture (Type II): Demographic and Clinical Characteristics.

J Voice

August 2025

Marmara University Faculty of Medicine, Pendik Training and Research Hospital, Department of Otorhinolaryngology, İstanbul, Turkey.

Objective: Multiple studies have addressed pathological sulcus (types II and III) collectively; however, it is important to note that type II and III represent distinct entities rather than varying degrees of the same disease. This study aims to focus on the frequency, demographic features, clinical presentation, and typical examination results of patients with type II sulcus (sulcus vergeture).

Methods: An analysis was performed on the data of 3023 patients who were admitted to the voice and swallowing outpatient clinic due to dysphonia.

View Article and Find Full Text PDF

Voice disorders represent a common medical condition affecting up to 16.9% of the general population, with unilateral vocal fold paralysis (UVFP) being particularly severe. This condition causes breathy dysphonia, maladaptive articulatory behaviours, and cortical alterations in sensory processing.

View Article and Find Full Text PDF

Benign and malignant vocal fold lesions can alter voice quality and lead to significant morbidity or, in the case of malignancy, mortality. Early, noninvasive identification of these lesions using voice as a biomarker may improve diagnostic access and outcomes. In this study, we analyzed data from the initial release of the Bridge2AI-Voice dataset to evaluate which acoustic features best distinguish laryngeal cancer and benign vocal fold lesions from other vocal pathologies and healthy voice function.

View Article and Find Full Text PDF

[Comment to the article "Clinical-pathological characterization of laryngeal cancer in the Mexican population"].

Rev Med Inst Mex Seguro Soc

July 2025

Instituto Mexicano del Seguro Social, Centro Médico Nacional La Raza, Hospital General "Dr. Gaudencio González Garza", Servicio de Otorrinolaringología. Ciudad de México, México.

I read with interest the article by Castillo López et al. on the characterization of patients with laryngeal cancer. Although they highlight risk factors such as smoking, advanced age, and prolonged dysphonia, I believe it is important to include other risks, such as HPV16 infection and alcohol consumption, as reported in recent studies.

View Article and Find Full Text PDF

Introduction: Juvenile dermatomyositis (JDM) is a rare immune-mediated disease, characterised by proximal muscle weakness and typical skin rashes1. We present a patient with severe JDM, to highlight the importance of a timely diagnosis and early initiation of treatment.

Case Description: A 9-year-old girl presented to the hospital due to asthenia, rash, generalized oedema, and inability to walk.

View Article and Find Full Text PDF