Analyzing Wav2Vec 1.0 Embeddings for Cross-Database Parkinson's Disease Detection and Speech Features Extraction.

Sensors (Basel)

Department of Biomedical Informatics, Faculty of Biomedical Engineering, Czech Technical University in Prague, 16000 Prague, Czech Republic.

Published: August 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Advancements in deep learning speech representations have facilitated the effective use of extensive unlabeled speech datasets for Parkinson's disease (PD) modeling with minimal annotated data. This study employs the non-fine-tuned wav2vec 1.0 architecture to develop machine learning models for PD speech diagnosis tasks, such as cross-database classification and regression to predict demographic and articulation characteristics. The primary aim is to analyze overlapping components within the embeddings on both classification and regression tasks, investigating whether latent speech representations in PD are shared across models, particularly for related tasks. Firstly, evaluation using three multi-language PD datasets showed that wav2vec accurately detected PD based on speech, outperforming feature extraction using mel-frequency cepstral coefficients in the proposed cross-database classification scenarios. In cross-database scenarios using Italian and English-read texts, wav2vec demonstrated performance comparable to intra-dataset evaluations. We also compared our cross-database findings against those of other related studies. Secondly, wav2vec proved effective in regression, modeling various quantitative speech characteristics related to articulation and aging. Ultimately, subsequent analysis of important features examined the presence of significant overlaps between classification and regression models. The feature importance experiments discovered shared features across trained models, with increased sharing for related tasks, further suggesting that wav2vec contributes to improved generalizability. The study proposes wav2vec embeddings as a next promising step toward a speech-based universal model to assist in the evaluation of PD.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11398018PMC
http://dx.doi.org/10.3390/s24175520DOI Listing

Publication Analysis

Top Keywords

classification regression
12
wav2vec embeddings
8
parkinson's disease
8
speech representations
8
cross-database classification
8
speech
7
wav2vec
6
cross-database
5
analyzing wav2vec
4
embeddings cross-database
4

Similar Publications

An Investigation of Hyperostosis Frontalis Interna in a Modern Anatomical Body Donor Population.

Clin Anat

September 2025

Department of Communication Disorders and Sciences, Rush University Medical Center, Chicago, Illinois, USA.

This research sought to examine the prevalence and severity of hyperostosis frontalis interna (HFI) in the Chicagoland anatomical body donor population. The study further aimed to elucidate potential demographic risk factors for HFI, including sex, age at death, and structural vulnerability index (SVI), as well as any common comorbidities, as gleaned from death certificates. HFI is an irregular bony overgrowth of the endocranial surface of the frontal bone.

View Article and Find Full Text PDF

Background: Poststroke cognitive impairment (PSCI) affects 30% to 50% of stroke survivors, severely impacting functional outcomes and quality of life. This study uses functional near-infrared spectroscopy (fNIRS) to assess task-evoked brain activation and its potential for stratifying the severity in patients with PSCI.

Method: A cross-sectional study was conducted at Nanchong Central Hospital between June 2023 and April 2024.

View Article and Find Full Text PDF

Work-related stress among sworn and non-sworn law enforcement personnel.

Int J Police Sci Manag

November 2024

Division of Environmental Health Sciences, School of Public Health, University of Minnesota, USA.

Sworn law enforcement personnel in the United States face high rates of work-related stress. Yet, the well-being of more than 300,000 non-sworn personnel, particularly regarding work-related trauma and stress, remains underexplored. This study aims to test the hypothesis that non-sworn personnel experience lower levels of stress, comparing stress and probable post-traumatic stress disorder (PTSD) between sworn and non-sworn personnel.

View Article and Find Full Text PDF

Accurate differentiation between persistent vegetative state (PVS) and minimally conscious state and estimation of recovery likelihood in patients in PVS are crucial. This study analyzed electroencephalography (EEG) metrics to investigate their relationship with consciousness improvements in patients in PVS and developed a machine learning prediction model. We retrospectively evaluated 19 patients in PVS, categorizing them into two groups: those with improved consciousness ( = 7) and those without improvement ( = 12).

View Article and Find Full Text PDF

Objectives: To develop a novel risk score (RS) model to predict the probability of progression to castration-resistant prostate cancer (PCa) (CRPC) after intensity-modulated radiation therapy (IMRT) for patients with high- and very high-risk PCa according to the National Comprehensive Cancer Network (NCCN) risk classification, since accurate prediction of the clinical outcome of definitive radiation therapy for patients with high- and very high-risk PCa remains challenging due to its heterogeneity.

Materials And Methods: We conducted a retrospective review of 600 patients with high- and very high-risk PCa treated with IMRT at our institution. They were randomly divided into discovery (n = 300) and validation (n = 300) cohorts.

View Article and Find Full Text PDF