Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

Biomed Phys Eng Express

electrical engineering department, Indian Institute of Technology Roorkee, Research wing, electrical department, Roorkee, uttrakhand, 247664, INDIA.

Published: September 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to communicate through neural signals. The prime objective of this work is to propose an innovative machine learning (ML) based classification methodology that combines electroencephalogram (EEG) data augmentation using a sliding window technique with statistical feature extraction from the amplitude and phase spectrum of frequency domain EEG segments. This work uses an EEG dataset recorded from a 64 channel device during the imagination of long-words, short words, and vowels with 15 human subjects. First, the raw EEG data is filtered between 1 Hz and 100 Hz, then segmented using a sliding window-based data augmentation technique with a window size of 100 and 50% overlap. The Fourier Transform is applied to each windowed segment to compute the amplitude and phase spectrum of the signal at each frequency point. The next step is to extract 50 statistical features from the amplitude and phase spectrum of frequency domain segments. Out of these, the 25 most statistically significant features are selected by applying the Kruskal-Wallis test.The extracted feature vectors are classified using six different machine learning based classifiers named support vector machine (SVM), K nearest neighbor (KNN), Random Forest (RF), XGBoost, LightGBM, and CatBoost. The CatBoost classifier outperforms other machine learning classifiers by achieving the highest accuracy of 91.72±1.52% for long words classification, 91.68±1.54% for long vs short word classification, 88.05±3.07% for short word classification, and 88.89±1.97% for vowel classification. The performance of the proposed model is assessed using five performance evaluation metrics: accuracy, F1-score, precision, recall, and Cohen's kappa. Compared to the existing literature, this study has achieved a 5%-7 % improvement with the CatBoost classifier and extracted feature matrix. .

Download full-text PDF	Source
http://dx.doi.org/10.1088/2057-1976/ae04ee	DOI Listing

Publication Analysis

Top Keywords

machine learning

amplitude phase

phase spectrum

learning based

spectrum frequency

frequency domain

based classification

imagined speech

domain eeg

eeg data

Similar Publications

Two step approach for detecting and segmenting the second mesiobuccal canal of maxillary first molars on cone beam computed tomography (CBCT) images via artificial intelligence.

BMC Oral Health

September 2025

Oral and Maxillofacial Radiology Department, Cairo university, Cairo, Egypt.

Sally Mansour , Enas Anter , Ali Khater Mohamed , Mushira M Dahaba , Arwa Mousa

Aim: The purpose of this study was to assess the accuracy of a customized deep learning model based on CNN and U-Net for detecting and segmenting the second mesiobuccal canal (MB2) of maxillary first molar teeth on cone beam computed tomography (CBCT) scans.

Methodology: CBCT scans of 37 patients were imported into 3D slicer software to crop and segment the canals of the mesiobuccal (MB) root of the maxillary first molar. The annotated data were divided into two groups: 80% for training and validation and 20% for testing.

View Article and Find Full Text PDF

Similar Publications

Correction: An explainable machine learning model for early warning of hypertensive and hypotensive anomalies in maintenance hemodialysis patients.

BMC Nephrol

September 2025

School of Computer Science and Technology, Guangxi University of Science and Technology, Liuzhou, China.

Zhuoyu Li , Siying Hao , Shujun Shi , Lin Li , Ziwei Tao

View Article and Find Full Text PDF

Similar Publications

A systematic review of EEG-based machine learning classifications for obsessive-compulsive disorder: current status and future directions.

BMC Psychiatry

September 2025

Department of Cognitive Neuroscience, Faculty of Biology, Bielefeld University, Bielefeld, Germany.

Mahdi Naderi , Amir Jahanian-Najafabadi

Obsessive-compulsive disorder (OCD) is a chronic and disabling condition affecting approximately 3.5% of the global population, with diagnosis on average delayed by 7.1 years or often confounded with other psychiatric disorders.

View Article and Find Full Text PDF

Similar Publications

Early prediction of orthodontic gingival enlargement using S100A4: a biomarker-based risk stratification model.

Odontology

September 2025

Department of Periodontics, Saveetha Dental College and Hospital, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamil Nadu, India.

Víctor Simancas-Escorcia , Jaime Plazas-Román , Antonio Díaz-Caballero , Adel Martínez-Martínez , Carlos M Ardila

Orthodontic-induced gingival enlargement (OIGE) affects approximately 15-30% of patients undergoing orthodontic treatment and remains largely unpredictable, often relying on subjective clinical assessments made after irreversible tissue changes have occurred. S100A4 is a well-characterized marker of activated fibroblasts involved in pathological tissue remodeling. This was a cross-sectional precision biomarker study that analyzed gingival tissue samples from three groups: healthy controls (n = 60), orthodontic patients without gingival enlargement (n = 31), and patients with clinically diagnosed OIGE (n = 61).

View Article and Find Full Text PDF

Similar Publications

The imitation game: large language models versus multidisciplinary tumor boards: benchmarking AI against 21 sarcoma centers from the ring trial.

J Cancer Res Clin Oncol

September 2025

Department of Surgery, Mannheim School of Medicine, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany.

Cheng-Peng Li , Aimé Terence Kalisa , Siyer Roohani , Kamal Hummedah , Franka Menge

Purpose: The study aims to compare the treatment recommendations generated by four leading large language models (LLMs) with those from 21 sarcoma centers' multidisciplinary tumor boards (MTBs) of the sarcoma ring trial in managing complex soft tissue sarcoma (STS) cases.

Methods: We simulated STS-MTBs using four LLMs-Llama 3.2-vison: 90b, Claude 3.

View Article and Find Full Text PDF

Similar Publications