[Speech coding strategy based on amplitude and frequency modulation for cochlear implants].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi

Medical Engineering Support Center of Chinese, PLA General Hospital, Beijing 100853, China.

Published: April 2011


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

To enhance speech recognition in noise, as well as tone recognition, we presented a new kind of speech coding strategy, called one-octave wavelet transform zero-crossing stimulation (WTZS), for cochlear implants based on amplitude and frequency modulation. We selected 15 volunteers with normal hearing ability to carry out hearing simulation experiments by picking up the amplitude (amplitude modulation, AM), zero-crossings (frequency modulation, FM) and gradient parameters from processed speech signal in the domain of one-octave wavelet transform to synthesize the stimulating pulstile series. The experimental results demonstrated that the phonetic recognition in quiet surroundings with amplitude modulation only strategy (CIS) is similar to that of amplitude and frequency modulations strategies (FAME and WTZS), while the tone perception of CIS is inferior to that of FAME and WTZS strategies. However, in noisy environment, the phonetic recognition, tone perception, as well as sentence recognition of WTZS strategy are better than those of CIS and FAME strategies. WTZS strategy, utilizing amplitude (AM), zero-crossings (FM) and gradient parameters to synthesize stimulus, can enhance the phonetic and tonal language recognition in noise environment effectively, and could be used in cochlear implant system for speech processor design after arithmetic optimization.

Download full-text PDF

Source

Publication Analysis

Top Keywords

amplitude frequency
12
frequency modulation
12
coding strategy
8
based amplitude
8
recognition noise
8
one-octave wavelet
8
wavelet transform
8
amplitude modulation
8
gradient parameters
8
phonetic recognition
8

Similar Publications

Understanding gastric physiology in rodents is critical for advancing preclinical neurogastroenterology research. However, existing techniques are often invasive, terminal, or limited in resolution. This study aims to develop a non-invasive, standardized MRI protocol capable of capturing whole-stomach dynamics in anesthetized rats with high spatiotemporal resolution.

View Article and Find Full Text PDF

Layer 6 corticothalamic (L6CT) neurons project to both cortex and thalamus, inducing multiple effects including the modulation of cortical and thalamic firing, and the emergence of high gamma oscillations in the cortical local field potential (LFP). We hypothesize that the high gamma oscillations driven by L6CT neuron activation reflect the dynamic engagement of intracortical and cortico-thalamo-cortical circuits. To test this, we optogenetically activated L6CT neurons in NTSR1-cre mice (both male and female) expressing channelrhodopsin-2 in L6CT neurons.

View Article and Find Full Text PDF

Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

Biomed Phys Eng Express

September 2025

electrical engineering department, Indian Institute of Technology Roorkee, Research wing, electrical department, Roorkee, uttrakhand, 247664, INDIA.

Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to communicate through neural signals. The prime objective of this work is to propose an innovative machine learning (ML) based classification methodology that combines electroencephalogram (EEG) data augmentation using a sliding window technique with statistical feature extraction from the amplitude and phase spectrum of frequency domain EEG segments.

View Article and Find Full Text PDF

Experimental study on identifying catastrophic failure in the brittle fracture process via multi-source acoustic characteristics.

Ultrasonics

September 2025

Faculty of Land Resource Engineering, Kunming University of Science and Technology, Yunnan 650093, China; Key Laboratory of Geohazard Forecast and Geoecological Restoration in Plateau Mountainous Area, Ministry of Natural Resources of the People's Republic of China, Yunnan Province, Kunming, Yunnan

Identifying and predicting the catastrophic failure of brittle rock remains a challenging task, yet it is crucial for developing early warning systems and preventing dynamic rock hazards. In this study, we employed the propagative parameters of ultrasonic waves and information from acoustic emission (AE) events to characterize the brittle failure of a flawed sandstone sample under uniaxial compression. A sliding event window method was developed to obtain the temporal b-value, effectively revealing microcrack growth based on the frequency-magnitude distribution of AE events.

View Article and Find Full Text PDF

Vocal tract contribution to vocal intensity: Interaction between vocal fold adduction, formant tuning, and fundamental frequency.

J Acoust Soc Am

September 2025

Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehab Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA.

The goal of this study was to understand the interaction between the voice source spectral shape, formant tuning, and fundamental frequency in determining the vocal tract contribution to vocal intensity. Computational voice simulations were performed with parametric variations in both vocal fold and vocal tract configurations. The vocal tract contribution to vocal intensity was quantified as the difference in the A-weighted sound pressure level between the radiated sound pressure and the sound pressure at the glottis.

View Article and Find Full Text PDF