The development of deep convolutional generative adversarial network to synthesize odontocetes' clicks.

J Acoust Soc Am

Key Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361005, China.

Published: January 2025


Citations: 20

Article Abstract

Odontocetes can dynamically change their echolocation clicks to detect targets efficiently, and learning their clicking strategy can facilitate the design of man-made detection signals. In this study, we developed deep convolutional generative adversarial networks guided by an acoustic feature vector (AF-DCGANs) to synthesize narrowband clicks of the finless porpoise (Neophocaena phocaenoides sunameri) and broadband clicks of the bottlenose dolphin (Tursiops truncatus). The average short-time objective intelligibility (STOI), spectral correlation coefficient (Spe-CORR), waveform correlation coefficient (Wave-CORR), and dynamic time warping distance (DTW-Distance) of the synthetic clicks were 0.975, 0.968, 0.877, and 0.992, respectively. AF-DCGAN outperformed the minimum phase signal reconstruction (MPSR) method and the vector quantized variational autoencoder (VQ-VAE) by 5.9% and 3.7% in STOI, 5.2% and 3.5% in Spe-CORR, and 5.8% and 2.8% in Wave-CORR, respectively. In addition, AF-DCGAN reduced DTW-Distance by 29.9% and 9.4% compared with MPSR and VQ-VAE, respectively. The results show that AF-DCGAN robustly synthesizes both narrowband and broadband clicks and can produce a substantial number of high-fidelity odontocete clicks with flexibly adjustable parameters. Employing AF-DCGAN to synthesize odontocete-like clicks could advance the development of a click database, with promising applications in research on biomimetic target detection and recognition.
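To make the similarity measures named in the abstract concrete, the following is a minimal NumPy sketch of a waveform correlation, a spectral correlation, and a dynamic time warping distance between a reference click and a synthetic one. It is an illustration under stated assumptions, not the authors' evaluation code: the function names, the FFT length, the absolute-difference DTW cost, and the toy Gaussian-windowed click (125 kHz centre frequency, 500 kHz sampling rate) are all hypothetical choices, and the abstract does not say how its DTW-Distance is normalized. STOI is left out because it requires a dedicated implementation.

```python
import numpy as np


def wave_corr(x, y):
    """Pearson correlation between two equal-length waveforms."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.corrcoef(x, y)[0, 1])


def spe_corr(x, y, n_fft=1024):
    """Pearson correlation between magnitude spectra (n_fft is an assumption)."""
    mag_x = np.abs(np.fft.rfft(x, n=n_fft))
    mag_y = np.abs(np.fft.rfft(y, n=n_fft))
    return float(np.corrcoef(mag_x, mag_y)[0, 1])


def dtw_distance(x, y):
    """Unnormalized DTW distance with an absolute-difference local cost."""
    n, m = len(x), len(y)
    acc = np.full((n + 1, m + 1), np.inf)  # accumulated-cost matrix
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(x[i - 1] - y[j - 1])
            acc[i, j] = cost + min(acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1])
    return float(acc[n, m])


def toy_click(fc, fs=500_000, n=256, width=20e-6):
    """Gaussian-windowed tone burst used as a stand-in click (hypothetical)."""
    t = (np.arange(n) - n / 2) / fs
    return np.exp(-0.5 * (t / width) ** 2) * np.cos(2 * np.pi * fc * t)


# Reference click and a noisy copy standing in for a synthesized click.
reference = toy_click(fc=125_000)
rng = np.random.default_rng(0)
candidate = reference + 0.05 * rng.standard_normal(reference.size)

print("Wave-CORR:", wave_corr(reference, candidate))
print("Spe-CORR: ", spe_corr(reference, candidate))
print("DTW:      ", dtw_distance(reference, candidate))
```

Higher correlation values and lower DTW distances indicate a closer match to the reference, which is the direction of improvement reported above for AF-DCGAN over MPSR and VQ-VAE.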

Source
http://dx.doi.org/10.1121/10.0034865

Publication Analysis

Top Keywords

deep convolutional: 8
convolutional generative: 8
generative adversarial: 8
clicks: 8
odontocetes' clicks: 8
broadband clicks: 8
correlation coefficient: 8
development deep: 4
adversarial network: 4
network synthesize: 4

Similar Publications

This study explores how differences in colors presented separately to each eye (binocular color differences) can be identified through EEG signals, a method of recording electrical activity from the brain. Four distinct levels of green-red color differences, defined in the CIELAB color space with constant luminance and chroma, are investigated in this study. Analysis of Event-Related Potentials (ERPs) revealed a significant decrease in the amplitude of the P300 component as binocular color differences increased, suggesting a measurable brain response to these differences.

Neuroimaging Data Informed Mood and Psychosis Diagnosis Using an Ensemble Deep Multimodal Framework.

Hum Brain Mapp

September 2025

Tri-Institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS), Georgia State University, Georgia Institute of Technology, and Emory University, Atlanta, Georgia, USA.

Investigating neuroimaging data to identify brain-based markers of mental illnesses has gained significant attention. Nevertheless, these endeavors encounter challenges arising from a reliance on symptoms and self-report assessments in making an initial diagnosis. The absence of biological data to delineate nosological categories hinders the provision of additional neurobiological insights into these disorders.

Hybrid two-stage CNN for detection and staging of periodontitis on panoramic radiographs.

J Oral Biol Craniofac Res

August 2025

Neura Integrasi Solusi, Jl. Kebun Raya No. 73, Rejowinangun, Kotagede, Yogyakarta, 55171, Indonesia.

Background: Periodontal disease is an inflammatory condition causing chronic damage to the tooth-supporting connective tissues, leading to tooth loss in adults. Diagnosing periodontitis requires clinical and radiographic examinations, with panoramic radiographs crucial in identifying and assessing its severity and staging. Convolutional Neural Networks (CNNs), a deep learning method for visual data analysis, and Dense Convolutional Networks (DenseNet), which utilize direct feed-forward connections between layers, enable high-performance computer vision tasks with reduced computational demands.

DeepRNAac4C: a hybrid deep learning framework for RNA N4-acetylcytidine site prediction.

Front Genet

August 2025

Hunan Provincial Key Laboratory of Finance and Economics Big Data Science and Technology, Hunan University of Finance and Economics, Changsha, China.

RNA N4-acetylcytidine (ac4C) is a crucial chemical modification involved in various biological processes, influencing RNA properties and functions. Accurate prediction of RNA ac4C sites is essential for understanding the roles of RNA molecules in gene expression and cellular regulation. While existing methods have made progress in ac4C site prediction, they still struggle with limited accuracy and generalization.

Diffuse large B-cell lymphoma is the most common type of non-Hodgkin lymphoma (NHL) in humans, accounting for about 30-40% of NHL cases worldwide. Canine diffuse large B-cell lymphoma (cDLBCL) is the most common lymphoma subtype in dogs and demonstrates an aggressive biologic behaviour. For tissue biopsies, current confirmatory diagnostic approaches for enlarged lymph nodes rely on expert histopathological assessment, which is time-consuming and requires specialist expertise.
