Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Many odontocetes produce whistles that feature characteristic contour shapes in spectrogram representations of their calls. Automatically extracting the time × frequency tracks of whistle contours has numerous subsequent applications, including species classification, identification, and density estimation. Deep-learning-based methods, which train models using analyst-annotated whistles, offer a promising way to reliably extract whistle contours. However, the application of such methods can be limited by the significant amount of time and labor required for analyst annotation. To overcome this challenge, a technique that learns from automatically generated pseudo-labels has been developed. These annotations are less accurate than those generated by human analysts but more cost-effective to generate. It is shown that standard training methods do not learn effective models from these pseudo-labels. An improved loss function designed to compensate for pseudo-label error that significantly increases whistle extraction performance is introduced. The experiments show that the developed technique performs well when trained with pseudo-labels generated by two different algorithms. Models trained with the generated pseudo-labels can extract whistles with an F1-score (the harmonic mean of precision and recall) of 86.31% and 87.2% for the two sets of pseudo-labels that are considered. This performance is competitive with a model trained with 12 539 expert-annotated whistles (F1-score of 87.47%).

Download full-text PDF

Source
http://dx.doi.org/10.1121/10.0020274DOI Listing

Publication Analysis

Top Keywords

whistle contours
12
time frequency
8
generated pseudo-labels
8
whistles f1-score
8
pseudo-labels
5
deep learning
4
learning track
4
track time
4
whistle
4
frequency whistle
4

Similar Publications

Understanding the vocal repertoire of delphinid species is essential for effective long-term studies. In this research, we present the first quantitative analysis of whistle and burst-pulse sound characteristics for the critically endangered Burrunan dolphin (). Acoustic data were collected from the two known resident populations in Port Phillip Bay (PPB) and the Gippsland Lakes (GL), Victoria, Australia, between 2016 and 2023.

View Article and Find Full Text PDF

Infant-directed singing (IDSi) is a natural means of dyadic communication that contributes to children's mental health by enhancing emotion expression, close relationships, exploration and learning. Therefore, it is important to learn about factors that impact the IDSi. This study modeled the mother- (mental health), infant- (emotional responses and health status) and environment (war trauma)-related factors influencing acoustic IDSi features, such as pitch (F0) variability, amplitude and vibration and the F0 contour of shapes and movements.

View Article and Find Full Text PDF

The lip flip: a systematic review of botulinum toxin lip augmentation.

Arch Dermatol Res

May 2025

Mohs Micrographic & Reconstructive Surgery, Epiphany Dermatology, Dallas, TX, USA.

The cosmetic use of botulinum toxin type-A (BTA) has expanded to include the "lip flip" procedure, which enhances lip contour and fullness without adding volume. This minimally invasive and cost-effective technique offers a favorable safety profile for patients seeking subtle lip enhancement. This review systematically examines the technique, clinical outcomes, and safety of the lip flip through neuromodulation with BTA.

View Article and Find Full Text PDF

Singing is a universal human attribute. Previous studies suggest that the ability to produce words through singing can be preserved in poststroke aphasia (PSA) and that this is mainly subserved by the spared parts of the left-lateralized language network. However, it remains unclear to what extent the production of rhythmic-melodic acoustic patterns in singing remains preserved in aphasia and which neural networks and hemisphere(s) are involved in this.

View Article and Find Full Text PDF

Bottlenose dolphins produce individually distinctive signature whistles (SWs) to broadcast identity. Stability of the SW frequency contour is essential to preserve identity information; however, SWs could carry additional information which requires variability. We used acoustic recordings from provisioned free-swimming Indo-Pacific bottlenose dolphins at Moreton Island (Mulgumpin), Australia, collected in 2002 and 2017-2018, to assess the long-term (15-year) stability in SWs, and if variability of acoustic parameters increases with whistle complexity.

View Article and Find Full Text PDF