Aligning syntactic structure to the dynamics of verbal communication: A pipeline for annotating syntactic phrases onto speech acoustics.

Behav Res Methods

Department of Psychology, Johann Wolfgang Goethe-Universität Frankfurt am Main, Theodor-W.-Adorno-Platz 6, Frankfurt am Main, 60623, Germany.

Published: August 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

To investigate how the human brain encodes the complex dynamics of natural languages, any viable and reproducible analysis pipeline must rely on either manual annotations or natural language processing (NLP) tools, which extract relevant physical (e.g., acoustic, gestural), and structure-building information from speech and language signals. However, annotating syntactic structure for a given natural language is arguably a harder task than annotating the onset and offset of speech units such as phonemes and syllables, as the latter can be identified by relying on the physically overt and temporally measurable properties of the signal, while syntactic units are generally covert and their chunking is model-driven. We describe and validate a pipeline that takes into account both physical and theoretical aspects of speech and language signals, and operates a theory-driven and explicit alignment between overt speech units and covert syntactic units.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12331764	PMC
http://dx.doi.org/10.3758/s13428-025-02747-7	DOI Listing

Publication Analysis

Top Keywords

syntactic structure

annotating syntactic

natural language

speech language

language signals

speech units

syntactic units

speech

aligning syntactic

structure dynamics

Similar Publications

The role of Universal Grammar and crosslinguistic influence in the interpretation of recursive set-subset adjectives in adult Romanian L1-English L2 bilinguals.

Front Hum Neurosci

August 2025

Brain and Cognitive Sciences Program, School of Psychological and Behavioral Sciences, Southern Illinois University Carbondale, Carbondale, IL, United States.

Adina Camelia Bleotu , Deborah Foucault , Tom Roeper , Usha Lakshmanan

Our research contributes to debates about the role of Universal Grammar constraints and crosslinguistic influence in sequential bilingual acquisition and use. We investigate experimentally how adult Romanian L1-English L2 bilinguals interpret sequential adjectival modifiers of a noun in recursive set-subset contexts in both languages (e.g.

View Article and Find Full Text PDF

Similar Publications

Sentence Production in Standard Indonesian Agrammatism.

J Psycholinguist Res

September 2025

Center for Language and Cognition Groningen (CLCG), University of Groningen, Groningen, The Netherlands.

Bernard A J Jap , Roelien Bastiaanse

For individuals with agrammatic aphasia, producing sentences with non-canonical word orders is a challenging feat. Studies on different languages report deficits in this area of sentence production: some citing problems related to retrieval of verb morphology while others pursue a more holistic approach by attributing the root of the deficit towards the process of thematic role assignment. It has been shown that agrammatic speakers of Standard Indonesian are relatively unimpaired in the use (in spontaneous speech) and comprehension of passive constructions.

View Article and Find Full Text PDF

Similar Publications

Syntactic knowledge does support working memory for serial order: A comparison across French and German languages.

Mem Cognit

September 2025

Department of Psychology, Psychology & Cognitive Neuroscience Research Unit, University of Liège, Place Des Orateurs 1 (B33), 4000, Liège, Belgium.

Hong Xiao , Judith Schweppe , Pauline Querella , Lucie Attout , Friederike Contier

A large body of research demonstrates robust interactions between verbal working memory (WM) and phonological and lexico-semantic language knowledge, particularly for item recall. The role of syntactic knowledge, involving knowledge about word positioning in a verbal sequence, has been explored to a lesser extent but is of theoretical interest given that this type of knowledge may also support serial order recall in multi-item sequences. This hypothesis has not been supported so far, either by French or German language studies.

View Article and Find Full Text PDF

Similar Publications

Structural Priming in Aphasia: A State-of-the-Art Review and Future Directions.

Aphasiology

December 2024

Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana, USA.

Jiyeon Lee

Background & Aims: Decades of research on structural priming - speakers' tacit reuse of previously encountered syntactic structures in subsequent production and comprehension of sentences - has made substantial contributions to theories of syntactic representations, processing, and language learning and acquisition. There is growing interest in the application of structural priming to assess and facilitate language processing and learning in clinical populations. Yet, little research has explored structural priming in aphasia.

View Article and Find Full Text PDF

Similar Publications

A complete, multi-layered quranic treebank dataset with hybrid syntactic annotations for classical arabic processing.

Data Brief

October 2025

Department of Language Preparation, Arabic Language Teaching Institute, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia.

Wadee A Nashir , Abdulqader M Mohsen , Asma A Al-Shargabi , Mohamed K Nour , Badriyya B Al-Onazi

This article describes the Extended Quranic Treebank (EQTB), a comprehensive, multi-layered, and computationally accessible linguistic resource for Classical Arabic (CA), meticulously developed to overcome the documented limitations of the original Quranic Treebank. Leveraging foundational data from established Quranic digital resources, EQTB features systematically expanded orthographic representations generated via algorithmic processing and validation; rigorously refined morphological annotations based on expanded expert-informed schemas, automated re-annotation, and manual curation; and critically, a novel, complete syntactic layer constructed through algorithmic conversion of prior graphical data, Deep Learning-based parsing achieving full coverage under a hybrid constituency-dependency framework, and expert validation. Encompassing the entire Quran (∼132,736 tokens), the dataset is structured in an adapted CoNLL-X format across 43 columns, detailing multiple orthographies, fine-grained morphology (45 tags), and complete hybrid syntax (140 tags/labels), complemented by auxiliary lexicons and schemas.

View Article and Find Full Text PDF

Similar Publications