98%
921
2 minutes
20
To investigate how the human brain encodes the complex dynamics of natural languages, any viable and reproducible analysis pipeline must rely on either manual annotations or natural language processing (NLP) tools, which extract relevant physical (e.g., acoustic, gestural), and structure-building information from speech and language signals. However, annotating syntactic structure for a given natural language is arguably a harder task than annotating the onset and offset of speech units such as phonemes and syllables, as the latter can be identified by relying on the physically overt and temporally measurable properties of the signal, while syntactic units are generally covert and their chunking is model-driven. We describe and validate a pipeline that takes into account both physical and theoretical aspects of speech and language signals, and operates a theory-driven and explicit alignment between overt speech units and covert syntactic units.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12331764 | PMC |
http://dx.doi.org/10.3758/s13428-025-02747-7 | DOI Listing |
Front Hum Neurosci
August 2025
Brain and Cognitive Sciences Program, School of Psychological and Behavioral Sciences, Southern Illinois University Carbondale, Carbondale, IL, United States.
Our research contributes to debates about the role of Universal Grammar constraints and crosslinguistic influence in sequential bilingual acquisition and use. We investigate experimentally how adult Romanian L1-English L2 bilinguals interpret sequential adjectival modifiers of a noun in recursive set-subset contexts in both languages (e.g.
View Article and Find Full Text PDFJ Psycholinguist Res
September 2025
Center for Language and Cognition Groningen (CLCG), University of Groningen, Groningen, The Netherlands.
For individuals with agrammatic aphasia, producing sentences with non-canonical word orders is a challenging feat. Studies on different languages report deficits in this area of sentence production: some citing problems related to retrieval of verb morphology while others pursue a more holistic approach by attributing the root of the deficit towards the process of thematic role assignment. It has been shown that agrammatic speakers of Standard Indonesian are relatively unimpaired in the use (in spontaneous speech) and comprehension of passive constructions.
View Article and Find Full Text PDFMem Cognit
September 2025
Department of Psychology, Psychology & Cognitive Neuroscience Research Unit, University of Liège, Place Des Orateurs 1 (B33), 4000, Liège, Belgium.
A large body of research demonstrates robust interactions between verbal working memory (WM) and phonological and lexico-semantic language knowledge, particularly for item recall. The role of syntactic knowledge, involving knowledge about word positioning in a verbal sequence, has been explored to a lesser extent but is of theoretical interest given that this type of knowledge may also support serial order recall in multi-item sequences. This hypothesis has not been supported so far, either by French or German language studies.
View Article and Find Full Text PDFAphasiology
December 2024
Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana, USA.
Background & Aims: Decades of research on structural priming - speakers' tacit reuse of previously encountered syntactic structures in subsequent production and comprehension of sentences - has made substantial contributions to theories of syntactic representations, processing, and language learning and acquisition. There is growing interest in the application of structural priming to assess and facilitate language processing and learning in clinical populations. Yet, little research has explored structural priming in aphasia.
View Article and Find Full Text PDFData Brief
October 2025
Department of Language Preparation, Arabic Language Teaching Institute, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia.
This article describes the Extended Quranic Treebank (EQTB), a comprehensive, multi-layered, and computationally accessible linguistic resource for Classical Arabic (CA), meticulously developed to overcome the documented limitations of the original Quranic Treebank. Leveraging foundational data from established Quranic digital resources, EQTB features systematically expanded orthographic representations generated via algorithmic processing and validation; rigorously refined morphological annotations based on expanded expert-informed schemas, automated re-annotation, and manual curation; and critically, a novel, complete syntactic layer constructed through algorithmic conversion of prior graphical data, Deep Learning-based parsing achieving full coverage under a hybrid constituency-dependency framework, and expert validation. Encompassing the entire Quran (∼132,736 tokens), the dataset is structured in an adapted CoNLL-X format across 43 columns, detailing multiple orthographies, fine-grained morphology (45 tags), and complete hybrid syntax (140 tags/labels), complemented by auxiliary lexicons and schemas.
View Article and Find Full Text PDF