Cracking the neural code for word recognition in convolutional neural networks.

PLoS Comput Biol

Cognitive Neuroimaging Unit, CEA, INSERM U 992, Université Paris-Saclay, NeuroSpin center, Gif/Yvette, France.

Published: September 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Learning to read places a strong challenge on the visual system. Years of expertise lead to a remarkable capacity to separate similar letters and encode their relative positions, thus distinguishing words such as FORM and FROM, invariantly over a large range of positions, sizes and fonts. How neural circuits achieve invariant word recognition remains unknown. Here, we address this issue by recycling deep neural network models initially trained for image recognition. We retrain them to recognize written words and then analyze how reading-specialized units emerge and operate across the successive layers. With literacy, a small subset of units becomes specialized for word recognition in the learned script, similar to the visual word form area (VWFA) in the human brain. We show that these units are sensitive to specific letter identities and their ordinal position from the left or the right of a word. The transition from retinotopic to ordinal position coding is achieved by a hierarchy of "space bigram" unit that detect the position of a letter relative to a blank space and that pool across low- and high-frequency-sensitive units from early layers of the network. The proposed scheme provides a plausible neural code for written words in the VWFA, and leads to predictions for reading behavior, error patterns, and the neurophysiology of reading.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11410253PMC
http://dx.doi.org/10.1371/journal.pcbi.1012430DOI Listing

Publication Analysis

Top Keywords

word recognition
12
neural code
8
ordinal position
8
word
5
cracking neural
4
code word
4
recognition
4
recognition convolutional
4
neural
4
convolutional neural
4

Similar Publications

Computer vision has been identified as one of the solutions to bridge communication barriers between speech-impaired populations and those without impairment as most people are unaware of the sign language used by speech-impaired individuals. Numerous studies have been conducted to address this challenge. However, recognizing word signs, which are usually dynamic and involve more than one frame per sign, remains a challenge.

View Article and Find Full Text PDF

A distinctive feature of the lexicon is its susceptibility to the order in which words are acquired; those learned earlier are accessed and retrieved more quickly than those acquired later-a phenomenon known as the age of acquisition (AoA) effect. This study investigates how vocabulary size (i.e.

View Article and Find Full Text PDF

The word-length effect refers to the finding that memory on many short-term/working memory tasks is better for words with fewer syllables than words with more syllables. The standard account attributes this result to a combination of decay offset by rehearsal: More short words can be rehearsed because they take less time to articulate. However, most studies have confounded length with lexical and other long-term memory factors that covary with length.

View Article and Find Full Text PDF

Mentalizing skills-the capacity to attribute mental states-play critical roles in word learning during typical language development. In autism, mentalizing difficulties may constrain word-learning pathways, limiting language-acquisition opportunities. We ask how autistic children encode and retrieve novel words and what drives individual differences.

View Article and Find Full Text PDF

Background: We sought to investigate the association between circulating inflammatory and cardiovascular proteomics biomarkers and cardiac autonomic nervous dysfunction-sensitive heart rate variability indices.

Methods: Using the population-based KORA (Cooperative Health Research in the Region of Augsburg) cohort, 233 proteomics biomarkers were quantified in baseline plasma samples of 1389 individuals using proximity extension assay technology. Five heart rate variability indices (Rényi entropy of the histogram with order [α] 4, total power of the density spectra, SD of word sequence, SD of the short-term normal-to-normal interval variability, compression entropy) were assessed at baseline in 982 individuals and in 407 individuals at baseline and at 14-year follow-up.

View Article and Find Full Text PDF