Cracking the neural code for word recognition in convolutional neural networks.

Aakash Agrawal , Stanislas Dehaene

PLoS Comput Biol

Cognitive Neuroimaging Unit, CEA, INSERM U 992, Université Paris-Saclay, NeuroSpin center, Gif/Yvette, France.

Published: September 2024

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Learning to read places a strong challenge on the visual system. Years of expertise lead to a remarkable capacity to separate similar letters and encode their relative positions, thus distinguishing words such as FORM and FROM, invariantly over a large range of positions, sizes and fonts. How neural circuits achieve invariant word recognition remains unknown. Here, we address this issue by recycling deep neural network models initially trained for image recognition. We retrain them to recognize written words and then analyze how reading-specialized units emerge and operate across the successive layers. With literacy, a small subset of units becomes specialized for word recognition in the learned script, similar to the visual word form area (VWFA) in the human brain. We show that these units are sensitive to specific letter identities and their ordinal position from the left or the right of a word. The transition from retinotopic to ordinal position coding is achieved by a hierarchy of "space bigram" unit that detect the position of a letter relative to a blank space and that pool across low- and high-frequency-sensitive units from early layers of the network. The proposed scheme provides a plausible neural code for written words in the VWFA, and leads to predictions for reading behavior, error patterns, and the neurophysiology of reading.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11410253	PMC
http://dx.doi.org/10.1371/journal.pcbi.1012430	DOI Listing

Publication Analysis

Top Keywords

word recognition

neural code

ordinal position

word

cracking neural

code word

recognition

recognition convolutional

neural

convolutional neural

Similar Publications

Efficient spatio-temporal modeling for sign language recognition using CNN and RNN architectures.

Front Artif Intell

August 2025

School of Computation and Communication Science and Engineering, The Nelson Mandela African Institution of Science and Technology, Arusha, Tanzania.

Kasian Myagila , Devotha Godfrey Nyambo , Mussa Ally Dida

Computer vision has been identified as one of the solutions to bridge communication barriers between speech-impaired populations and those without impairment as most people are unaware of the sign language used by speech-impaired individuals. Numerous studies have been conducted to address this challenge. However, recognizing word signs, which are usually dynamic and involve more than one frame per sign, remains a challenge.

View Article and Find Full Text PDF

Similar Publications

EXPRESS: Influence of word Age-of-Acquisition (AoA), vocabulary size, formal-lexical similarity, and semantic richness of words on lexical recognition and production: A study on foreign-word training.

Q J Exp Psychol (Hove)

September 2025

Psychology Department, Swansea University, Swansea, UK.

Miguel Á Pérez-Sánchez , Lidia Gómez-Cobos , Javier Marín , Hans Stadthagen-Gonzalez , Cristina Izura

A distinctive feature of the lexicon is its susceptibility to the order in which words are acquired; those learned earlier are accessed and retrieved more quickly than those acquired later-a phenomenon known as the age of acquisition (AoA) effect. This study investigates how vocabulary size (i.e.

View Article and Find Full Text PDF

Similar Publications

Word length vs. lexical factors: Re-examining what causes the word-length effect in serial recognition.

Mem Cognit

September 2025

Department of Psychology, Virginia Tech, Blacksburg, VA, 24061, USA.

Dominic Guitard , Ian Neath , Aimée M Surprenant

The word-length effect refers to the finding that memory on many short-term/working memory tasks is better for words with fewer syllables than words with more syllables. The standard account attributes this result to a combination of decay offset by rehearsal: More short words can be rehearsed because they take less time to articulate. However, most studies have confounded length with lexical and other long-term memory factors that covary with length.

View Article and Find Full Text PDF

Similar Publications

From encoding to remembering: pragmatic inferences reveal distinct routes of word learning in autistic children.

Front Hum Neurosci

August 2025

Department of Psychology, Northeastern University, Boston, MA, United States.

Katherine Marie Trice , Zhenghan Qi

Mentalizing skills-the capacity to attribute mental states-play critical roles in word learning during typical language development. In autism, mentalizing difficulties may constrain word-learning pathways, limiting language-acquisition opportunities. We ask how autistic children encode and retrieve novel words and what drives individual differences.

View Article and Find Full Text PDF

Similar Publications

Association of Inflammatory and Cardiovascular Proteomics Biomarkers With Indices of Heart Rate Variability in the General Population: KORA S4/FF4 Study.

J Am Heart Assoc

September 2025

Institute for Clinical Diabetology, German Diabetes Center Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf Düsseldorf Germany.

Kolade Oluwagbemigun , Dan Ziegler , Alexander Strom , Margit Heier , Gidon Bönhof

Background: We sought to investigate the association between circulating inflammatory and cardiovascular proteomics biomarkers and cardiac autonomic nervous dysfunction-sensitive heart rate variability indices.

Methods: Using the population-based KORA (Cooperative Health Research in the Region of Augsburg) cohort, 233 proteomics biomarkers were quantified in baseline plasma samples of 1389 individuals using proximity extension assay technology. Five heart rate variability indices (Rényi entropy of the histogram with order [α] 4, total power of the density spectra, SD of word sequence, SD of the short-term normal-to-normal interval variability, compression entropy) were assessed at baseline in 982 individuals and in 407 individuals at baseline and at 14-year follow-up.

View Article and Find Full Text PDF

Similar Publications