98%
921
2 minutes
20
Optical Character Recognition (OCR) systems play a crucial role in converting printed Arabic text into digital formats, enabling various applications such as education and digital archiving. However, the complex characteristics of the Arabic script, including its cursive nature, diacritical marks, handwriting, and ligatures, present significant challenges for accurate character recognition. This study proposes a hybrid transformer encoder-based model for Arabic printed and handwritten character classification. The methodology integrates transfer learning techniques utilizing pre-trained VGG16 and ResNet50 models for feature extraction, followed by a feature ensemble process. The transformer encoder architecture leverages its self-attention mechanism and multilayer perceptron (MLP) components to capture global dependencies and refine feature representations. The training and evaluation were conducted on the Arabic OCR and Arabic Handwritten Character Recognition (AHCR) datasets, achieving exceptional results with an accuracy of 99.51% and 98.19%, respectively. The proposed model is evaluated in an extension ablation study using the Arabic Char 4k OCR dataset for training, while testing on the part of the AHCR dataset to evaluate performance on unseen data. The proposed model significantly outperforms individual CNN-based models and ensemble techniques, demonstrating its robustness and efficiency for Arabic character classification. This research establishes a foundation for improved OCR systems, offering a reliable solution for real-world Arabic text recognition tasks.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12356925 | PMC |
http://dx.doi.org/10.1038/s41598-025-12045-z | DOI Listing |
Dev Cogn Neurosci
August 2025
State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China. Electronic address:
The relationship between brain activity and reading acquisition has been a research focus in recent years. In the current cross-sectional and longitudinal study, we aimed to investigate whether and how resting-state (rs) and task-state brain electrophysiological activity would predict children's reading ability. Here, we tracked 73 primary school children' orthographic awareness, reading ability, and EEG signals during both rest and completed a Chinese character recognition task over two consecutive years.
View Article and Find Full Text PDFMed Phys
August 2025
The University of Texas MD Anderson Cancer Houston, Houston, Texas, USA.
Background: To guarantee high-quality patient scans, thorough quality assurance (QA) of SPECT or gamma cameras, including performance, review, and documentation, is essential.
Purpose: We developed a novel Nuclear Medicine Quality Assurance server (NMQA) with an AI deep learning (AIDL) optical character recognition (OCR) system to automate QA data retrieval and review from SPECT and gamma cameras. The system extracts and compares daily and weekly QA data against specifications.
PLoS One
September 2025
Centre for Educational Neuroscience, Birkbeck University of London, Camden, London, United Kingdom.
Smooth social interactions rely on children's abilities to decode others' social signals, which includes what an individual may say or do, and their facial emotional expressions. Failure can lead to exclusion from social groups. Consequently, a number of social and emotional learning (SEL) training programmes have been developed, with some evidence of positive impacts on those skills themselves and on academic achievement.
View Article and Find Full Text PDFPhytoKeys
August 2025
CSIRO, Centre for Australian National Biodiversity Research (a joint venture of Parks Australia and CSIRO), Clunies Ross Street, Canberra ACT 2601, Australia Centre for Australian National Biodiversity Research Canberra Australia.
Morphological data are critical for taxonomy, evolutionary biology, ecology, and species identification. However, no widely used central database for morphological data exists as it does for DNA sequences or specimen data. Most of these data are "locked up" in taxonomic literature.
View Article and Find Full Text PDFSci Rep
August 2025
College of Information Science and Technology, HaiNan Normal University, Haikou, 571158, China.
The speech recognition task of the HaiNan dialect faces significant differences in phonology, intonation, and grammatical structure among dialects, which in turn show significant regionalization characteristics, which makes the task of dialect-to-Mandarin conversion more complex. Currently, the research on the HaiNan dialect speech recognition is still in its early stages and lacks sufficient corpus resources, especially in the task of multi-dialect recognition. Traditional models are difficult to solve with the problem of data scarcity and diverse dialect characteristics effectively.
View Article and Find Full Text PDF