98%
921
2 minutes
20
When deep convolutional neural networks (CNNs) are trained "end-to-end" on raw data, some of the feature detectors they develop in their early layers resemble the representations found in early visual cortex. This result has been used to draw parallels between deep learning systems and human visual perception. In this study, we show that when CNNs are trained end-to-end they learn to classify images based on whatever feature is predictive of a category within the dataset. This can lead to bizarre results where CNNs learn idiosyncratic features such as high-frequency noise-like masks. In the extreme case, our results demonstrate image categorisation on the basis of a single pixel. Such features are extremely unlikely to play any role in human object recognition, where experiments have repeatedly shown a strong preference for shape. Through a series of empirical studies with standard high-performance CNNs, we show that these networks do not develop a shape-bias merely through regularisation methods or more ecologically plausible training regimes. These results raise doubts over the assumption that simply learning end-to-end in standard CNNs leads to the emergence of similar representations to the human visual system. In the second part of the paper, we show that CNNs are less reliant on these idiosyncratic features when we forgo end-to-end learning and introduce hard-wired Gabor filters designed to mimic early visual processing in V1.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.visres.2020.04.013 | DOI Listing |
Anal Methods
September 2025
College of Science, Kunming University of Science and Technology, Kunming, 650500, China.
To address the technical challenges associated with determining the chronological order of overlapping stamps and textual content in forensic document examination, this study proposes a novel non-destructive method that integrates hyperspectral imaging (HSI) with convolutional neural networks (CNNs). A multi-type cross-sequence dataset was constructed, comprising 60 samples of handwriting-stamp sequences and 20 samples of printed text-stamp sequences, all subjected to six months of natural aging. Spectral responses were collected across the 400-1000 nm range in the overlapping regions.
View Article and Find Full Text PDFIEEE J Biomed Health Inform
September 2025
Thyroid eye disease (TED) is a prevalent autoimmune orbital disorder that can severely impair visual function and significantly diminish patients' quality of life. In recent years, several studies have attempted to automate TED diagnosis using optical coherence tomography (OCT) images. However, existing approaches primarily rely on convolutional neural networks (CNNs) combined with attention mechanisms and are mostly trained using traditional cross-entropy loss.
View Article and Find Full Text PDFJ Oral Biol Craniofac Res
August 2025
Neura Integrasi Solusi, Jl. Kebun Raya No. 73, Rejowinangun, Kotagede, Yogyakarta, 55171, Indonesia.
Background: Periodontal disease is an inflammatory condition causing chronic damage to the tooth-supporting connective tissues, leading to tooth loss in adults. Diagnosing periodontitis requires clinical and radiographic examinations, with panoramic radiographs crucial in identifying and assessing its severity and staging. Convolutional Neural Networks (CNNs), a deep learning method for visual data analysis, and Dense Convolutional Networks (DenseNet), which utilize direct feed-forward connections between layers, enable high-performance computer vision tasks with reduced computational demands.
View Article and Find Full Text PDFFront Vet Sci
August 2025
Pathobiology and Population Science, Royal Veterinary College, Hatfield, United Kingdom.
Diffuse large B-cell lymphoma is the most common type of non-Hodgkin lymphoma (NHL) in humans, accounting for about 30-40% of NHL cases worldwide. Canine diffuse large B-cell lymphoma (cDLBCL) is the most common lymphoma subtype in dogs and demonstrates an aggressive biologic behaviour. For tissue biopsies, current confirmatory diagnostic approaches for enlarged lymph nodes rely on expert histopathological assessment, which is time-consuming and requires specialist expertise.
View Article and Find Full Text PDFJ Food Sci
September 2025
Faculty of Computing, Federal University of Uberlandia, Uberlândia, Brazil.
The coffee roasting process is a critical factor in determining the final quality of the beverage, influencing its flavour, aroma, and acidity. Traditionally, roast-level classification has relied on manual inspection, which is time-consuming, subjective, and prone to inconsistencies. However, advancements in machine learning (ML) and computer vision, particularly convolutional neural networks (CNNs), have shown great promise in automating and improving the accuracy of this process.
View Article and Find Full Text PDF