Hiding a plane with a pixel: examining shape-bias in CNNs and the benefit of building in biological constraints.

Gaurav Malhotra , Benjamin D Evans , Jeffrey S Bowers

Vision Res

School of Psychological, Science University of Bristol, Bristol BS8 1TU, UK.

Published: September 2020

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

When deep convolutional neural networks (CNNs) are trained "end-to-end" on raw data, some of the feature detectors they develop in their early layers resemble the representations found in early visual cortex. This result has been used to draw parallels between deep learning systems and human visual perception. In this study, we show that when CNNs are trained end-to-end they learn to classify images based on whatever feature is predictive of a category within the dataset. This can lead to bizarre results where CNNs learn idiosyncratic features such as high-frequency noise-like masks. In the extreme case, our results demonstrate image categorisation on the basis of a single pixel. Such features are extremely unlikely to play any role in human object recognition, where experiments have repeatedly shown a strong preference for shape. Through a series of empirical studies with standard high-performance CNNs, we show that these networks do not develop a shape-bias merely through regularisation methods or more ecologically plausible training regimes. These results raise doubts over the assumption that simply learning end-to-end in standard CNNs leads to the emergence of similar representations to the human visual system. In the second part of the paper, we show that CNNs are less reliant on these idiosyncratic features when we forgo end-to-end learning and introduce hard-wired Gabor filters designed to mimic early visual processing in V1.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.visres.2020.04.013	DOI Listing

Publication Analysis

Top Keywords

cnns trained

early visual

human visual

idiosyncratic features

cnns

hiding plane

plane pixel

pixel examining

examining shape-bias

shape-bias cnns

Similar Publications

Chronological differentiation of printed or handwritten text and stamps based on hyperspectral imaging and convolutional neural networks.

Anal Methods

September 2025

College of Science, Kunming University of Science and Technology, Kunming, 650500, China.

Xiaoquan Lu , Jianqiang Zhang , Fan Li , Jiaquan Wu , Xinyu Zhang

To address the technical challenges associated with determining the chronological order of overlapping stamps and textual content in forensic document examination, this study proposes a novel non-destructive method that integrates hyperspectral imaging (HSI) with convolutional neural networks (CNNs). A multi-type cross-sequence dataset was constructed, comprising 60 samples of handwriting-stamp sequences and 20 samples of printed text-stamp sequences, all subjected to six months of natural aging. Spectral responses were collected across the 400-1000 nm range in the overlapping regions.

View Article and Find Full Text PDF

Similar Publications

MediTEDNet: Visual State Space Model for Thyroid Eye Disease Classification.

IEEE J Biomed Health Inform

September 2025

Meihui Han , Junfen Chen , Mingyan Yao , Bojun Xie

Thyroid eye disease (TED) is a prevalent autoimmune orbital disorder that can severely impair visual function and significantly diminish patients' quality of life. In recent years, several studies have attempted to automate TED diagnosis using optical coherence tomography (OCT) images. However, existing approaches primarily rely on convolutional neural networks (CNNs) combined with attention mechanisms and are mostly trained using traditional cross-entropy loss.

View Article and Find Full Text PDF

Similar Publications

Hybrid two-stage CNN for detection and staging of periodontitis on panoramic radiographs.

J Oral Biol Craniofac Res

August 2025

Neura Integrasi Solusi, Jl. Kebun Raya No. 73, Rejowinangun, Kotagede, Yogyakarta, 55171, Indonesia.

Rini Widyaningrum , Eha Renwi Astuti , Adioro Soetojo , Amalia Nur Faadiya , Aga Satria Nurrachman

Background: Periodontal disease is an inflammatory condition causing chronic damage to the tooth-supporting connective tissues, leading to tooth loss in adults. Diagnosing periodontitis requires clinical and radiographic examinations, with panoramic radiographs crucial in identifying and assessing its severity and staging. Convolutional Neural Networks (CNNs), a deep learning method for visual data analysis, and Dense Convolutional Networks (DenseNet), which utilize direct feed-forward connections between layers, enable high-performance computer vision tasks with reduced computational demands.

View Article and Find Full Text PDF

Similar Publications

Deep-learning based morphological segmentation of canine diffuse large B-cell lymphoma.

Front Vet Sci

August 2025

Pathobiology and Population Science, Royal Veterinary College, Hatfield, United Kingdom.

Kenneth Ancheta , Androniki Psifidi , Andrew D Yale , Sophie Le Calvez , Jonathan Williams

Diffuse large B-cell lymphoma is the most common type of non-Hodgkin lymphoma (NHL) in humans, accounting for about 30-40% of NHL cases worldwide. Canine diffuse large B-cell lymphoma (cDLBCL) is the most common lymphoma subtype in dogs and demonstrates an aggressive biologic behaviour. For tissue biopsies, current confirmatory diagnostic approaches for enlarged lymph nodes rely on expert histopathological assessment, which is time-consuming and requires specialist expertise.

View Article and Find Full Text PDF

Similar Publications

Automated Coffee Roast Level Classification Using Machine Learning and Deep Learning Models.

J Food Sci

September 2025

Faculty of Computing, Federal University of Uberlandia, Uberlândia, Brazil.

René Ernesto García Rivas , Pedro Luiz Lima Bertarini , Henrique Fernandes

The coffee roasting process is a critical factor in determining the final quality of the beverage, influencing its flavour, aroma, and acidity. Traditionally, roast-level classification has relied on manual inspection, which is time-consuming, subjective, and prone to inconsistencies. However, advancements in machine learning (ML) and computer vision, particularly convolutional neural networks (CNNs), have shown great promise in automating and improving the accuracy of this process.

View Article and Find Full Text PDF

Similar Publications