Interpretable CNNs for Object Classification.

Quanshi Zhang , Xin Wang , Ying Nian Wu , Huilin Zhou , Song-Chun Zhu

IEEE Trans Pattern Anal Mach Intell

Published: October 2021

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

This paper proposes a generic method to learn interpretable convolutional filters in a deep convolutional neural network (CNN) for object classification, where each interpretable filter encodes features of a specific object part. Our method does not require additional annotations of object parts or textures for supervision. Instead, we use the same training data as traditional CNNs. Our method automatically assigns each interpretable filter in a high conv-layer with an object part of a certain category during the learning process. Such explicit knowledge representations in conv-layers of the CNN help people clarify the logic encoded in the CNN, i.e., answering what patterns the CNN extracts from an input image and uses for prediction. We have tested our method using different benchmark CNNs with various architectures to demonstrate the broad applicability of our method. Experiments have shown that our interpretable filters are much more semantically meaningful than traditional filters.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TPAMI.2020.2982882	DOI Listing

Publication Analysis

Top Keywords

object classification

interpretable filter

interpretable

object

method

interpretable cnns

cnns object

classification paper

paper proposes

proposes generic

Similar Publications

[Petr Petrovich Einbrodt - a pioneer of forensic ballistics in Russia (to the 200th anniversary of the first national work on gunshot wound)].

Sud Med Ekspert

January 2025

Russian University of Medicine, Moscow, Russia.

A P Bozhchenko , E Kh Barinov

Unlabelled: In 2024, the 200th anniversary of the first domestic work devoted to the study of gunshot injury was celebrated.

Objective: To present little-known information from the biography of its author, Professor P.P.

View Article and Find Full Text PDF

Similar Publications

Expandable Residual Approximation for Knowledge Distillation.

IEEE Trans Neural Netw Learn Syst

September 2025

Zhaoyi Yan , Binghui Chen , Yunfan Liu , Qixiang Ye

Knowledge distillation (KD) aims to transfer knowledge from a large-scale teacher model to a lightweight one, significantly reducing computational and storage requirements. However, the inherent learning capacity gap between the teacher and student often hinders the sufficient transfer of knowledge, motivating numerous studies to address this challenge. Inspired by the progressive approximation principle in the Stone-Weierstrass theorem, we propose expandable residual approximation (ERA), a novel KD method that decomposes the approximation of residual knowledge into multiple steps, reducing the difficulty of mimicking the teacher's representation through a divide-and-conquer approach.

View Article and Find Full Text PDF

Similar Publications

Utilizing Detectron2 for accurate and efficient colon cancer detection in histopathological images.

Front Bioeng Biotechnol

August 2025

Department of Gastroenterology, The Third Affiliated Hospital of Wenzhou Medical University, Wenzhou, China.

Luxi Chen , Jie Shen , Xinyu Li , Rongzhou Li , Xiaoyun Gao

Introduction: Colon cancer ranks among the most prevalent and lethal cancers globally, emphasizing the urgent need for accurate and early diagnostic tools. Recent advances in deep learning have shown promise in medical image analysis, offering potential improvements in detection accuracy and efficiency.

Methods: This study proposes a novel approach for classifying colon tissue images as normal or cancerous using Detectron2, a deep learning framework known for its superior object detection and segmentation capabilities.

View Article and Find Full Text PDF

Similar Publications

Neural representations of visual statistical learning based on temporal duration.

Imaging Neurosci (Camb)

September 2025

Graduate School of Human and Environmental Studies, Kyoto University, Sakyo-ku, Kyoto, Japan.

Sachio Otsuka , Jun Saiki

Time perception is an essential aspect of daily life, and transitional probabilities can be learned based on temporal durations that are independent of individual objects. Previous studies on temporal and spatial visual statistical learning (VSL) have shown that the hippocampus and lateral occipital cortex are engaged in learning visual regularities. However, it remains unclear whether VSL on temporal duration unlinked to object identity is represented in brain regions involved in VSL and object recognition or in those involved in time perception without sensory cortex involvement.

View Article and Find Full Text PDF

Similar Publications

Cortico-Cortical Evoked Potentials: Automated Localization and Classification of Early and Late Responses.

J Neurosci Methods

September 2025

Department of Electrical and Computer Engineering, University of Alabama at Birmingham, Birmingham, AL, USA.

Sahaj A Patel , Helen Brinyark , Caila Coyne , Noshin Tasnia , Rebekah Chatfield

Background: Cortico-cortical evoked potentials (CCEPs), elicited via single-pulse electrical stimulation, are used to map brain networks. These responses comprise early (N1) and late (N2) components, which reflect direct and indirect cortical connectivity. Reliable identification of these components remains difficult due to substantial variability in amplitude, phase, and timing.

View Article and Find Full Text PDF

Similar Publications