Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

This paper proposes a generic method to learn interpretable convolutional filters in a deep convolutional neural network (CNN) for object classification, where each interpretable filter encodes features of a specific object part. Our method does not require additional annotations of object parts or textures for supervision. Instead, we use the same training data as traditional CNNs. Our method automatically assigns each interpretable filter in a high conv-layer with an object part of a certain category during the learning process. Such explicit knowledge representations in conv-layers of the CNN help people clarify the logic encoded in the CNN, i.e., answering what patterns the CNN extracts from an input image and uses for prediction. We have tested our method using different benchmark CNNs with various architectures to demonstrate the broad applicability of our method. Experiments have shown that our interpretable filters are much more semantically meaningful than traditional filters.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2020.2982882DOI Listing

Publication Analysis

Top Keywords

object classification
8
interpretable filter
8
interpretable
5
object
5
method
5
interpretable cnns
4
cnns object
4
classification paper
4
paper proposes
4
proposes generic
4

Similar Publications

Unlabelled: In 2024, the 200th anniversary of the first domestic work devoted to the study of gunshot injury was celebrated.

Objective: To present little-known information from the biography of its author, Professor P.P.

View Article and Find Full Text PDF

Knowledge distillation (KD) aims to transfer knowledge from a large-scale teacher model to a lightweight one, significantly reducing computational and storage requirements. However, the inherent learning capacity gap between the teacher and student often hinders the sufficient transfer of knowledge, motivating numerous studies to address this challenge. Inspired by the progressive approximation principle in the Stone-Weierstrass theorem, we propose expandable residual approximation (ERA), a novel KD method that decomposes the approximation of residual knowledge into multiple steps, reducing the difficulty of mimicking the teacher's representation through a divide-and-conquer approach.

View Article and Find Full Text PDF

Introduction: Colon cancer ranks among the most prevalent and lethal cancers globally, emphasizing the urgent need for accurate and early diagnostic tools. Recent advances in deep learning have shown promise in medical image analysis, offering potential improvements in detection accuracy and efficiency.

Methods: This study proposes a novel approach for classifying colon tissue images as normal or cancerous using Detectron2, a deep learning framework known for its superior object detection and segmentation capabilities.

View Article and Find Full Text PDF

Neural representations of visual statistical learning based on temporal duration.

Imaging Neurosci (Camb)

September 2025

Graduate School of Human and Environmental Studies, Kyoto University, Sakyo-ku, Kyoto, Japan.

Time perception is an essential aspect of daily life, and transitional probabilities can be learned based on temporal durations that are independent of individual objects. Previous studies on temporal and spatial visual statistical learning (VSL) have shown that the hippocampus and lateral occipital cortex are engaged in learning visual regularities. However, it remains unclear whether VSL on temporal duration unlinked to object identity is represented in brain regions involved in VSL and object recognition or in those involved in time perception without sensory cortex involvement.

View Article and Find Full Text PDF

Background: Cortico-cortical evoked potentials (CCEPs), elicited via single-pulse electrical stimulation, are used to map brain networks. These responses comprise early (N1) and late (N2) components, which reflect direct and indirect cortical connectivity. Reliable identification of these components remains difficult due to substantial variability in amplitude, phase, and timing.

View Article and Find Full Text PDF