Hyper-convolutions via implicit kernels for medical image analysis.

Med Image Anal

School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, USA; Cornell Tech, NYC, NY, USA; Department of Radiology, Weill Cornell Medical School, NYC, NY, USA. Electronic address:

Published: May 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares weights across all pixels. A standard CNN's capacity, and thus its performance, is directly related to the number of learnable kernel weights, which is determined by the number of channels and the kernel size (support). In this paper, we present the hyper-convolution, a novel building block that implicitly encodes the convolutional kernel using spatial coordinates. Unlike a regular convolutional kernel, whose weights are independently learned, hyper-convolution kernel weights are correlated through an encoder that maps spatial coordinates to their corresponding values. Hyper-convolutions decouple kernel size from the total number of learnable parameters, enabling a more flexible architecture design. We demonstrate in our experiments that replacing regular convolutions with hyper-convolutions can improve performance with less parameters, and increase robustness against noise. We provide our code here: https://github.com/tym002/Hyper-Convolution.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.media.2023.102796DOI Listing

Publication Analysis

Top Keywords

convolutional kernel
12
kernel weights
12
building block
8
number learnable
8
kernel size
8
spatial coordinates
8
kernel
7
hyper-convolutions implicit
4
implicit kernels
4
kernels medical
4

Similar Publications

YOLOv10-kiwi: a YOLOv10-based lightweight kiwifruit detection model in trellised orchards.

Front Plant Sci

August 2025

College of Mathematics and Computer Science, Yan'an University, Yan'an, Shaanxi, China.

To address the challenge of real-time kiwifruit detection in trellised orchards, this paper proposes YOLOv10-Kiwi, a lightweight detection model optimized for resource-constrained devices. First, a more compact network is developed by adjusting the scaling factors of the YOLOv10n architecture. Second, to further reduce model complexity, a novel C2fDualHet module is proposed by integrating two consecutive Heterogeneous Kernel Convolution (HetConv) layers as a replacement for the traditional Bottleneck structure.

View Article and Find Full Text PDF

Stochastic Kriging (SK) is a generalized variant of Gaussian process regression, and it is developed for dealing with non-i.i.d.

View Article and Find Full Text PDF

Directed message passing neural networks enhanced graph convolutional learning for accurate polymer density prediction.

J Chem Phys

September 2025

National Synchrotron Radiation Laboratory, State Key Laboratory of Advanced Glass Materials, Anhui Provincial Engineering Research Center for Advanced Functional Polymer Films, University of Science and Technology of China, Hefei, Anhui 230029, China.

Polymer density is a critical factor influencing material performance and industrial applications, and it can be tailored by modifying the chemical structure of repeating units. Traditional polymer density characterization methods rely heavily on domain expertise; however, the vast chemical space comprising over one million potential polymer structures makes conventional experimental screening inefficient and costly. In this study, we proposed a machine learning framework for polymer density prediction, rigorously evaluating four models: neural networks (NNs), random forest (RF), XGBoost, and graph convolutional neural networks (GCNNs).

View Article and Find Full Text PDF

Cascade Aggregation Network for Accurate Polyp Segmentation.

IET Syst Biol

September 2025

School of Computer and Information Techonology, Xinyang Normal University, Xinyang, China.

Accurate polyp segmentation is crucial for computer-aided diagnosis and early detection of colorectal cancer. Whereas feature pyramid network (FPN) and its variants are widely used in polyp segmentation, inherent limitations existing in FPN include: (1) repeated upsampling degrades fine details, reducing small polyp segmentation accuracy and (2) naive feature fusion (e.g.

View Article and Find Full Text PDF

E-SegNet: E-Shaped Structure Networks for Accurate 2D and 3D Medical Image Segmentation.

Research (Wash D C)

September 2025

Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China.

U-structure has become a foundational approach in medical image segmentation, consistently demonstrating strong performance across various segmentation tasks. Most current models are based on this framework, customizing encoder-decoder components to achieve higher accuracy across various segmentation challenges. However, this often comes at the cost of increased parameter counts, which inevitably limit their practicality in real-world applications.

View Article and Find Full Text PDF