98%
921
2 minutes
20
The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares weights across all pixels. A standard CNN's capacity, and thus its performance, is directly related to the number of learnable kernel weights, which is determined by the number of channels and the kernel size (support). In this paper, we present the hyper-convolution, a novel building block that implicitly encodes the convolutional kernel using spatial coordinates. Unlike a regular convolutional kernel, whose weights are independently learned, hyper-convolution kernel weights are correlated through an encoder that maps spatial coordinates to their corresponding values. Hyper-convolutions decouple kernel size from the total number of learnable parameters, enabling a more flexible architecture design. We demonstrate in our experiments that replacing regular convolutions with hyper-convolutions can improve performance with less parameters, and increase robustness against noise. We provide our code here: https://github.com/tym002/Hyper-Convolution.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.media.2023.102796 | DOI Listing |
Front Plant Sci
August 2025
College of Mathematics and Computer Science, Yan'an University, Yan'an, Shaanxi, China.
To address the challenge of real-time kiwifruit detection in trellised orchards, this paper proposes YOLOv10-Kiwi, a lightweight detection model optimized for resource-constrained devices. First, a more compact network is developed by adjusting the scaling factors of the YOLOv10n architecture. Second, to further reduce model complexity, a novel C2fDualHet module is proposed by integrating two consecutive Heterogeneous Kernel Convolution (HetConv) layers as a replacement for the traditional Bottleneck structure.
View Article and Find Full Text PDFIEEE Trans Pattern Anal Mach Intell
September 2025
Stochastic Kriging (SK) is a generalized variant of Gaussian process regression, and it is developed for dealing with non-i.i.d.
View Article and Find Full Text PDFJ Chem Phys
September 2025
National Synchrotron Radiation Laboratory, State Key Laboratory of Advanced Glass Materials, Anhui Provincial Engineering Research Center for Advanced Functional Polymer Films, University of Science and Technology of China, Hefei, Anhui 230029, China.
Polymer density is a critical factor influencing material performance and industrial applications, and it can be tailored by modifying the chemical structure of repeating units. Traditional polymer density characterization methods rely heavily on domain expertise; however, the vast chemical space comprising over one million potential polymer structures makes conventional experimental screening inefficient and costly. In this study, we proposed a machine learning framework for polymer density prediction, rigorously evaluating four models: neural networks (NNs), random forest (RF), XGBoost, and graph convolutional neural networks (GCNNs).
View Article and Find Full Text PDFIET Syst Biol
September 2025
School of Computer and Information Techonology, Xinyang Normal University, Xinyang, China.
Accurate polyp segmentation is crucial for computer-aided diagnosis and early detection of colorectal cancer. Whereas feature pyramid network (FPN) and its variants are widely used in polyp segmentation, inherent limitations existing in FPN include: (1) repeated upsampling degrades fine details, reducing small polyp segmentation accuracy and (2) naive feature fusion (e.g.
View Article and Find Full Text PDFResearch (Wash D C)
September 2025
Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China.
U-structure has become a foundational approach in medical image segmentation, consistently demonstrating strong performance across various segmentation tasks. Most current models are based on this framework, customizing encoder-decoder components to achieve higher accuracy across various segmentation challenges. However, this often comes at the cost of increased parameter counts, which inevitably limit their practicality in real-world applications.
View Article and Find Full Text PDF