One Neuron Saved is One Neuron Earned: On Parametric Efficiency of Quadratic Networks.

Feng-Lei Fan , Hang-Cheng Dong , Zhongming Wu , Lecheng Ruan , Tieyong Zeng , Yiming Cui , Jing-Xiao Liao

IEEE Trans Pattern Anal Mach Intell

Published: July 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Inspired by neuronal diversity in the biological neural system, a plethora of studies proposed to design novel types of artificial neurons and introduce neuronal diversity into artificial neural networks. Recently proposed quadratic neuron, which replaces the inner-product operation in conventional neurons with a quadratic one, have achieved great success in many essential tasks. Despite the promising results of quadratic neurons, there is still an unresolved issue: Is the superior performance of quadratic networks simply due to the increased parameters or due to the intrinsic expressive capability? Without clarifying this issue, the performance of quadratic networks is always suspicious. Additionally, resolving this issue is reduced to finding killer applications of quadratic networks. In this paper, with theoretical and empirical studies, we show that quadratic networks enjoy parametric efficiency, thereby confirming that the superior performance of quadratic networks is due to the intrinsic expressive capability. This intrinsic expressive ability comes from that quadratic neurons can easily represent nonlinear interaction, while it is hard for conventional neurons. Theoretically, we derive the approximation efficiency of quadratic networks over conventional ones in terms of real space and manifolds. Moreover, from the perspective of the Barron space, we demonstrate that there exists a functional space whose functions can be approximated by quadratic networks in a dimension-free error, but the approximation error of conventional networks is dependent on dimensions. Empirically, experimental results on synthetic data, classic benchmarks, and real-world applications show that quadratic models broadly enjoy parametric efficiency, and the gain of efficiency depends on the task. We have shared our code in https://github.com/asdvfghg/quadratic_efficiency.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TPAMI.2025.3588894	DOI Listing

Publication Analysis

Top Keywords

quadratic networks

quadratic

parametric efficiency

performance quadratic

intrinsic expressive

networks

efficiency quadratic

neuronal diversity

conventional neurons

quadratic neurons

Similar Publications

Long sequence temporal knowledge tracing for student performance prediction via integrating LSTM and informer.

PLoS One

September 2025

School of Electrical and Information Engineering, Hunan Institute of Technology, Hengyang, Hunan, China.

Ailian Gao , Zenglei Liu

Knowledge tracing can reveal students' level of knowledge in relation to their learning performance. Recently, plenty of machine learning algorithms have been proposed to exploit to implement knowledge tracing and have achieved promising outcomes. However, most of the previous approaches were unable to cope with long sequence time-series prediction, which is more valuable than short sequence prediction that is extensively utilized in current knowledge-tracing studies.

View Article and Find Full Text PDF

Similar Publications

Segmenting the motion components of a video: A long-term unsupervised model.

IEEE Trans Pattern Anal Mach Intell

September 2025

Etienne Meunier , Patrick Bouthemy

Human beings have the ability to continuously analyze a video and immediately extract the motion components. We want to adopt this paradigm to provide a coherent and stable motion segmentation over the video sequence. In this perspective, we propose a novel long-term spatio-temporal model operating in a totally unsupervised way.

View Article and Find Full Text PDF

Similar Publications

Optimized reconstruction of undersampled Dixon sequences using new memory-efficient unrolled deep neural networks: HalfVarNet and HalfDIRCN.

Magn Reson Med

September 2025

Aix Marseille Univ, CNRS, Centrale Med, Institut Fresnel, Marseille, France.

Sandra Martin , Amira Trabelsi , Maxime Guye , Marc Dubois , Redha Abdeddaim

Purpose: Fat fraction (FF) quantification in individual muscles using quantitative MRI is of major importance for monitoring disease progression and assessing disease severity in neuromuscular diseases. Undersampling of MRI acquisitions is commonly used to reduce scanning time. The present paper introduces novel unrolled neural networks for the reconstruction of undersampled MRI acquisitions.

View Article and Find Full Text PDF

Similar Publications

PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference.

IEEE Trans Pattern Anal Mach Intell

September 2025

Ye Li , Chen Tang , Yuan Meng , Jiajun Fan , Zenghao Chai

The troublesome model size and quadratic computational complexity associated with token quantity pose significant deployment challenges for Vision Transformers (ViTs) in practical applications. Despite recent advancements in model pruning and token reduction techniques speed up the inference speed of ViTs, these approaches either adopt a fixed sparsity ratio or overlook the meaningful interplay between architectural optimization and token selection. Consequently, this static and single-dimension compression often leads to pronounced accuracy degradation under aggressive compression rates, as they fail to fully explore redundancies across these two orthogonal dimensions.

View Article and Find Full Text PDF

Similar Publications

Analysis of spatial correlation network and influencing factors of green energy efficiency in urban agglomerations.

Sci Rep

September 2025

College of Management Science, Chengdu University of Technology, Chengdu, 610059, China.

Kun Shuai , Haodong Chang , Fujun He , Xin He

Exploring the spatial correlation characteristics of urban green total factor energy efficiency (GTFEE) is of great significance to promote the green and low-carbon energy transformation among cities and realize the integrated and coordinated development of green energy. Taking the urban agglomeration of the central area of the Yangtze River Delta as the research object, the Super-SBM model is used to estimate the urban GTFEE from 2006 to 2022, and exploratory spatio-temporal data analysis is used to explore its spatial pattern. Meanwhile, social network analysis (SNA) and quadratic assignment procedure (QAP) method are used to explore the spatial correlation network characteristics and influencing factors of GTFEE.

View Article and Find Full Text PDF

Similar Publications