Omni-dimensional dynamic convolution feature coordinate attention network for pneumonia classification.

Yufei Li, Yufei Xin, Xinni Li, Yinrui Zhang, Cheng Liu, Zhengwen Cao, Shaoyi Du, Lin Wang

Vis Comput Ind Biomed Art

School of Information Science and Technology, Northwest University, Xi'an, 710127, Shaanxi Province, China.

Published: July 2024

Pneumonia is a serious disease that can be fatal, particularly among children and the elderly. The accuracy of pneumonia diagnosis can be improved by combining artificial-intelligence technology with X-ray imaging. This study proposes X-ODFCANet, which addresses the issues of low accuracy and excessive parameters in existing deep-learning-based pneumonia-classification methods. The network incorporates a feature coordination attention module and an omni-dimensional dynamic convolution (ODConv) module, and uses the residual module for feature extraction from X-ray images. The feature coordination attention module uses two one-dimensional feature-encoding processes to aggregate feature information along different spatial directions. The ODConv module extracts and fuses feature information along four dimensions: the spatial dimension of the convolution kernel, the number of input channels, the number of output channels, and the number of convolution kernels. The experimental results show that the proposed method effectively improves the accuracy of pneumonia classification, which is 3.77% higher than that of ResNet18. The model has 4.45M parameters, a reduction of approximately 2.5 times. The code is available at https://github.com/limuni/X-ODFCANET.
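
The two one-dimensional encoding steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' module: it assumes the encodings are average pools along each spatial axis (as in the general coordinate-attention idea), works on a single channel, and replaces the learned convolutions of a real module with a plain sigmoid. The function names `directional_pools`, `sigmoid`, and `coordinate_attention` are hypothetical.

```python
import math


def directional_pools(feature_map):
    """Average a single-channel H x W map along each spatial direction."""
    height = len(feature_map)
    width = len(feature_map[0])
    # Pool across the width: one descriptor per row.
    row_means = [sum(row) / width for row in feature_map]
    # Pool across the height: one descriptor per column.
    col_means = [
        sum(feature_map[i][j] for i in range(height)) / height
        for j in range(width)
    ]
    return row_means, col_means


def sigmoid(x):
    """Logistic function mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def coordinate_attention(feature_map):
    """Reweight each position by a row gate and a column gate.

    Assumption: the learned transforms of a real attention module are
    omitted, so each gate is just the sigmoid of the pooled descriptor.
    """
    row_means, col_means = directional_pools(feature_map)
    row_gate = [sigmoid(v) for v in row_means]
    col_gate = [sigmoid(v) for v in col_means]
    return [
        [value * row_gate[i] * col_gate[j] for j, value in enumerate(row)]
        for i, row in enumerate(feature_map)
    ]
```

Because each gate depends on only one coordinate, a position's weight encodes where it sits along both axes, which is the property the abstract attributes to aggregating information from different spatial directions.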

Full text (PMC): http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11231110
DOI: http://dx.doi.org/10.1186/s42492-024-00168-5

Publication Analysis

Top Keywords

omni-dimensional dynamic

dynamic convolution

pneumonia classification

accuracy pneumonia

feature coordination

coordination attention

attention module

odconv module

convolution kernel

feature

Similar Publications

Multi-scale Autoencoder Suppression Strategy for Hyperspectral Image Anomaly Detection.

IEEE Trans Image Process

August 2025

Bing Tu, Tao Zhou, Bo Liu, Yan He, Jun Li

Autoencoders (AEs) have received extensive attention in hyperspectral anomaly detection (HAD) due to their capability to separate the background from the anomaly based on the reconstruction error. However, the existing AE methods routinely fail to adequately exploit spatial information and may precisely reconstruct anomalies, thereby affecting the detection accuracy. To address these issues, this study proposes a novel Multi-scale Autoencoder Suppression Strategy (MASS).

ILViT: An Inception-Linear Attention-Based Lightweight Vision Transformer for Microscopic Cell Classification.

J Imaging

July 2025

Department of Electrical and Computer Engineering, University of Massachusetts Lowell, Lowell, MA 01854, USA.

Zhangda Liu, Panpan Wu, Ziping Zhao, Hengyong Yu

Microscopic cell classification is a fundamental challenge in both clinical diagnosis and biological research. However, existing methods still struggle with the complexity and morphological diversity of cellular images, leading to limited accuracy or high computational costs. To overcome these constraints, we propose an efficient classification method that balances strong feature representation with a lightweight design.

Omni-dimensional dynamic convolution with coordinate attention detection scheme.

Sci Prog

June 2025

Computer Engineering Department, Jiangsu Second Normal University, Nanjing, Jiangsu, China.

Lufeng Bai, Zhi Jun Song

The paper proposes improvements to YOLOv8n to enhance small-target detection. It introduces coordinate attention (CA) into the C2f module to improve focus on spatial information and local details, which enhances spatial feature representation and small-object recognition. It replaces the Path Aggregation Network with a Bidirectional Feature Pyramid Network (BiFPN) in the neck, which fuses features at different scales more effectively. It also adds a smaller detection head to improve perception of very small targets.

Dense dynamic convolutional network for Bel canto vocal technique assessment.

Sci Rep

May 2025

University of Shanghai for Science and Technology, Shanghai, 200093, China.

Zhenyi Hou, Xu Zhao, Shanggerile Jiang, Daijun Luo, Xinyu Sheng

The Bel Canto performance is a complex and multidimensional art form encompassing pitch, timbre, technique, and affective expression. To accurately reflect a performer's singing proficiency, it is essential to quantify and evaluate their vocal technical execution precisely. Convolutional Neural Networks (CNNs), renowned for their robust ability to capture spatial hierarchical information, have been widely adopted in various tasks, including audio pattern recognition.

LO-MLPRNN: A Classification Algorithm for Multispectral Remote Sensing Images by Fusing Selective Convolution.

Sensors (Basel)

April 2025

Liuzhou Survey and Mapping Research Institute Co., Ltd., Liuzhou 545005, China.

Xiangsuo Fan, Yan Zhang, Yong Peng, Qi Li, Xianqiang Wei

To address the limitation of traditional deep learning algorithms in fully utilizing contextual information in multispectral remote sensing (RS) images, this paper proposes an improved vegetation cover classification algorithm called LO-MLPRNN, which integrates Large Selective Kernel Network (LSK) and Omni-Dimensional Dynamic Convolution (ODC) with a Multi-Layer Perceptron Recurrent Neural Network (MLPRNN). The algorithm employs parallel-connected ODC and LSK modules to adaptively adjust convolution kernel parameters across multiple dimensions and dynamically optimize spatial receptive fields, enabling multi-perspective feature fusion for efficient processing of multispectral band information. The extracted features are mapped to a high-dimensional space through a Gated Recurrent Unit (GRU) and fully connected layers, with nonlinear characteristics enhanced by activation functions, ultimately achieving pixel-level land cover classification.
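
As a rough illustration of adjusting kernel parameters across multiple dimensions, the sketch below combines several candidate kernels using four sets of attention weights: per spatial position, per input channel, per output channel, and per kernel. It is an assumption-laden sketch rather than this paper's implementation. The attention values are passed in directly instead of being computed from the input, and the function name `modulate_kernels` and the nested-list layout are hypothetical.

```python
def modulate_kernels(kernels, spatial_att, in_att, out_att, kernel_att):
    """Fuse n candidate kernels into one kernel using four attentions.

    Shapes (nested lists, spatial positions flattened to length k*k):
      kernels:     [n][c_out][c_in][k*k]
      spatial_att: [n][k*k]   weight per spatial position
      in_att:      [n][c_in]  weight per input channel
      out_att:     [n][c_out] weight per output channel
      kernel_att:  [n]        weight per candidate kernel

    Assumption: in a real dynamic convolution these weights would be
    produced from the input features; here they are given.
    """
    n = len(kernels)
    c_out = len(kernels[0])
    c_in = len(kernels[0][0])
    k2 = len(kernels[0][0][0])
    combined = [[[0.0] * k2 for _ in range(c_in)] for _ in range(c_out)]
    for i in range(n):
        for o in range(c_out):
            for c in range(c_in):
                for s in range(k2):
                    # Each weight is scaled along all four dimensions,
                    # then the candidate kernels are summed.
                    combined[o][c][s] += (
                        kernel_att[i]
                        * out_att[i][o]
                        * in_att[i][c]
                        * spatial_att[i][s]
                        * kernels[i][o][c][s]
                    )
    return combined
```

The fused kernel has the shape of a single ordinary kernel, so it can be applied with one standard convolution even though several candidate kernels contributed to it.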
