Omni-dimensional dynamic convolution feature coordinate attention network for pneumonia classification.

Vis Comput Ind Biomed Art

School of Information Science and Technology, Northwest University, Xi'an, 710127, Shaanxi Province, China.

Published: July 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Pneumonia is a serious disease that can be fatal, particularly among children and the elderly. The accuracy of pneumonia diagnosis can be improved by combining artificial-intelligence technology with X-ray imaging. This study proposes X-ODFCANet, which addresses the issues of low accuracy and excessive parameters in existing deep-learning-based pneumonia-classification methods. This network incorporates a feature coordination attention module and an omni-dimensional dynamic convolution (ODConv) module, leveraging the residual module for feature extraction from X-ray images. The feature coordination attention module utilizes two one-dimensional feature encoding processes to aggregate feature information from different spatial directions. Additionally, the ODConv module extracts and fuses feature information in four dimensions: the spatial dimension of the convolution kernel, input and output channel quantities, and convolution kernel quantity. The experimental results demonstrate that the proposed method can effectively improve the accuracy of pneumonia classification, which is 3.77% higher than that of ResNet18. The model parameters are 4.45M, which was reduced by approximately 2.5 times. The code is available at https://github.com/limuni/X-ODFCANET .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11231110PMC
http://dx.doi.org/10.1186/s42492-024-00168-5DOI Listing

Publication Analysis

Top Keywords

omni-dimensional dynamic
8
dynamic convolution
8
pneumonia classification
8
accuracy pneumonia
8
feature coordination
8
coordination attention
8
attention module
8
odconv module
8
convolution kernel
8
feature
7

Similar Publications

Autoencoders (AEs) have received extensive attention in hyperspectral anomaly detection (HAD) due to their capability to separate the background from the anomaly based on the reconstruction error. However, the existing AE methods routinely fail to adequately exploit spatial information and may precisely reconstruct anomalies, thereby affecting the detection accuracy. To address these issues, this study proposes a novel Multi-scale Autoencoder Suppression Strategy (MASS).

View Article and Find Full Text PDF

Microscopic cell classification is a fundamental challenge in both clinical diagnosis and biological research. However, existing methods still struggle with the complexity and morphological diversity of cellular images, leading to limited accuracy or high computational costs. To overcome these constraints, we propose an efficient classification method that balances strong feature representation with a lightweight design.

View Article and Find Full Text PDF

The paper proposes improvements to YOLOv8n to enhance small target detection capabilities and introduces coordinate attention (CA) to the C2f module to improve focus on spatial information and local details. CA enhances spatial feature representation and small object recognition and replaces Path Aggregation Network with Bidirectional Feature Pyramid Network (BiFPN) in the neck to better fuse multi-scale features. BiFPN enables more effective fusion of features at different scales and adds a smaller detection head to improve perception of very small targets.

View Article and Find Full Text PDF

The Bel Canto performance is a complex and multidimensional art form encompassing pitch, timbre, technique, and affective expression. To accurately reflect a performer's singing proficiency, it is essential to quantify and evaluate their vocal technical execution precisely. Convolutional Neural Networks (CNNs), renowned for their robust ability to capture spatial hierarchical information, have been widely adopted in various tasks, including audio pattern recognition.

View Article and Find Full Text PDF

To address the limitation of traditional deep learning algorithms in fully utilizing contextual information in multispectral remote sensing (RS) images, this paper proposes an improved vegetation cover classification algorithm called LO-MLPRNN, which integrates Large Selective Kernel Network (LSK) and Omni-Dimensional Dynamic Convolution (ODC) with a Multi-Layer Perceptron Recurrent Neural Network (MLPRNN). The algorithm employs parallel-connected ODC and LSK modules to adaptively adjust convolution kernel parameters across multiple dimensions and dynamically optimize spatial receptive fields, enabling multi-perspective feature fusion for efficient processing of multispectral band information. The extracted features are mapped to a high-dimensional space through a Gate Recurrent Unit (GRU) and fully connected layers, with nonlinear characteristics enhanced by activation functions, ultimately achieving pixel-level land cover classification.

View Article and Find Full Text PDF