Adaptive pixel attention network for hyperspectral image classification.

Yuefeng Zhao , Chengmin Zai , Nannan Hu , Lu Shi , Xue Zhou , Jingqi Sun

Sci Rep

Shandong Provincial Engineering and Technical Center of Light Manipulation, Shandong Provincial Key Laboratory of Optics and Photonic Devices, School of Physics and Electronics, Shandong Normal University, Jinan, 250014, China.

Published: November 2024

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Patch features obtained by fixed convolution kernel have become the main form in hyperspectral image (HSI) classification processing. However, the fixed convolution kernel limits the weight learning of channels, which results in the potential connections between pixels not being captured in patches, and seriously affects the classification performance. To tackle the above issues, we propose a novel Adaptive Pixel Attention Network, which can improve HSI classification by further mining the connections between pixels in patch features. Specifically, a Spectral-Spatial Superposition Enhancement module is first proposed for enhancing the spectral-spatial information of 3D input cubes via complementing the 1D spectral vectors by zero and reflection filling operations. More importantly, we also propose a new Adaptive Pixel Attention mechanism, which explores Cosine and Euclidean similarity to adaptively explore the distance and angle relationship between pixels of different scale convolution patch features. Moreover, the Cross-Layer Information Complement module is designed to form a contextual interaction by integrating the output features of different convolution layers, which can prevent the omission of discriminative information and further improve the network performance. Experimental results on four widely used HSI datasets IP, UP, HU, and KSC show that the proposed network is superior to other state-of-the-art classification models in accuracy, and it also has a better efficiency than other 3D works.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11585658	PMC
http://dx.doi.org/10.1038/s41598-024-73988-3	DOI Listing

Publication Analysis

Top Keywords

adaptive pixel

pixel attention

patch features

attention network

hyperspectral image

fixed convolution

convolution kernel

hsi classification

connections pixels

classification

Similar Publications

De-MSI: A Deep Learning-Based Data Denoising Method to Enhance Mass Spectrometry Imaging by Leveraging the Chemical Prior Knowledge.

Anal Chem

September 2025

State Key Laboratory of Environmental and Biological Analysis, Hong Kong Baptist University, Hong Kong SAR 999077, China.

Lei Guo , Chengyi Xie , Xin Diao , Thomas Ka Yam Lam , Yanhui Zhong

Mass spectrometry imaging (MSI) is a label-free technique that enables the visualization of the spatial distribution of thousands of ions within biosamples. Data denoising is the computational strategy aimed at enhancing the MSI data quality, providing an effective alternative to experimental methods. However, due to the complex noise pattern inherent in MSI data and the difficulty in obtaining ground truth from noise-free data, achieving reliable denoised images remains challenging.

View Article and Find Full Text PDF

Similar Publications

Inter-modality feature prediction through multimodal fusion for 3D shape defect detection.

Neural Netw

September 2025

School of Automation and Intelligent Sensing, Shanghai Jiao Tong University, Shanghai, 200240, China; Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, 200240, China; Institute of Medical Robotics, Shanghai Jiao Tong University, Shanghai, 200240, China.

Mujtaba Asad , Waqar Azeem , Hafiz Tayyab Mustafa , Yuming Fang , Jie Yang

3D shape defect detection plays an important role in autonomous industrial inspection. However, accurate detection of anomalies remains challenging due to the complexity of multimodal sensor data, especially when both color and structural information are required. In this work, we propose a lightweight inter-modality feature prediction framework that effectively utilizes multimodal fused features from the inputs of RGB, depth and point clouds for efficient 3D shape defect detection.

View Article and Find Full Text PDF

Similar Publications

Fine-grained multiclass nuclei segmentation with molecular empowered all-in-SAM model.

J Med Imaging (Bellingham)

September 2025

Vanderbilt University, Data Science Institute, Nashville, Tennessee, United States.

Xueyuan Li , Can Cui , Ruining Deng , Yucheng Tang , Quan Liu

Purpose: Recent developments in computational pathology have been driven by advances in vision foundation models (VFMs), particularly the Segment Anything Model (SAM). This model facilitates nuclei segmentation through two primary methods: prompt-based zero-shot segmentation and the use of cell-specific SAM models for direct segmentation. These approaches enable effective segmentation across a range of nuclei and cells.

View Article and Find Full Text PDF

Similar Publications

Using preprocessed datasets to construct and interpret multiclass identification models.

Front Plant Sci

August 2025

Key Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of China National Tobacco Corporation (CNTC), Zhengzhou, China.

Cong Wang , Yufeng Fu , Ran Wan , Le Zhao , Hongbo Wang

Introduction: Image and near-infrared (NIR) spectroscopic data are widely used for constructing analytical models in precision agriculture. While model interpretation can provide valuable insights for quality control and improvement, the inherent ambiguity of individual image pixels or spectral data points often hinders practical interpretability when using raw data directly. Furthermore, the presence of imbalanced datasets can lead to model overfitting and consequently, poor robustness.

View Article and Find Full Text PDF

Similar Publications

LGMMFusion: A LiDAR-guided multi-modal fusion framework for enhanced 3D object detection.

PLoS One

September 2025

School of Mechanical and Electrical Engineering, China University of Mining and Technology (Beijing), Beijing, China.

Haixing Cheng , Chengyong Liu , Wenzhe Gu , Yuyi Wu , Mengye Zhao

Multi-modal data fusion plays a critical role in enhancing the accuracy and robustness of perception systems for autonomous driving, especially for the detection of small objects. However, small object detection remains particularly challenging due to sparse LiDAR points and low-resolution image features, which often lead to missed or imprecise detections. Currently, many methods process LiDAR point clouds and visible-light camera images separately, and then fuse them in the detection head.

View Article and Find Full Text PDF

Similar Publications