98%
921
2 minutes
20
Patch features obtained by fixed convolution kernel have become the main form in hyperspectral image (HSI) classification processing. However, the fixed convolution kernel limits the weight learning of channels, which results in the potential connections between pixels not being captured in patches, and seriously affects the classification performance. To tackle the above issues, we propose a novel Adaptive Pixel Attention Network, which can improve HSI classification by further mining the connections between pixels in patch features. Specifically, a Spectral-Spatial Superposition Enhancement module is first proposed for enhancing the spectral-spatial information of 3D input cubes via complementing the 1D spectral vectors by zero and reflection filling operations. More importantly, we also propose a new Adaptive Pixel Attention mechanism, which explores Cosine and Euclidean similarity to adaptively explore the distance and angle relationship between pixels of different scale convolution patch features. Moreover, the Cross-Layer Information Complement module is designed to form a contextual interaction by integrating the output features of different convolution layers, which can prevent the omission of discriminative information and further improve the network performance. Experimental results on four widely used HSI datasets IP, UP, HU, and KSC show that the proposed network is superior to other state-of-the-art classification models in accuracy, and it also has a better efficiency than other 3D works.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11585658 | PMC |
http://dx.doi.org/10.1038/s41598-024-73988-3 | DOI Listing |
Anal Chem
September 2025
State Key Laboratory of Environmental and Biological Analysis, Hong Kong Baptist University, Hong Kong SAR 999077, China.
Mass spectrometry imaging (MSI) is a label-free technique that enables the visualization of the spatial distribution of thousands of ions within biosamples. Data denoising is the computational strategy aimed at enhancing the MSI data quality, providing an effective alternative to experimental methods. However, due to the complex noise pattern inherent in MSI data and the difficulty in obtaining ground truth from noise-free data, achieving reliable denoised images remains challenging.
View Article and Find Full Text PDFNeural Netw
September 2025
School of Automation and Intelligent Sensing, Shanghai Jiao Tong University, Shanghai, 200240, China; Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, 200240, China; Institute of Medical Robotics, Shanghai Jiao Tong University, Shanghai, 200240, China.
3D shape defect detection plays an important role in autonomous industrial inspection. However, accurate detection of anomalies remains challenging due to the complexity of multimodal sensor data, especially when both color and structural information are required. In this work, we propose a lightweight inter-modality feature prediction framework that effectively utilizes multimodal fused features from the inputs of RGB, depth and point clouds for efficient 3D shape defect detection.
View Article and Find Full Text PDFJ Med Imaging (Bellingham)
September 2025
Vanderbilt University, Data Science Institute, Nashville, Tennessee, United States.
Purpose: Recent developments in computational pathology have been driven by advances in vision foundation models (VFMs), particularly the Segment Anything Model (SAM). This model facilitates nuclei segmentation through two primary methods: prompt-based zero-shot segmentation and the use of cell-specific SAM models for direct segmentation. These approaches enable effective segmentation across a range of nuclei and cells.
View Article and Find Full Text PDFFront Plant Sci
August 2025
Key Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of China National Tobacco Corporation (CNTC), Zhengzhou, China.
Introduction: Image and near-infrared (NIR) spectroscopic data are widely used for constructing analytical models in precision agriculture. While model interpretation can provide valuable insights for quality control and improvement, the inherent ambiguity of individual image pixels or spectral data points often hinders practical interpretability when using raw data directly. Furthermore, the presence of imbalanced datasets can lead to model overfitting and consequently, poor robustness.
View Article and Find Full Text PDFPLoS One
September 2025
School of Mechanical and Electrical Engineering, China University of Mining and Technology (Beijing), Beijing, China.
Multi-modal data fusion plays a critical role in enhancing the accuracy and robustness of perception systems for autonomous driving, especially for the detection of small objects. However, small object detection remains particularly challenging due to sparse LiDAR points and low-resolution image features, which often lead to missed or imprecise detections. Currently, many methods process LiDAR point clouds and visible-light camera images separately, and then fuse them in the detection head.
View Article and Find Full Text PDF