MTA-Net: Multi-scale triplet attention-aware network for multiclass skin lesion classification.

Comput Biol Med

Department of Computer Science and Engineering, Sardar Vallabhbhai National Institute of Technology, Surat, Gujarat, India. Electronic address:

Published: September 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Multiclass classification of skin lesions plays a crucial role in computer-aided skin cancer diagnosis and still remains challenging due to the high similarity between different classes and large variations within the same classes. Additionally, the color, shape, and size variation of lesions raise significant concerns. Existing convolutional neural network (CNN)-based approaches have shown prominence over traditional feature engineering-based methods for learning high-level feature representations from dermoscopy images; however, they exhibit limited capabilities to capture subtle lesion variations. Although recent attention-driven CNN models have partially addressed the above issues, they struggle to learn fine-grained lesion information at various scales, hindering performance in multiclass skin lesion classification settings. To address these challenges, this paper proposes a multi-scale triplet attention-aware network (MTA-Net) for multiclass skin lesion classification. Specifically, an MTA module, the core component of MTA-Net, is introduced on top of a pre-trained CNN architecture to facilitate efficient feature learning focused on lesion regions. The MTA module consists of a sequence of multi-scale triplet spatial attention (MTSA) and multi-scale triplet channel attention (MTCA), which aid in learning detailed feature relationships across spatial and channel dimensions at various scales, enriching feature representations. The proposed MTA-Net is evaluated on two benchmark datasets: HAM10000 and ISIC 2019. The experimental results demonstrate that the MTA-Net outperforms the baseline CNN frameworks and state-of-the-art methods across both datasets while requiring no external data, achieving an accuracy of 91.51% and 78.4%, and a balanced multiclass accuracy (BMCA) of 87.18% and 66.7% on HAM10000 and ISIC 2019 datasets, respectively. Further, the ablation studies and visualization results indicate the potency of the proposed MTA module.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2025.110729DOI Listing

Publication Analysis

Top Keywords

multi-scale triplet
16
multiclass skin
12
skin lesion
12
lesion classification
12
mta module
12
triplet attention-aware
8
attention-aware network
8
feature representations
8
ham10000 isic
8
isic 2019
8

Similar Publications

Apple leaf diseases severely affect the quality and yield of apples, and accurate classification is crucial for reducing losses. However, in natural environments, the similarity between backgrounds and lesion areas makes it difficult for existing models to balance lightweight design and high accuracy, limiting their practical applications. In order to resolve the aforementioned problem, this paper introduces a lightweight converged attention multi-branch network named LCAMNet.

View Article and Find Full Text PDF

Building segmentation of high-resolution remote sensing images using deep learning effectively reduces labor costs, but still faces the key challenges of effectively modeling cross-scale contextual relationships and preserving fine spatial details. Current Transformer-based approaches demonstrate superior long-range dependency modeling, but still suffer from the problem of progressive information loss during hierarchical feature encoding. Therefore, this study proposed a new semantic segmentation network named SegTDformer to extract buildings in remote sensing images.

View Article and Find Full Text PDF

Dual-Branch Deep Learning with Dynamic Stage Detection for CT Tube Life Prediction.

Sensors (Basel)

August 2025

Innovation Institute for Integration of Medicine and Engineering, West China Hospital, Sichuan University, Chengdu 610041, China.

CT scanners are essential tools in modern medical imaging. Sudden failures of their X-ray tubes can lead to equipment downtime, affecting healthcare services and patient diagnosis. However, existing prediction methods based on a single model struggle to adapt to the multi-stage variation characteristics of tube lifespan and have limited modeling capabilities for temporal features.

View Article and Find Full Text PDF

Learning discriminative features between abnormal and normal instances is crucial for video anomaly detection within the multiple instance learning framework. Existing methods primarily focus on instances with the highest anomaly scores, neglecting the identification and differentiation of hard samples, leading to misjudgments and high false alarm rates. To address these challenges, we propose a dual triplet contrastive loss strategy.

View Article and Find Full Text PDF

Multiclass classification of skin lesions plays a crucial role in computer-aided skin cancer diagnosis and still remains challenging due to the high similarity between different classes and large variations within the same classes. Additionally, the color, shape, and size variation of lesions raise significant concerns. Existing convolutional neural network (CNN)-based approaches have shown prominence over traditional feature engineering-based methods for learning high-level feature representations from dermoscopy images; however, they exhibit limited capabilities to capture subtle lesion variations.

View Article and Find Full Text PDF