CXR-MultiTaskNet a unified deep learning framework for joint disease localization and classification in chest radiographs.

Sci Rep

Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Aziz Nagar, Hyderabad, Telangana, 500075, India.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Chest X-ray (CXR) is a challenging problem in automated medical diagnosis, where complex visual patterns of thoracic diseases must be precisely identified through multi-label classification and lesion localization. Current approaches typically consider classification and localization in isolation, resulting in a piecemeal system that does not exploit common representations and is often not clinically interpretable, as well as limited in handling multi-label diseases. Although multi-task learning frameworks, such as DeepChest and CLN, appear to meet this goal, they suffer from task interference and poor explainability, which limits their practical application in real-world clinical workflows. To address these limitations, we present a unified multi-task deep learning framework, CXR-MultiTaskNet, for simultaneously classifying thoracic diseases and localizing lesions in chest X-rays. Our framework comprises a standard ResNet50 feature extractor, two task-specific heads for multi-task learning, and a Grad-CAM-based explainability module that provides accurate predictions and enhances clinical explainability. We formulate a joint loss that weighs the relative importance of representation extraction, which is large due to class variations, and the final loss, which is larger in the detection loss that occurs in extreme class imbalances between days and the detectability of varying disease manifestation types. Recent advances made by deep learning methods in the identification of disease in chest X-ray images are promising; however, there are limitations in their performance for complete analysis due to the lack of interpretability, some inherent weaknesses of convolutional neural networks (CNN), and prior learning of classification at the image level before localization of the disease. In this paper, we propose a dual-attention-based hierarchical feature extraction approach, which addresses the challenges of deep learning in detecting diseases in chest X-ray images. Through the use of visual attention maps, the detection steps can be better tracked, and therefore, the entire process is made more interpretable than with a traditional CNN-embedding model. We also manage to obtain both disease-level and pixel-level predictions, which enable explainable and comprehensive analysis of each image and aid in localizing each detected abnormality area. The proposed approach was further optimized for X-ray images by computing the objective losses during training, which ultimately gives higher significance to smaller lesions. Experimental evaluations on a benchmark chest X-ray dataset demonstrate the potential of the proposed approach achieving a macro F1-score of 0.965 (0.968 micro F1-score) for disease classification and mean IoU of 0.851 (mAP@0.50) for localization of diseases Content: Model intepretability, Chest X-ray image disease detection, Detection region localization, Weakly supervised transfer learning Lesion localization → 5 of 0.927 Compared to state-of-the-art single-task and multi-task baselines, these results are consistently better. The presented framework provides an integrated, method-based approach to chest X-ray analysis that is clinically useful, interpretable, and scalable for automation, allowing for efficient diagnostic pathways and enhanced clinical decision-making. This single framework can serve as a router for next-gen explainable AI in radiology.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12399747PMC
http://dx.doi.org/10.1038/s41598-025-16669-zDOI Listing

Publication Analysis

Top Keywords

chest x-ray
24
deep learning
16
x-ray images
12
learning
8
learning framework
8
chest
8
thoracic diseases
8
lesion localization
8
clinically interpretable
8
multi-task learning
8

Similar Publications

A 27-year-old man from Ethiopia had undergone an extraction of a molar in Libya ten months earlier, after which a submental swelling developed. For that reason, an oral and maxillofacial surgery department was consulted. Clinical examination showed a tender, firm-to-the-touch, non-mobile swelling with central ulceration.

View Article and Find Full Text PDF

Background: Patent foramen ovale (PFO) has been identified as a potential risk factor for cryptogenic stroke (CS). Although transesophageal echocardiography (TEE) is considered the gold standard for PFO detection, false-negative results remain a clinical concern, particularly in CS patients with high suspicion of PFO-related etiology.

Aims: To evaluate the clinical utility of transcatheter PFO exploration (TPFOE) in CS patients with negative TEE findings but high suspicion of PFO-related etiology.

View Article and Find Full Text PDF

Delayed diagnosis of post-traumatic bronchial transection in a pediatric patient: A case report.

Int J Surg Case Rep

September 2025

Al-Mouwasat University Hospital, Faculty of Medicine, Damascus University, Damascus, Syrian Arab Republic. Electronic address:

Background: Tracheobronchial injuries (TBI) are rare yet potentially fatal complications of blunt chest trauma, often underdiagnosed due to nonspecific clinical manifestations.

Case Presentation: We report the case of an 11-year-old Arab girl who developed progressive dyspnea two months after a motor vehicle accident initially managed conservatively. Imaging revealed complete atelectasis of the right lung and obstruction of the right main bronchus by granulation tissue.

View Article and Find Full Text PDF

Background: Older patients presenting with nonspecific complaints (NSC) in the Emergency Department (ED) pose diagnostic challenges. The lack of clear symptoms leads to high misdiagnosis rates, extended hospital stays, and functional impairment. However, limited research exists on diagnostic test utilization for this population.

View Article and Find Full Text PDF

Background The prognostic value of baseline visual emphysema scoring at low-dose CT (LDCT) in lung cancer screening cohorts is unknown. Purpose To determine whether a single visual emphysema score at LDCT is predictive of 25-year mortality from all causes, chronic obstructive pulmonary disease (COPD), and cardiovascular disease (CVD). Materials and Methods In this prospective cohort study, asymptomatic adults aged 40-85 years with a history of smoking underwent baseline LDCT screening for lung cancer between June 2000 and December 2008.

View Article and Find Full Text PDF