Dual-modality visual feature flow for medical report generation.

Med Image Anal

Chongqing Key Laboratory of Image Cognition, College of Computer Science and Technology, Chongqing University of Posts and Telecommunication, Chongqing, 400065, China.

Published: April 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Medical report generation, a cross-modal task of generating medical text information, aiming to provide professional descriptions of medical images in clinical language. Despite some methods have made progress, there are still some limitations, including insufficient focus on lesion areas, omission of internal edge features, and difficulty in aligning cross-modal data. To address these issues, we propose Dual-Modality Visual Feature Flow (DMVF) for medical report generation. Firstly, we introduce region-level features based on grid-level features to enhance the method's ability to identify lesions and key areas. Then, we enhance two types of feature flows based on their attributes to prevent the loss of key information, respectively. Finally, we align visual mappings from different visual feature with report textual embeddings through a feature fusion module to perform cross-modal learning. Extensive experiments conducted on four benchmark datasets demonstrate that our approach outperforms the state-of-the-art methods in both natural language generation and clinical efficacy metrics.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.media.2024.103413DOI Listing

Publication Analysis

Top Keywords

visual feature
12
medical report
12
report generation
12
dual-modality visual
8
feature flow
8
feature
5
medical
5
flow medical
4
report
4
generation
4

Similar Publications

Asymmetrically Coordinated CoN Sites with Enhanced Oxidase-like Activity for Dual-Mode Colorimetric Sensing of Nitrite.

ACS Appl Mater Interfaces

September 2025

State Key Laboratory of Advanced Materials for Intelligent Sensing & Key Laboratory of Organic Integrated Circuit Ministry of Education & Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science & Institute of Molecular Aggregation Science, Tianjin Univ

The design of efficient and user-friendly methods for nitrite detection is of great significance owing to its critical role in food safety and environmental protection. Herein, we report a novel cobalt single-atom nanozyme (CoN SA) featuring a highly asymmetric CoN coordination environment. This structural configuration stabilizes high-spin Co species and significantly enhances the oxidase-like activity.

View Article and Find Full Text PDF

Functional recovery after total knee arthroplasty (TKA) varies widely among individuals, and traditional assessments often fail to detect subtle changes in real-world walking ability. Wearable sensors offer continuous and objective tracking of gait outside of clinical settings. In this prospective, longitudinal study, thirty-one patients undergoing unilateral TKA wore thigh-mounted accelerometers continuously from 2 weeks before surgery through 90 days postoperatively.

View Article and Find Full Text PDF

Acute lymphoblastic leukemia (ALL) preferentially localizes in the bone marrow (BM) and displays recurrent patterns of medullary and extra-medullary involvement. Leukemic cells exploit their niche for propagation and survive selective pressure by chemotherapy in the BM microenvironment, suggesting the existence of protective mechanisms. Here, we established a three-dimensional (3D) BM mimic with human mesenchymal stromal cells and endothelial cells that resemble vasculature-like structures to explore the interdependence of leukemic cells with their microenvironment.

View Article and Find Full Text PDF

In-line multi-wavelength non-destructive pharma quality monitoring with ultrabroadband carbon nanotubes photo-thermoelectric imaging scanners.

Light Sci Appl

September 2025

Department of Electrical, Electronic, and Communication Engineering, Faculty of Science and Engineering, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo, 112-8551, Japan.

While non-destructive in-line monitoring at manufacturing sites is essential for safe distribution cycles of pharmaceuticals, efforts are still insufficient to develop analytical systems for detailed dynamic visualisation of foreign substances and material composition in target pills. Although spectroscopies, expected towards pharma testing, have faced technical challenges in in-line setups for bulky equipment housing, this work demonstrates compact dynamic photo-monitoring systems by selectively extracting informative irradiation-wavelengths from comprehensive optical references of target pills. This work develops a non-destructive in-line dynamic inspection system for pharma agent pills with carbon nanotube (CNT) photo-thermoelectric imagers and the associated ultrabroadband sub-terahertz (THz)-infrared (IR) multi-wavelength monitoring.

View Article and Find Full Text PDF

A 22-year-old woman had an 8-year history of progressive bilateral vision loss and of diabetes mellitus. Her mother had diabetes and two first cousins had severe congenital deafness. On examination, her visual acuities were 6/36 bilaterally, with absent colour vision and gross optic disc pallor.

View Article and Find Full Text PDF