Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Research on transformers in remote sensing (RS), which started to increase after 2021, is facing the problem of a relative lack of review. To understand the trends of transformers in RS, we undertook a quantitative analysis of the major research on transformers over the past two years by dividing the application of transformers into eight domains: land use/land cover (LULC) classification, segmentation, fusion, change detection, object detection, object recognition, registration, and others. Quantitative results show that transformers achieve a higher accuracy in LULC classification and fusion, with more stable performance in segmentation and object detection. Combining the analysis results on LULC classification and segmentation, we have found that transformers need more parameters than convolutional neural networks (CNNs). Additionally, further research is also needed regarding inference speed to improve transformers' performance. It was determined that the most common application scenes for transformers in our database are urban, farmland, and water bodies. We also found that transformers are employed in the natural sciences such as agriculture and environmental protection rather than the humanities or economics. Finally, this work summarizes the analysis results of transformers in remote sensing obtained during the research process and provides a perspective on future directions of development.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11175147PMC
http://dx.doi.org/10.3390/s24113495DOI Listing

Publication Analysis

Top Keywords

transformers remote
12
remote sensing
12
lulc classification
12
transformers
10
analysis transformers
8
classification segmentation
8
detection object
8
object detection
8
sensing systematic
4
systematic review
4

Similar Publications

The speech and language rehabilitation are essential to people who have disorders of communication that may occur due to the condition of neurological disorder, developmental delays, or bodily disabilities. With the advent of deep learning, we introduce an improved multimodal rehabilitation pipeline that incorporates audio, video, and text information in order to provide patient-tailored therapy that adapts to the patient. The technique uses a cross-attention fusion multimodal hierarchical transformer architectural model that allows it to jointly design speech acoustics as well as the facial dynamics, lip articulation, and linguistic context.

View Article and Find Full Text PDF

[This corrects the article DOI: 10.3389/fncom.2024.

View Article and Find Full Text PDF

River water quality forecasting: a novel LSTM-Transformer approach enhanced by multi-source data.

Environ Monit Assess

August 2025

School of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, 213164, China.

Water quality prediction holds crucial importance as a fundamental technical support for efficient water resource management and strong ecological protection. In this study, aiming to meet the pressing requirement for eutrophication prevention and control in the water body of the Changzhou section of the Beijing-Hangzhou Canal, a prediction model for total phosphorus (TP) and total nitrogen (TN) concentrations, driven by deep learning, was constructed. A comprehensive multivariate dataset was formed by combining automated water quality monitoring data within the basin, remotely sensed interpretations of land types, and meteorological factors.

View Article and Find Full Text PDF

Background: Recent progress has demonstrated the potential of deep learning models in analyzing electrocardiogram (ECG) pathologies. However, this method is intricate, expensive to develop, and designed for specific purposes. Large language models show promise in medical image interpretation, and yet their effectiveness in ECG analysis remains understudied.

View Article and Find Full Text PDF

Building segmentation of high-resolution remote sensing images using deep learning effectively reduces labor costs, but still faces the key challenges of effectively modeling cross-scale contextual relationships and preserving fine spatial details. Current Transformer-based approaches demonstrate superior long-range dependency modeling, but still suffer from the problem of progressive information loss during hierarchical feature encoding. Therefore, this study proposed a new semantic segmentation network named SegTDformer to extract buildings in remote sensing images.

View Article and Find Full Text PDF