Transformers for Remote Sensing: A Systematic Review and Analysis.

Ruikun Wang, Lei Ma, Guangjun He, Brian Alan Johnson, Ziyun Yan, Ming Chang, Ying Liang

Sensors (Basel)

Beijing Institute of Satellite Information Engineering, Beijing 100095, China.

Published: May 2024

Research on transformers in remote sensing (RS) has been increasing since 2021, yet it remains relatively under-reviewed. To understand the trends of transformers in RS, we undertook a quantitative analysis of the major research on transformers over the past two years, dividing their applications into eight domains: land use/land cover (LULC) classification, segmentation, fusion, change detection, object detection, object recognition, registration, and others. Quantitative results show that transformers achieve higher accuracy in LULC classification and fusion, with more stable performance in segmentation and object detection. Combining the analysis results on LULC classification and segmentation, we found that transformers need more parameters than convolutional neural networks (CNNs). Further research on inference speed is also needed to improve transformers' performance. The most common application scenes for transformers in our database are urban areas, farmland, and water bodies. We also found that transformers are employed in the natural sciences, such as agriculture and environmental protection, rather than in the humanities or economics. Finally, this work summarizes the analysis results on transformers in remote sensing obtained during the research process and provides a perspective on future directions of development.

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11175147
DOI: http://dx.doi.org/10.3390/s24113495

Similar Publications

Multimodal deep learning methods for speech and language rehabilitation: a cross-sectional observational study.

Disabil Rehabil Assist Technol

September 2025

School of Foreign Languages, Ningbo University of Technology, Ningbo, China.

Xinqiao Cen

Speech and language rehabilitation is essential for people with communication disorders, which may arise from neurological disorders, developmental delays, or physical disabilities. Building on advances in deep learning, we introduce an improved multimodal rehabilitation pipeline that incorporates audio, video, and text information to provide therapy that adapts to the patient. The technique uses a hierarchical multimodal transformer with cross-attention fusion, which allows it to jointly model speech acoustics, facial dynamics, lip articulation, and linguistic context.

Correction: Multi-label remote sensing classification with self-supervised gated multi-modal transformers.

Front Comput Neurosci

August 2025

Origin Dynamics Intelligent Robot Co., Ltd., Zhengzhou, China.

Na Liu, Ye Yuan, Guodong Wu, Sai Zhang, Jie Leng

[This corrects the article DOI: 10.3389/fncom.2024.

River water quality forecasting: a novel LSTM-Transformer approach enhanced by multi-source data.

Environ Monit Assess

August 2025

School of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, 213164, China.

Juan Huan, Chen Zhang, Xiangen Xu, Yunxin Qian, Hao Zhang

Water quality prediction holds crucial importance as a fundamental technical support for efficient water resource management and strong ecological protection. In this study, aiming to meet the pressing requirement for eutrophication prevention and control in the water body of the Changzhou section of the Beijing-Hangzhou Canal, a prediction model for total phosphorus (TP) and total nitrogen (TN) concentrations, driven by deep learning, was constructed. A comprehensive multivariate dataset was formed by combining automated water quality monitoring data within the basin, remotely sensed interpretations of land types, and meteorological factors.

Effectiveness of the GPT-4o Model in Interpreting Electrocardiogram Images for Cardiac Diagnostics: Diagnostic Accuracy Study.

JMIR AI

August 2025

Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel.

Haya Engelstein, Roni Ramon-Gonen, Avi Sabbag, Eyal Klang, Karin Sudri

Background: Recent progress has demonstrated the potential of deep learning models in analyzing electrocardiogram (ECG) pathologies. However, this method is intricate, expensive to develop, and designed for specific purposes. Large language models show promise in medical image interpretation, and yet their effectiveness in ECG analysis remains understudied.

Dynamic atrous attention and dual branch context fusion for cross scale Building segmentation in high resolution remote sensing imagery.

Sci Rep

August 2025

School of Mining Engineering, Heilongjiang University of Science and Technology, Haerbin, 150000, China.

Yaohui Liu, Shuzhe Zhang, Xinkai Wang, Rui Zhai, Hu Jiang

Building segmentation of high-resolution remote sensing images using deep learning effectively reduces labor costs, but still faces the key challenges of effectively modeling cross-scale contextual relationships and preserving fine spatial details. Current Transformer-based approaches demonstrate superior long-range dependency modeling, but still suffer from the problem of progressive information loss during hierarchical feature encoding. Therefore, this study proposed a new semantic segmentation network named SegTDformer to extract buildings in remote sensing images.
