Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Glaucoma is one of the major eye diseases that leads to progressive optic nerve fiber damage and irreversible blindness, afflicting millions of individuals. Glaucoma forecast is a good solution to early screening and intervention of potential patients, which is helpful to prevent further deterioration of the disease. It leverages a series of historical fundus images of an eye and forecasts the likelihood of glaucoma occurrence in the future. However, the irregular sampling nature and the imbalanced class distribution are two challenges in the development of disease forecasting approaches. To this end, we introduce the Multi-scale Spatio-temporal Transformer Network (MST-former) based on the transformer architecture tailored for sequential image inputs, which can effectively learn representative semantic information from sequential images on both temporal and spatial dimensions. Specifically, we employ a multi-scale structure to extract features at various resolutions, which can largely exploit rich spatial information encoded in each image. Besides, we design a time distance matrix to scale time attention in a non-linear manner, which could effectively deal with the irregularly sampled data. Furthermore, we introduce a temperature-controlled Balanced Softmax Cross-entropy loss to address the class imbalance issue. Extensive experiments on the Sequential fundus Images for Glaucoma Forecast (SIGF) dataset demonstrate the superiority of the proposed MST-former method, achieving an AUC of 96.6% for glaucoma forecasting. Besides, our method shows excellent generalization capability on the Alzheimer's Disease Neuroimaging Initiative (ADNI) MRI dataset, with an accuracy of 88.2% for mild cognitive impairment and Alzheimer's disease prediction, outperforming the compared method by a large margin. A series of ablation studies further verify the contribution of our proposed components in addressing the irregular sampled and class imbalanced problems.

Download full-text PDF

Source
http://dx.doi.org/10.1109/JBHI.2024.3523298DOI Listing

Publication Analysis

Top Keywords

multi-scale spatio-temporal
8
glaucoma forecasting
8
images glaucoma
8
glaucoma forecast
8
fundus images
8
alzheimer's disease
8
glaucoma
6
spatio-temporal transformer-based
4
transformer-based imbalanced
4
imbalanced longitudinal
4

Similar Publications

Thyroid hormones are significant for controlling metabolism, and two common thyroid disorders, such as hypothyroidism. The hyperthyroidism are directly affect the metabolic rate of the human body. Predicting and diagnosing thyroid disease remain significant challenges in medical research due to the complexity of thyroid hormone regulation and its impact on metabolism.

View Article and Find Full Text PDF

Modeling spatial processes of extreme heat impacts on global economy: a multi-scale spatio-temporal approach.

Sci Bull (Beijing)

July 2025

Climate Change and Carbon Neutrality Lab, Henan University, Zhengzhou 450046, China; Key Research Institute of Yellow River Civilization and Sustainable Development, Henan University, Kaifeng 475001, China; Faculty of Geographical Science and Engineering, Henan University, Zhengzhou 450046, China.

Rising frequency, intensity, and geographic scope of extreme heat profoundly impede global sustainable economic development. However, existing climate econometric models are limited in capturing the spatial processes through which extreme heat affects the global economy, often resulting in downward-biased estimates of total economic losses. This study develops a novel multi-scale spatio-temporal model that integrates classic multi-level modeling with spatial statistics, explicitly addressing key challenges faced by climate econometrics.

View Article and Find Full Text PDF

Traffic prediction plays an essential role in intelligent transportation systems by supporting urban traffic management and public safety. A major challenge lies in addressing both the limitations of static assumptions and the inherent complexity they introduce when modeling dynamic and heterogeneous traffic systems. Traditional methods often simplify complex spatio-temporal data into a single-dimensional framework, potentially overlooking intricate node interactions and detailed network characteristics.

View Article and Find Full Text PDF

Emotion analysis based on electroencephalogram (EEG) sensors is pivotal for human-machine interaction yet faces key challenges in spatio-temporal feature fusion and cross-band and brain-region integration from multi-channel sensor-derived signals. This paper proposes MB-MSTFNet, a novel framework for EEG emotion recognition. The model constructs a 3D tensor to encode band-space-time correlations of sensor data, explicitly modeling frequency-domain dynamics and spatial distributions of EEG sensors across brain regions.

View Article and Find Full Text PDF

Accurate athlete pose estimation in basketball is crucial for game analysis, player training, and tactical decision-making. However, existing pose estimation methods struggle to effectively address common challenges in basketball, such as motion blur, occlusions, and complex backgrounds. To tackle these issues, this paper proposes a basketball action pose estimation framework, which first leverages a multi-dimensional data stream network to extract spatial, temporal, and contextual information separately.

View Article and Find Full Text PDF