Optimizing Performance of Transformer-based Models for Fetal Brain MR Image Segmentation.

Radiol Artif Intell

From the Neuroradiology Unit and CERMAC (N.P., P.A.D.R., M. Canini, G.N., A.F., A.C., C.B.) and Departments of Nuclear Medicine (P.S.) and Obstetrics and Gynecology (P.I.C., M. Candiani), IRCCS Ospedale San Raffaele, Via Olgettina 58-60, 20132 Milan, Italy; and Vita-Salute San Raffaele University, M

Published: November 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Purpose To test the performance of a transformer-based model when manipulating pretraining weights, dataset size, and input size and comparing the best model with the reference standard and state-of-the-art models for a resting-state functional (rs-fMRI) fetal brain extraction task. Materials and Methods An internal retrospective dataset (172 fetuses, 519 images; collected 2018-2022) was used to investigate influence of dataset size, pretraining approaches, and image input size on Swin-U-Net transformer (UNETR) and UNETR models. The internal and external (131 fetuses, 561 images) datasets were used to cross-validate and to assess generalization capability of the best model versus state-of-the-art models on different scanner types and number of gestational weeks (GWs). The Dice similarity coefficient (DSC) and the balanced average Hausdorff distance (BAHD) were used as segmentation performance metrics. Generalized equation estimation multifactorial models were used to assess significant model and interaction effects of interest. Results The Swin-UNETR model was not affected by the pretraining approach and dataset size and performed best with the mean dataset image size, with a mean DSC of 0.92 and BAHD of 0.097. Swin-UNETR was not affected by scanner type. Generalization results on the internal dataset showed that Swin-UNETR had lower performance compared with the reference standard models and comparable performance on the external dataset. Cross-validation on internal and external test sets demonstrated better and comparable performance of Swin-UNETR versus convolutional neural network architectures during the late-fetal period (GWs > 25) but lower performance during the midfetal period (GWs ≤ 25). Conclusion Swin-UNTER showed flexibility in dealing with smaller datasets, regardless of pretraining approaches. For fetal brain extraction from rs-fMR images, Swin-UNTER showed comparable performance with that of reference standard models during the late-fetal period and lower performance during the early GW period. Transformers, CNN, Medical Imaging Segmentation, MRI, Dataset Size, Input Size, Transfer Learning © RSNA, 2024.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11605146PMC
http://dx.doi.org/10.1148/ryai.230229DOI Listing

Publication Analysis

Top Keywords

dataset size
16
fetal brain
12
input size
12
reference standard
12
lower performance
12
comparable performance
12
performance transformer-based
8
performance
8
dataset
8
size
8

Similar Publications

Three methods for fair ranking of multiple protected items.

Sci Rep

September 2025

Department of Information Engineering, University of Padova, Via Giovanni Gradenigo, 6, 35131, Padova, PD, Italy.

Three approaches to fair ranking in retrieval systems are compared in this paper: mPFR, which is based on the theory of preferences and eigensystems; cRR, which is a simple' 'round robin" method; and mMLP, which is based on linear programming. In order to increase fairness without sacrificing retrieval effectiveness, the techniques post-process the rankings that a retrieval system sends back to users. The findings demonstrate that when it comes to protecting elements, mPFR and cRR accomplish the same level of effectiveness and fairness.

View Article and Find Full Text PDF

Red blood cell (RBC) size constrains the rate of diffusion of gases between (i) the environment and the capillary beds of the gas exchanger and (ii) the blood and organs. In birds, small RBCs with a high surface area to volume ratio permit a high O diffusion capacity and facilitate sustained, vigorous exercise. Unfortunately, our knowledge of archosaur cardiovascular evolution is incomplete without fossilized RBCs and blood vessels.

View Article and Find Full Text PDF

Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

Biomed Phys Eng Express

September 2025

electrical engineering department, Indian Institute of Technology Roorkee, Research wing, electrical department, Roorkee, uttrakhand, 247664, INDIA.

Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to communicate through neural signals. The prime objective of this work is to propose an innovative machine learning (ML) based classification methodology that combines electroencephalogram (EEG) data augmentation using a sliding window technique with statistical feature extraction from the amplitude and phase spectrum of frequency domain EEG segments.

View Article and Find Full Text PDF

Background: Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder lacking objective biomarkers for early diagnosis. DNA methylation is a promising epigenetic marker, and machine learning offers a data-driven classification approach. However, few studies have examined whole-blood, genome-wide DNA methylation profiles for ASD diagnosis in school-aged children.

View Article and Find Full Text PDF

Machine learning-based analysis of the impact of 5' untranslated region on protein expression.

Nucleic Acids Res

September 2025

School of Biomedical and Pharmaceutical Sciences, Guangdong University of Technology, No. 100 Waihuanxi Road, Guangzhou 510006, China.

The 5' untranslated region (5'UTR) plays a crucial regulatory role in messenger RNA (mRNA), with modified 5'UTRs extensively utilized in vaccine production, gene therapy, etc. Nevertheless, manually optimizing 5'UTRs may encounter difficulties in balancing the effects of various cis-elements. Consequently, multiple 5'UTR libraries have been created, and machine learning models have been employed to analyze and predict translation efficiency (TE) and protein expression, providing insights into critical regulatory features.

View Article and Find Full Text PDF