Siamese comparative transformer-based network for unsupervised landmark detection.

Can Zhao , Tao Wu , Jianlin Zhang , Zhiyong Xu , Meihui Li , Dongxu Liu

PLoS One

National Key Laboratory of Optical Field Manipulation Science and Technology, Chinese Academy of Sciences, Chengdu, China.

Published: December 2024

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Landmark detection is a common task that benefits downstream computer vision tasks. Current landmark detection algorithms often train a sophisticated image pose encoder by reconstructing the source image to identify landmarks. Although a well-trained encoder can effectively capture landmark information through image reconstruction, it overlooks the semantic relationships between landmarks. This contradicts the goal of achieving semantic representations in landmark detection tasks. To address these challenges, we introduce a novel Siamese comparative transformer-based network that strengthens the semantic connections among detected landmarks. Specifically, the connection between landmarks with the same semantics has been enhanced by employing a Siamese contrastive regularizer. In addition, we integrate a lightweight direction-guided Transformer into the image pose encoder to perceive global feature relationships, thereby improving the representation and encoding of landmarks. Experiments on the CelebA, AFLW, and Cat Heads benchmarks demonstrate that our proposed method achieves competitive performance compared to existing unsupervised methods and even supervised methods.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11687641	PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0313518	PLOS

Publication Analysis

Top Keywords

landmark detection

siamese comparative

comparative transformer-based

transformer-based network

image pose

pose encoder

landmark

landmarks

network unsupervised

unsupervised landmark

Similar Publications

Toward universal immunofluorescence normalization for multiplex tissue imaging with UniFORM.

Cell Rep Methods

August 2025

Department of Biomedical Engineering and Computational Biology Program, OHSU, Portland, OR, USA; Knight Cancer Institute, OHSU, Portland, OR, USA. Electronic address:

Kunlun Wang , Kaoutar Ait-Ahmad , Sam Kupp , Zachary Sims , Eric Cramer

We present UniFORM, a non-parametric, Python-based pipeline for normalizing multiplex tissue imaging (MTI) data at both the feature and pixel levels. UniFORM employs an automated rigid landmark registration method tailored to the distributional characteristics of MTI, with UniFORM operating without prior distributional assumptions and handling both unimodal and bimodal patterns. By aligning the biologically invariant negative populations, UniFORM removes technical variation while preserving tissue-specific expression patterns in positive populations.

View Article and Find Full Text PDF

Similar Publications

Two-Steps Neural Networks for an Automated Cerebrovascular Landmark Detection along the Circle of Willis.

IEEE J Biomed Health Inform

September 2025

Rafic Nader , Vincent L'Allinec , Romain Bourcier , Florent Autrusseau

Intracranial aneurysms (ICA) commonly occur in specific segments of the Circle of Willis (CoW), primarily, onto thirteen major arterial bifurcations. An accurate detection of these critical landmarks is necessary for a prompt and efficient diagnosis. We introduce a fully automated landmark detection approach for CoW bifurcations using a two-step neural networks process.

View Article and Find Full Text PDF

Similar Publications

Assessing Diagnostic Accuracy in Cephalometry: A Comparative Study of Manual and Digital Tracing Techniques.

Cureus

August 2025

Orthodontics and Dentofacial Orthopaedics, Faculty of Dental Sciences, Institute of Medical Sciences, Banaras Hindu University, Varanasi, IND.

Aparajita Pandey , T P Chaturvedi , Adit Srivastava , Saumya Shukla , Savitha Priyadarsini S

Aim: This study aimed to statistically evaluate and compare the accuracy, reliability, and efficiency of manual versus artificial intelligence (AI)-assisted digital cephalometric tracing using Steiner's and Down's analyses in orthodontic diagnostics.

Materials And Methods: A retrospective study was conducted using 20 lateral cephalograms obtained using the NewTom GiANO HR cone-beam computed tomography (CBCT) system (Quantitative Radiology, Verona, Italy). Manual tracings were performed on acetate sheets, while digital analysis employed the AudaxCeph® software (Audax d.

View Article and Find Full Text PDF

Similar Publications

Dairy DigiD: a keypoint-based deep learning system for classifying dairy cattle by physiological and reproductive status.

Front Artif Intell

August 2025

Faculty of Computer Science, Dalhousie University, Halifax, NS, Canada.

Shubhangi Mahato , Hanqing Bi , Suresh Neethirajan

Precision livestock farming increasingly relies on non-invasive, high-fidelity systems capable of monitoring cattle with minimal disruption to behavior or welfare. Conventional identification methods, such as ear tags and wearable sensors, often compromise animal comfort and produce inconsistent data under real-world farm conditions. This study introduces Dairy DigiD, a deep learning-based biometric classification framework that categorizes dairy cattle into four physiologically defineda groups-young, mature milking, pregnant, and dry cows-using high-resolution facial images.

View Article and Find Full Text PDF

Similar Publications

Sparc Suppresses Microglial Neuroinflammation and Promotes Axonal Regeneration by Interacting With Uba52.

Front Biosci (Landmark Ed)

August 2025

Department of Spine Surgery, Zhongda Hospital Southeast University, 210009 Nanjing, Jiangsu, China.

Hangyu Ji , Kun Wang , Yili Hu , Yefu Xu , Jiangkai Yu

Background: After spinal cord injury (SCI), pro-inflammatory microglia accumulate and impede axonal regeneration. We explored whether secreted protein acidic and rich in cysteine (Sparc) restrains microglial inflammation and fosters neurite outgrowth.

Methods: Mouse microglial BV2 cells were polarized to a pro-inflammatory phenotype with lipopolysaccharides (LPSs).

View Article and Find Full Text PDF

Similar Publications