PCANet-Based Structural Representation for Nonrigid Multimodal Medical Image Registration.

Sensors (Basel)

Department of Biomedical Engineering, School of Life Science and Technology, Ministry of Education Key Laboratory of Molecular Biophysics, Huazhong University of Science and Technology, No 1037, Luoyu Road, Wuhan 430074, China.

Published: May 2018


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Nonrigid multimodal image registration remains a challenging task in medical image processing and analysis. The structural representation (SR)-based registration methods have attracted much attention recently. However, the existing SR methods cannot provide satisfactory registration accuracy due to the utilization of hand-designed features for structural representation. To address this problem, the structural representation method based on the improved version of the simple deep learning network named PCANet is proposed for medical image registration. In the proposed method, PCANet is firstly trained on numerous medical images to learn convolution kernels for this network. Then, a pair of input medical images to be registered is processed by the learned PCANet. The features extracted by various layers in the PCANet are fused to produce multilevel features. The structural representation images are constructed for two input images based on nonlinear transformation of these multilevel features. The Euclidean distance between structural representation images is calculated and used as the similarity metrics. The objective function defined by the similarity metrics is optimized by L-BFGS method to obtain parameters of the free-form deformation (FFD) model. Extensive experiments on simulated and real multimodal image datasets show that compared with the state-of-the-art registration methods, such as modality-independent neighborhood descriptor (MIND), normalized mutual information (NMI), Weber local descriptor (WLD), and the sum of squared differences on entropy images (ESSD), the proposed method provides better registration performance in terms of target registration error (TRE) and subjective human vision.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5982469PMC
http://dx.doi.org/10.3390/s18051477DOI Listing

Publication Analysis

Top Keywords

structural representation
24
medical image
12
image registration
12
nonrigid multimodal
8
registration
8
multimodal image
8
registration methods
8
features structural
8
proposed method
8
medical images
8

Similar Publications

Background: According to the Council of International Organizations and Medical Sciences (CIOMS) 2016, post-trial access (PTA) refers to the ethical imperative that requires the sponsor, researchers, and relevant public health authority, "to make available as soon as possible any intervention or product developed, and knowledge generated, for the population or community in which the research is carried out." Law, policy, and practical guidance for PTA has so far been vague but has recently attracted and increased attention in the context of benefit sharing of scientific research results with low- and middle-income countries (LMICs).Although the number of clinical trials conducted in the Sub Saharan (SSA) countries has increased in the past two decades, plans and practices for PTA are underreported.

View Article and Find Full Text PDF

Active use of latent tree-structured sentence representation in humans and large language models.

Nat Hum Behav

September 2025

Key Laboratory for Biomedical Engineering of Ministry of Education, College of Biomedical Engineering and Instrument Sciences, Zhejiang University, Hangzhou, China.

Understanding how sentences are represented in the human brain, as well as in large language models (LLMs), poses a substantial challenge for cognitive science. Here we develop a one-shot learning task to investigate whether humans and LLMs encode tree-structured constituents within sentences. Participants (total N = 372, native Chinese or English speakers, and bilingual in Chinese and English) and LLMs (for example, ChatGPT) were asked to infer which words should be deleted from a sentence.

View Article and Find Full Text PDF

EndoChat: Grounded multimodal large language model for endoscopic surgery.

Med Image Anal

August 2025

The Chinese University of Hong Kong, 999077, Hong Kong Special Administrative Region of China. Electronic address:

Recently, Multimodal Large Language Models (MLLMs) have demonstrated their immense potential in computer-aided diagnosis and decision-making. In the context of robotic-assisted surgery, MLLMs can serve as effective tools for surgical training and guidance. However, there is still a deficiency of MLLMs specialized for surgical scene understanding in endoscopic procedures.

View Article and Find Full Text PDF

Expanding conceptualizations of beauty can promote positive body image and reduce appearance-related concerns. In collectivist cultures, beauty is often perceived through both appearance and inner or social qualities. This study examined the psychometric properties of the Thai version of the Broad Conceptualization of Beauty Scale (TH-BCBS), a culturally adapted measure reflecting Thai women's inclusive beauty beliefs.

View Article and Find Full Text PDF

Cortical networks with multiple interneuron types generate oscillatory patterns during predictive coding.

PLoS Comput Biol

September 2025

Faculty of Science, Cognitive and Systems Neuroscience Group, Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, the Netherlands.

Predictive coding (PC) proposes that our brains work as an inference machine, generating an internal model of the world and minimizing predictions errors (i.e., differences between external sensory evidence and internal prediction signals).

View Article and Find Full Text PDF