Vector Quantization-Based Clustered Federated Learning With Global Feature Anchors for Improved Representation and Generalization.

Xiaohong Chen , Yuhang Zhang , Xuesong Xu , Dongbin Hu , Guanying Xu

IEEE Trans Neural Netw Learn Syst

Published: July 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Clustered federated learning (CFL) addresses the challenge of data heterogeneity in federated learning (FL) by customizing models for different groups of clients. However, existing CFL methods heavily rely on indirect metrics, such as model parameters, gradient information, or loss function values, for client clustering. These approaches often fail to fully capture the diversity and intrinsic characteristics of client data distributions, leading to inaccurate representations of client data features. To address this issue, we propose a novel CFL framework called vector quantization-based CFL (VQCFL). First, we introduce a vector quantization network (VQNet), which effectively captures the intrinsic structure of client data by mapping the local feature space into discrete feature dictionary vectors. In addition, to prevent drift in the feature dictionary vectors, we propose a global feature anchor strategy that aligns feature dictionary vectors across clients, ensuring consistent updates within the same feature space. Furthermore, we present a novel cross-cluster knowledge-sharing mechanism that integrates feature information from different clusters through global aggregation of feature dictionary vectors. Combined with a personalized cross-cluster classifier weight adjustment strategy, this mechanism significantly enhances the model's generalization performance in the presence of mixed data heterogeneity. Experimental results under various settings demonstrate that VQCFL achieves superior local personalization and global generalization performance.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TNNLS.2025.3589186	DOI Listing

Publication Analysis

Top Keywords

feature dictionary

dictionary vectors

federated learning

client data

feature

vector quantization-based

clustered federated

global feature

data heterogeneity

feature space

Similar Publications

Multi-compartment diffusion-relaxation MR signal representation in the spherical 3D-SHORE basis.

Comput Biol Med

September 2025

Laboratorio de Procesado de Imagen (LPI), ETSI Telecomunicación, Universidad de Valladolid, Valladolid, Spain. Electronic address:

Fabian Bogusz , Tomasz Pieciak

Modelling the diffusion-relaxation magnetic resonance (MR) signal obtained from multi-parametric sequences has recently gained immense interest in the community due to new techniques significantly reducing data acquisition time. A preferred approach for examining the diffusion-relaxation MR data is to follow the continuum modelling principle that employs kernels to represent the tissue features, such as the relaxations or diffusion properties. However, constructing reasonable dictionaries with predefined signal components depends on the sampling density of model parameter space, thus leading to a geometrical increase in the number of atoms per extra tissue parameter considered in the model.

View Article and Find Full Text PDF

Similar Publications

Interpretable EEG biomarkers for neurological disease models in mice using bag-of-waves classifiers.

bioRxiv

August 2025

Department of Electrical and Computer Engineering, Department of Computer and Information Sciences, University of Delaware, Newark, Delaware, USA.

Maria Isabel Cano Achuri , Montana Kay Lara , Khalil Abed Rabbo , Benjamin T Wilson , Austin Meek

Electroencephalograms (EEGs) are time-series records of the electrical potential from collective neural activity in the brain. EEG waveform patterns-rhythmic and irregular oscillations and transient patterns of sharp waves or spikes-are potential phenotypical biomarkers, reflecting genotype-specific neural activity. This is especially relevant to diagnosing epilepsy without direct seizure observations, which is common in clinical settings, as well as in animal models, which often have subtle neurological phenotypes without overt epilepsy.

View Article and Find Full Text PDF

Similar Publications

The fruits of data shepherding: A collection of open FAIR datasets for titanium dioxide coated photocatalytic surfaces.

NanoImpact

August 2025

CNR-ISSMC Istituto di Scienza e Tecnologia dei Materiali Ceramici, Via Granarolo, 64, 48018 Faenza, RA, Italy. Electronic address:

Irini Furxhi , Massimo Perucca , Giovanni Baldi , Valentina Dami , Andrea Cioni

This paper presents a large-scale collaborative effort within a multi-partner consortium, to systematically structure, curate, and openly share data in alignment with the FAIR principles. The data result from a case study of titanium dioxide (TiO₂) nanomaterials (NMs) for photocatalytic depolluting surfaces, produced via various spray coating techniques under the Safe and Sustainable by Design (SSbD) approach. The data are publicly available through a dedicated Zenodo community (https://zenodo.

View Article and Find Full Text PDF

Similar Publications

Deep dictionary learning with reconstruction for texture recognition.

Sci Rep

August 2025

School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China.

Pengwen Xiong , Ke Zhang , Zhi Shi , MengChu Zhou , Aiguo Song

Texture recognition underpins critical applications in industrial quality control, robotic manipulation, and biomedical imaging. Traditional deep dictionary learning methods for texture recognition often emphasize deep feature extraction. However, they tend to lose crucial features as model depth increases, which can reduce their overall effectiveness.

View Article and Find Full Text PDF

Similar Publications

Learning Compact Discriminant Representation via Low-rank Bilinear Pooling.

IEEE Trans Pattern Anal Mach Intell

August 2025

Kun Song , Hao Li , Gong Cheng , Junwei Han , Feiping Nie

In this paper, we explain the mechanism of bilinear pooling as a module of hard sample generation, and find that bilinear pooling significantly expands variances of the first-order vectors when it produces discriminative bilinear features. In conjunction with the extremely high dimensionality of the obtained bilinear features, those variances lead to overfitting in subsequent learning models. To solve this issue, we construct a bi-level optimization problem, where the high-level problem is the supervised classification loss, and the low-level problem is the principal component analysis (PCA).

View Article and Find Full Text PDF

Similar Publications