98%
921
2 minutes
20
Clustered federated learning (CFL) addresses the challenge of data heterogeneity in federated learning (FL) by customizing models for different groups of clients. However, existing CFL methods heavily rely on indirect metrics, such as model parameters, gradient information, or loss function values, for client clustering. These approaches often fail to fully capture the diversity and intrinsic characteristics of client data distributions, leading to inaccurate representations of client data features. To address this issue, we propose a novel CFL framework called vector quantization-based CFL (VQCFL). First, we introduce a vector quantization network (VQNet), which effectively captures the intrinsic structure of client data by mapping the local feature space into discrete feature dictionary vectors. In addition, to prevent drift in the feature dictionary vectors, we propose a global feature anchor strategy that aligns feature dictionary vectors across clients, ensuring consistent updates within the same feature space. Furthermore, we present a novel cross-cluster knowledge-sharing mechanism that integrates feature information from different clusters through global aggregation of feature dictionary vectors. Combined with a personalized cross-cluster classifier weight adjustment strategy, this mechanism significantly enhances the model's generalization performance in the presence of mixed data heterogeneity. Experimental results under various settings demonstrate that VQCFL achieves superior local personalization and global generalization performance.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TNNLS.2025.3589186 | DOI Listing |
Comput Biol Med
September 2025
Laboratorio de Procesado de Imagen (LPI), ETSI Telecomunicación, Universidad de Valladolid, Valladolid, Spain. Electronic address:
Modelling the diffusion-relaxation magnetic resonance (MR) signal obtained from multi-parametric sequences has recently gained immense interest in the community due to new techniques significantly reducing data acquisition time. A preferred approach for examining the diffusion-relaxation MR data is to follow the continuum modelling principle that employs kernels to represent the tissue features, such as the relaxations or diffusion properties. However, constructing reasonable dictionaries with predefined signal components depends on the sampling density of model parameter space, thus leading to a geometrical increase in the number of atoms per extra tissue parameter considered in the model.
View Article and Find Full Text PDFbioRxiv
August 2025
Department of Electrical and Computer Engineering, Department of Computer and Information Sciences, University of Delaware, Newark, Delaware, USA.
Electroencephalograms (EEGs) are time-series records of the electrical potential from collective neural activity in the brain. EEG waveform patterns-rhythmic and irregular oscillations and transient patterns of sharp waves or spikes-are potential phenotypical biomarkers, reflecting genotype-specific neural activity. This is especially relevant to diagnosing epilepsy without direct seizure observations, which is common in clinical settings, as well as in animal models, which often have subtle neurological phenotypes without overt epilepsy.
View Article and Find Full Text PDFNanoImpact
August 2025
CNR-ISSMC Istituto di Scienza e Tecnologia dei Materiali Ceramici, Via Granarolo, 64, 48018 Faenza, RA, Italy. Electronic address:
This paper presents a large-scale collaborative effort within a multi-partner consortium, to systematically structure, curate, and openly share data in alignment with the FAIR principles. The data result from a case study of titanium dioxide (TiO₂) nanomaterials (NMs) for photocatalytic depolluting surfaces, produced via various spray coating techniques under the Safe and Sustainable by Design (SSbD) approach. The data are publicly available through a dedicated Zenodo community (https://zenodo.
View Article and Find Full Text PDFSci Rep
August 2025
School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China.
Texture recognition underpins critical applications in industrial quality control, robotic manipulation, and biomedical imaging. Traditional deep dictionary learning methods for texture recognition often emphasize deep feature extraction. However, they tend to lose crucial features as model depth increases, which can reduce their overall effectiveness.
View Article and Find Full Text PDFIEEE Trans Pattern Anal Mach Intell
August 2025
In this paper, we explain the mechanism of bilinear pooling as a module of hard sample generation, and find that bilinear pooling significantly expands variances of the first-order vectors when it produces discriminative bilinear features. In conjunction with the extremely high dimensionality of the obtained bilinear features, those variances lead to overfitting in subsequent learning models. To solve this issue, we construct a bi-level optimization problem, where the high-level problem is the supervised classification loss, and the low-level problem is the principal component analysis (PCA).
View Article and Find Full Text PDF