PAC-Bayes Guarantees for Data-Adaptive Pairwise Learning.

Entropy (Basel)

School of Computer Science, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

We study the generalization properties of stochastic optimization methods under adaptive data sampling schemes, focusing on the setting of pairwise learning, which is central to tasks like ranking, metric learning, and AUC maximization. Unlike pointwise learning, pairwise methods must address statistical dependencies between input pairs-a challenge that existing analyses do not adequately handle when sampling is adaptive. In this work, we extend a general framework that integrates two algorithm-dependent approaches-algorithmic stability and PAC-Bayes analysis for this purpose. Specifically, we examine (1) Pairwise Stochastic Gradient Descent (Pairwise SGD), widely used across machine learning applications, and (2) Pairwise Stochastic Gradient Descent Ascent (Pairwise SGDA), common in adversarial training. Our analysis avoids artificial randomization and leverages the inherent stochasticity of gradient updates instead. Our results yield generalization guarantees of order n-1/2 under non-uniform adaptive sampling strategies, covering both smooth and non-smooth convex settings. We believe these findings address a significant gap in the theory of pairwise learning with adaptive sampling.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12385679PMC
http://dx.doi.org/10.3390/e27080845DOI Listing

Publication Analysis

Top Keywords

pairwise learning
12
pairwise
8
pairwise stochastic
8
stochastic gradient
8
gradient descent
8
adaptive sampling
8
learning
6
pac-bayes guarantees
4
guarantees data-adaptive
4
data-adaptive pairwise
4

Similar Publications

Pretraining plays a pivotal role in acquiring generalized knowledge from large-scale data, achieving remarkable successes as evidenced by large models in CV and NLP. However, progress in the graph domain remains limited due to fundamental challenges represented by feature heterogeneity and structural heterogeneity. Recent efforts have been made to address feature heterogeneity via Large Language Models (LLMs) on text-attributed graphs (TAGs) by generating fixed-length text representations as node features.

View Article and Find Full Text PDF

Radiomics nomogram from multiparametric magnetic resonance imaging for preoperative prediction of substantial lymphovascular space invasion in endometrial cancer.

Abdom Radiol (NY)

September 2025

Department of Radiology, Fujian Maternity and Child Health Hospital, College of Clinical Medicine for Obstetrics and Gynecology and Pediatrics, Fujian Medical University, Fuzhou, China.

Background: We aimed to develop and validate a radiomics-based machine learning nomogram using multiparametric magnetic resonance imaging to preoperatively predict substantial lymphovascular space invasion in patients with endometrial cancer.

Methods: This retrospective dual-center study included patients with histologically confirmed endometrial cancer who underwent preoperative magnetic resonance imaging (MRI). The patients were divided into training and test sets.

View Article and Find Full Text PDF

Background: Within public online forums, patients often seek reassurance and guidance from the community regarding postoperative symptoms and expectations, and when to seek medical assistance. Others are using artificial intelligence in the form of online search engines or chatbots such as ChatGPT or Perplexity. Artificial intelligence chatbot assistants have been growing in popularity; however, clinicians may be hesitant to use them because of concerns about accuracy.

View Article and Find Full Text PDF

Purpose: Heterogeneity among people diagnosed with schizophrenia-spectrum disorders (schizophrenia) and high prevalence of co-occurring disorders makes identification of optimal treatments difficult. This study identified behavioral health phenotypes using machine learning with Medicaid claims of adults with schizophrenia. We compared the phenotypes' clinical outcomes and psychotropic medication prescription patterns for clinical validity.

View Article and Find Full Text PDF

Road traffic crashes claim around 1.19 million lives annually worldwide, with over half of the fatalities involving vulnerable road users (VRUs). While several studies have explored the risk factors associated with specific categories of VRUs in Pakistan, research focusing on VRUs collectively, considering all categories and their unique safety challenges, remains limited.

View Article and Find Full Text PDF