Domain Generalization with Correlated Style Uncertainty.

IEEE Winter Conf Appl Comput Vis

Machine & Hybrid Intelligence Lab, Northwestern University, USA.

Published: January 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Domain generalization (DG) approaches intend to extract domain invariant features that can lead to a more robust deep learning model. In this regard, style augmentation is a strong DG method taking advantage of instance-specific feature statistics containing informative style characteristics to synthetic novel domains. While it is one of the state-of-the-art methods, prior works on style augmentation have either disregarded the interdependence amongst distinct feature channels or have solely constrained style augmentation to linear interpolation. To address these research gaps, in this work, we introduce a novel augmentation approach, named Correlated Style Uncertainty (CSU), surpassing the limitations of linear interpolation in style statistic space and simultaneously preserving vital correlation information. Our method's efficacy is established through extensive experimentation on diverse cross-domain computer vision and medical imaging classification tasks: PACS, Office-Home, and Camelyon17 datasets, and the Duke-Market1501 instance retrieval task. The results showcase a remarkable improvement margin over existing state-of-the-art techniques. The source code is available https://github.com/freshman97/CSU.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11230655PMC
http://dx.doi.org/10.1109/wacv57701.2024.00200DOI Listing

Publication Analysis

Top Keywords

style augmentation
12
domain generalization
8
correlated style
8
style uncertainty
8
linear interpolation
8
style
7
generalization correlated
4
uncertainty domain
4
generalization approaches
4
approaches intend
4

Similar Publications

Skinspan: A Holistic Roadmap for Extending Skin Longevity With Evidence-Based Interventions.

J Cosmet Dermatol

September 2025

Cosmetic Laser Dermatology, San Diego, California, USA.

Background: With the rise of regenerative medicine and geroscience, translational research has shifted focus from lifespan to healthspan-years lived in good health. Applied to aesthetic medicine, the authors introduce the concept of "skinspan," to both describe the period during which skin maintains a youthful, healthy appearance, and additionally to serve as a tool for the cosmetic consult.

Aims: The aim of this comprehensive review is to illuminate "skinspan" as a framework for guiding long-term skin health.

View Article and Find Full Text PDF

Introduction: Implant-based breast reconstruction after skin-sparing mastectomy remains one of the most frequently used methods of breast reconstruction in the US. Patients with large, ptotic breasts often face poorer outcomes. We hypothesized that implant-based breast reconstruction with auto-augmentation techniques can minimize problems with acellular dermal matrices (ADM) by using less, and providing the benefit of prepectoral placement.

View Article and Find Full Text PDF

Diffusion Model-based Medical Image Generation as a Potential Data Augmentation Strategy for AI Applications.

Curr Med Imaging

September 2025

Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China.

Introduction: This study explored a generative image synthesis method based on diffusion models, potentially providing a low-cost and high-efficiency training data augmentation strategy for medical artificial intelligence (AI) applications.

Methods: The MedMNIST v2 dataset was utilized as a small-volume training dataset under low-performance computing conditions. Based on the characteristics of existing samples, new medical images were synthesized using the proposed annotated diffusion model.

View Article and Find Full Text PDF

Introduction: OpenStreetMap (OSM) road surface data is critical for navigation, infrastructure monitoring, and urban planning but is often incomplete or inconsistent. This study addresses the need for automated validation and classification of road surfaces by leveraging high-resolution aerial imagery and deep learning techniques.

Methods: We propose a MaskCNN-based deep learning model enhanced with attention mechanisms and a hierarchical loss function to classify road surfaces into four types: asphalt, concrete, gravel, and dirt.

View Article and Find Full Text PDF

Falling is a common but fatal human behavior in life. With the rapid growth of the aging population, fall-related human behavior recognition has been extensively investigated using radar. Nevertheless, human behavior recognition frequently exhibits suboptimal generalization capabilities due to the scarcity of labeled data.

View Article and Find Full Text PDF