Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Introduction: Synthetic data generation is a rapidly evolving field, with significant potential for improving data privacy. However, evaluating the performance of synthetic data generation methods, especially the tradeoff between fairness and utility of the generated data, remains a challenge.

Methodology: In this work, we present our comprehensive framework, which evaluates fair synthetic data generation methods, benchmarking them against state-of-the-art synthesizers.

Results: The proposed framework consists of selection, evaluation, and application components that assess fairness, utility, and resemblance in real-world scenarios. The framework was applied to state-of-the-art data synthesizers, including TabFairGAN, DECAF, TVAE, and CTGAN, using a publicly available medical dataset.

Discussion: The results reveal the strengths and limitations of each synthesizer, including their bias mitigation strategies and trade-offs between fairness and utility, thereby showing the framework's effectiveness. The proposed framework offers valuable insights into the fairness-utility tradeoff and evaluation of synthetic data generation methods, with far-reaching implications for various applications in the medical domain and beyond.

Conclusion: The findings demonstrate the importance of considering fairness in synthetic data generation and the need for fairness focused evaluation frameworks, highlighting the significance of continued research in this area.

Download full-text PDF

Source
http://dx.doi.org/10.3233/SHTI251376DOI Listing

Publication Analysis

Top Keywords

synthetic data
20
data generation
20
generation methods
12
fairness utility
12
data
9
fairness synthetic
8
proposed framework
8
fairness
6
framework
5
synthetic
5

Similar Publications

The development of synthetically-useful biocatalysts requires characterizing the behavior of an enzyme under conditions amenable to preparative-scale reactions. Whole cells harboring the catalyst of interest are often used in such reactions, as protein purification is laborious and expensive. However, monitoring reaction rates when using whole cells is challenging, as cellular debris precludes the use of a continuous assay.

View Article and Find Full Text PDF

Purpose: Hormonal contraceptives are linked to a higher prevalence of depressive symptoms. Given their popularity in Western countries, understanding the biochemical effects on neuronal cells is crucial to minimizing mental health risks.

Experimental Design: Neural progenitor cells were treated with ethinyl estradiol (EE) and levonorgestrel (LNG), two synthetic sex hormones commonly used in oral contraception, and S-23, a selective androgen receptor modulator developed as a potential synthetic sex hormone for male hormonal contraception.

View Article and Find Full Text PDF

Sports injury surveillance programs have been vital in advancing the understanding of injury epidemiology across various athlete populations. Surveillance-based epidemiological measures of injury occurrence are ubiquitous in the sports medicine literature, and the injury rate is one such commonly used measure. Traditional approaches to calculating injury rates have predominantly relied on frequentist methods, which, while informative, have limitations in addressing certain practical questions.

View Article and Find Full Text PDF

The chimpanzee adenovirus-vectored vaccine developed by the University of Oxford (ChAdOx1 nCoV-19) showed good stability when stored in refrigerator. However, the vaccine manufacturer prefers its transportation in frozen condition. Data regarding the stability of the vaccine after exposure to repeated freezing processes have not been explored yet.

View Article and Find Full Text PDF