Silver: Forging almost Gold Standard Datasets.

Genes (Basel)

Department of Computer Science, University of Saskatchewan, Saskatoon, SK S7N 5C9, Canada.

Published: September 2021


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Gene set analysis has been widely used to gain insight from high-throughput expression studies. Although various tools and methods have been developed for gene set analysis, there is no consensus among researchers regarding best practice(s). Most often, evaluation studies have reported contradictory recommendations of which methods are superior. Therefore, an unbiased quantitative framework for evaluations of gene set analysis methods will be valuable. Such a framework requires gene expression datasets where enrichment status of gene sets is known . In the absence of such gold standard datasets, artificial datasets are commonly used for evaluations of gene set analysis methods; however, they often rely on oversimplifying assumptions that make them biased in favor of or against a given method. In this paper, we propose a quantitative framework for evaluation of gene set analysis methods by synthesizing expression datasets using real data, without relying on oversimplifying or unrealistic assumptions, while preserving complex gene-gene correlations and retaining the distribution of expression values. The utility of the quantitative approach is shown by evaluating ten widely used gene set analysis methods. An implementation of the proposed method is publicly available. We suggest using Silver to evaluate existing and new gene set analysis methods. Evaluation using Silver provides a better understanding of current methods and can aid in the development of gene set analysis methods to achieve higher specificity without sacrificing sensitivity.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8535810PMC
http://dx.doi.org/10.3390/genes12101523DOI Listing

Publication Analysis

Top Keywords

gene set
32
set analysis
32
analysis methods
24
gene
10
methods
9
gold standard
8
standard datasets
8
set
8
analysis
8
quantitative framework
8

Similar Publications

Gepotidacin, a novel, bactericidal, first-in-class triazaacenaphthylene antibacterial, was noninferior to nitrofurantoin in two pivotal trials (EAGLE-2 and EAGLE-3) in females with uncomplicated urinary tract infections (uUTIs). Using pooled data, gepotidacin activity and clinical efficacy were evaluated for subsets of molecularly characterized isolates in the microbiological Intent-to-Treat population. The subsets of isolates were characterized based on phenotypic/MIC criteria; all microbiological failure isolates were also characterized.

View Article and Find Full Text PDF

Biofilm lifestyle across different lineages of ammonia-oxidizing archaea.

ISME J

September 2025

Department of Functional and Evolutionary Ecology, Archaea Biology and Ecogenomics Unit, University of Vienna, Djerassiplatz 1, 1030 Vienna, Austria.

Although ammonia-oxidizing archaea (AOA) are globally distributed in nature, growth in biofilms has been relatively little explored. Here we investigated six representatives of three different terrestrial and marine clades of AOA in a longitudinal and quantitative study for their ability to form biofilm, and studied gene expression patterns of three representatives. Although all strains grew on a solid surface, soil strains of the genera Nitrosocosmicus and Nitrososphaera exhibited the highest capacity for biofilm formation.

View Article and Find Full Text PDF

Nucleosome repositioning is essential for establishing nucleosome-depleted regions to initiate transcription. This process has been extensively studied using structural, biochemical, and single-molecule approaches, which require homogeneously positioned nucleosomes. This is often achieved using the Widom 601 sequence, a highly efficient nucleosome-positioning element (NPE) selected for its unusually strong binding to the H3-H4 histone tetramer.

View Article and Find Full Text PDF

Objective: The key molecular events signifying the -induced gastric carcinogenesis process are largely unknown.

Methods: Bulk tissue-proteomics profiling were leveraged across multi-stage gastric lesions from Linqu ( = 166) and Beijing sets ( = 99) and single-cell transcriptomic profiling ( = 18) to decipher key molecular signatures of -related gastric lesion progression and gastric cancer (GC) development. The association of key proteins association with gastric lesion progression and GC development were prospectively studied building on follow-up of the Linqu set and UK Biobank ( = 48,529).

View Article and Find Full Text PDF

Background: Colorectal cancer (CRC) is a complex, heterogeneous disease characterized by frequent relapses and metastasis. Previous studies have reported that the invasion and progression of CRC in several cases can be controlled by targeting fusion genes. This study aimed to screen for potent fusion transcripts as potential molecular biomarkers and therapeutic targets for metastatic CRC (mCRC) using an approach.

View Article and Find Full Text PDF