Citations

20

Article Abstract

Motivation: The number of significantly associated regions reported in genome-wide association studies (GWAS) for polygenic traits typically increases with sample size. A traditional tool for quality control and identification of significant regions has been a visual inspection of how significant and correlated genetic variants cluster within a region. However, while inspecting hundreds of regions, this subjective method can misattribute significance to some loci or neglect others that are significant.

Results: The GWAS quality score (GQS) identifies suspicious regions and prevents erroneous interpretations with an objective, quantitative and automated method. The GQS assesses all measured single nucleotide polymorphisms (SNPs) that are linked by inheritance to each other [linkage disequilibrium (LD)] and compares the significance of trait association of each SNP to its LD value for the reported index SNP. A GQS value of 1.0 ascribes a high level of confidence to the entire region and its underlying gene(s), while GQS values <1.0 indicate the need to closely inspect the outliers. We applied the GQS to published and non-published genome-wide summary statistics and report suspicious regions requiring secondary inspection while supporting the majority of reported regions from large-scale published meta-analyses.
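
The comparison described above can be illustrated with a minimal sketch. This is not the published GQS formula, which the abstract does not state. It assumes, as a rough approximation, that a SNP's expected significance (-log10 p) scales with its LD (r²) to the index SNP, and it reports the fraction of linked SNPs that stay within a tolerance of that expectation. The function name `region_quality_sketch`, the parameters `tolerance` and `min_r2`, and the example data are all hypothetical.

```python
import math


def region_quality_sketch(snps, index_p, tolerance=0.5, min_r2=0.2):
    """Illustrative region score; NOT the published GQS definition.

    snps: list of (snp_id, p_value, r2_with_index) tuples.
    index_p: p-value of the reported index SNP.
    tolerance, min_r2: hypothetical tuning parameters for this sketch.
    """
    index_strength = -math.log10(index_p)
    outliers = []
    assessed = 0
    for snp_id, p_value, r2 in snps:
        if r2 < min_r2:
            continue  # weakly linked SNPs are not assessed in this sketch
        assessed += 1
        observed = -math.log10(p_value)
        # Assumption: expected significance is proportional to r2.
        expected = r2 * index_strength
        if abs(observed - expected) > tolerance * index_strength:
            outliers.append(snp_id)
    # A score of 1.0 means no assessed SNP deviates from expectation.
    score = 1.0 if assessed == 0 else 1.0 - len(outliers) / assessed
    return score, outliers


# Made-up region: rs3 is in high LD with the index SNP but shows no
# association, so it is flagged for closer inspection.
example_snps = [
    ("rs1", 1e-9, 0.90),
    ("rs2", 1e-6, 0.60),
    ("rs3", 0.5, 0.95),
    ("rs4", 1e-3, 0.10),
]
score, outliers = region_quality_sketch(example_snps, index_p=1e-10)
```

In this toy region the score falls below 1.0 because one highly linked SNP is far less significant than its LD would suggest, which is the kind of outlier the abstract says warrants secondary inspection.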

Availability and Implementation: The GQS code and scripts can be cloned from GitHub (https://github.com/Xswapnil/GQS/). The analyst can use whole-genome summary statistics to estimate the GQS for each defined region. We also provide an online tool (http://35.227.18.38/) that gives access to the GQS. Together, the quantitative quality measure provided by the GQS and its visualization offer an objective way to increase confidence in each genomic hit.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9891241
DOI: http://dx.doi.org/10.1093/bioinformatics/btad004

Similar Publications

Liver abscesses are a concern in feedlot cattle, and little is known about the role of genetics in their development. This study aimed to estimate genetic parameters and to identify single nucleotide polymorphisms (SNP) associated with liver abscesses. Crossbred cattle representing 18 breeds in the United States Meat Animal Research Center Germplasm Evaluation Program were phenotyped for liver abscesses at slaughter (n = 9,044).

Increasing evidence indicates a potential link between macrophage colony-stimulating factor 1 (CSF1) and macrophage migration inhibitory factor (MIF) with nonalcoholic fatty liver disease (NAFLD). However, the causal relationships remain unclear. This study aims to clarify the causal associations between CSF1, MIF, and NAFLD using Mendelian randomization (MR) analysis.

Epidemiological studies have already established associations between air pollutants and adverse health outcomes, but the causal associations between air pollutants and chest pain (CP) and gingival pain (GP) remain unclear. This study aimed to explore the potential causal effects of air pollutants on CP and GP. Utilizing genome-wide association study summary statistics from European-ancestry populations, we conducted bidirectional two-sample Mendelian randomization (MR) analyses.

Bacterial leaf streak (BLS), caused by Xanthomonas translucens pv. undulosa, has recently emerged as a significant threat to wheat production in the Northern Great Plains region of the US. Deploying resistant cultivars is an economical and practical method of controlling BLS.

Next-generation sequencing has greatly advanced genomics, enabling large-scale studies of population genetics and complex traits. Genomic DNA (gDNA) from white blood cells has traditionally been the main data source, but cell-free DNA (cfDNA), found in bodily fluids as fragmented DNA, is increasingly recognized as a valuable biomarker in clinical and genetic studies. However, a direct comparison between cfDNA and gDNA has not been fully explored.
